2022 Jul 8;3(7):100520. doi: 10.1016/j.patter.2022.100520

© 2022 The Author(s)

This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Comparison of different hierarchical architectures for classification models

After patch embedding, the feature map has size $h \times w \times c$, where $h$, $w$, and $c$ are its height, width, and number of channels. A patch merging operation sits between each pair of consecutive stages; it usually merges $2 \times 2$ patches and doubles the number of channels. The feature-map resolution differs across architectures: usually $h = H / 16$ for single-stage, $h = H / 7$ for two-stage, and $h = H / 4$ for pyramid designs, where $H$ and $W$ are the height and width of the input image.
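
The shape arithmetic in this caption can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `stage_shapes`, the 224-pixel input, the base channel counts, and the number of stages are hypothetical choices and are not taken from the article. Each patch merging step is assumed to halve the height and width (a $2 \times 2$ merge) and double the channels, as the caption describes.

```python
def stage_shapes(H, W, patch_stride, base_channels, num_stages):
    """Return the (h, w, c) feature-map shape at each stage.

    patch_stride is the downsampling factor of the patch embedding
    (16, 7, or 4 in the caption); base_channels and num_stages are
    illustrative assumptions, not values from the article.
    """
    # Shape right after patch embedding: h = H / stride, w = W / stride.
    h, w, c = H // patch_stride, W // patch_stride, base_channels
    shapes = [(h, w, c)]
    # One patch merging step between consecutive stages:
    # 2 x 2 patches are merged, so h and w halve and c doubles.
    for _ in range(num_stages - 1):
        h, w, c = h // 2, w // 2, c * 2
        shapes.append((h, w, c))
    return shapes


# Hypothetical configurations for a 224 x 224 input image.
single_stage = stage_shapes(224, 224, patch_stride=16, base_channels=768, num_stages=1)
two_stage = stage_shapes(224, 224, patch_stride=7, base_channels=64, num_stages=2)
pyramid = stage_shapes(224, 224, patch_stride=4, base_channels=96, num_stages=4)

for name, shapes in [("single stage", single_stage),
                     ("two stage", two_stage),
                     ("pyramid", pyramid)]:
    print(name, shapes)
```

With these assumed settings, the single-stage design keeps one resolution throughout, while the pyramid design starts at a finer resolution and ends at a coarser one with more channels.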