. 2023 Nov 15;9(11):248. doi: 10.3390/jimaging9110248

© 2023 by the authors.

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

The proposed marginalization-based method. A 2D feature sequence, $F = (F_{1, 1}, \dots, F_{H^{'}, W^{'}})$, is produced by a 2D feature extractor such as a ViT backbone. $F$ is fed to a linear layer to produce $S = (S_{1, 1}, \dots, S_{H^{'}, W^{'}})$, which is softmax-normalized jointly over the $H^{'}$ and $C$ dimensions. The normalized $U = (U_{1, 1}, \dots, U_{H^{'}, W^{'}})$ is then marginalized (summed) over the $H^{'}$ dimension to produce $P = (P_{1}, \dots, P_{W^{'}})$, which is fed to a CTC decoder. $D$ and $C$ are the feature and class dimensions, respectively.
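
The pipeline in the caption can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes the softmax is taken jointly over the $H^{'}$ and $C$ axes within each column, so that each $P_{w}$ is a distribution over the $C$ classes. The function names, the random weights, and the greedy best-path decoder with blank index 0 are hypothetical choices made for illustration; the caption does not specify which CTC decoder is used.

```python
import numpy as np


def marginalize_columns(F, weight, bias):
    """Map 2D features F of shape (H', W', D) to per-column class
    probabilities P of shape (W', C).

    weight (D, C) and bias (C,) stand in for the linear layer; they are
    illustrative parameters, not values from the paper.
    """
    # Linear layer applied at every (h, w) position: S has shape (H', W', C).
    S = F @ weight + bias
    # Assumed joint softmax over the H' and C axes, separately per column w.
    # Subtracting the per-column maximum keeps exp() numerically stable.
    S_max = S.max(axis=(0, 2), keepdims=True)
    E = np.exp(S - S_max)
    U = E / E.sum(axis=(0, 2), keepdims=True)
    # Marginalize (sum) over H'; each row of P now sums to 1 over classes.
    return U.sum(axis=0)


def greedy_ctc_decode(P, blank=0):
    """Best-path CTC decoding: take the argmax class per column, collapse
    consecutive repeats, then drop blanks. The blank index is an assumption."""
    decoded = []
    prev = None
    for k in P.argmax(axis=1):
        if k != prev and k != blank:
            decoded.append(int(k))
        prev = k
    return decoded


# Toy dimensions, chosen arbitrarily for the demonstration.
H, W, D, C = 4, 6, 8, 5
rng = np.random.default_rng(0)
F = rng.normal(size=(H, W, D))
weight = rng.normal(size=(D, C))
bias = np.zeros(C)

P = marginalize_columns(F, weight, bias)
print(P.shape)                # (6, 5): one class distribution per column
print(greedy_ctc_decode(P))   # label indices after collapsing and blank removal
```

Because the normalization runs over $H^{'}$ and $C$ together, summing $U$ over $H^{'}$ yields a valid distribution per column without a second normalization step, which is the input a CTC decoder expects.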