Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2020 Sep 28;117(42):25966–25974. doi: 10.1073/pnas.1910416117

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

Published under the PNAS license.

PMC Copyright notice

Fig. 3. — Query-based attention (QBA). To constrain the interpretation of the word “bat” in the context “John hit the ball with the __,” a query generated from “bat” can be used to construct a weighted attention vector, which shapes the word’s interpretation. The query is compared with each of the learned representation vectors (RepVs) of the context words; this creates a set of similarity scores (Sims), which in turn, produce a set of weightings (Ws; a set of positive numbers summing to one). The Ws are used to scale the RepVs of the context words, creating Scaled RepVs. The weighted attention vector is the element-wise sum of the Scaled RepVs. The Query, RepVs, Sims, Scaled RepVs, and weighted attention vector use red color intensity for positive magnitudes and blue for negative magnitudes. Ws are shown as green color intensity. White = 0 throughout. The Query and RepVs were made up for illustration, inspired by ref. 24. Mathematical details: for query $q$ and representation vector $v_{j}$ for context word $j$ , the similarity score $s_{j}$ is $\cos (q, v_{j})$ . The $s_{j}$ are converted into weightings $w_{j}$ by the softmax function, $w_{j} = e^{(g s_{j})} / (Σ_{j'} e^{(g s_{j'})})$ , where the sum in the denominator runs over all words in the context span, and $g$ is a scale factor.