Figure - PMC

2024 Jun 3;2:51. doi: 10.1038/s44271-024-00091-8

© The Author(s) 2024

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Fig. 2

a Scatter plot of OpenAI models (excluding ChatGPT and GPT-4, because their log-scores are not available) as a function of the two main principal components (PC2 and PC3), calculated from an estimate of the KL divergence between models (see Methods for details). The size of each datapoint is proportional to the size of the model. Grey datapoints designate models excluded from the cognitive analysis, while red ones designate included models. Note that although PC1 by definition explained more variance, it captured only the separation between AD1 and the remaining models. Thus, although it is not apparent in this figure, AD1 was far removed from CDV2 and all other models along the PC1 axis.

b Scatter plots in which each datapoint represents an experimental vignette item (i.e., a verbal stimulus) for the Cognitive Reflection Test (CRT; cyan, blue) and the Linda/Bill problem (L/B; pink, purple). Points are plotted as a function of the parameters of an exponential function fitted to the log-probability. Light colors (pink, cyan) represent the canonical items of each test as currently known. Dark colors (purple, blue) identify the new items that we created for this study. Results are plotted for davinci-003.
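The projection in panel a could be approximated along the following lines. This is only an illustrative sketch and not the procedure from the article's Methods, which are not reproduced here. It assumes that each model's log-probabilities over a shared set of prompts are available, that each row of the pairwise KL matrix is used as a model's feature vector, and that PCA is done by SVD. The function names `pairwise_kl` and `pca_scores`, the array layout, and the toy data are all hypothetical.

```python
import numpy as np

def pairwise_kl(logp):
    """Estimate KL(model_i || model_j) for every pair of models.

    logp: array of shape (n_models, n_prompts, vocab) holding normalized
    log-probabilities (hypothetical layout, assumed for this sketch).
    """
    p = np.exp(logp)
    # log p_i(x) - log p_j(x) for every model pair, prompt, and token
    diff = logp[:, None, :, :] - logp[None, :, :, :]
    # Sum over the vocabulary, then average over prompts
    return (p[:, None, :, :] * diff).sum(axis=-1).mean(axis=-1)

def pca_scores(features, n_components=3):
    """Project rows of `features` onto their leading principal components."""
    centered = features - features.mean(axis=0, keepdims=True)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    explained = s**2 / np.sum(s**2)
    return u[:, :n_components] * s[:n_components], explained[:n_components]

# Toy data standing in for real model outputs (assumption: 5 models,
# 4 prompts, vocabulary of 6 tokens).
rng = np.random.default_rng(0)
logits = rng.normal(size=(5, 4, 6))
logp = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))

kl = pairwise_kl(logp)              # (5, 5) matrix, zero on the diagonal
scores, explained = pca_scores(kl)  # columns are PC1, PC2, PC3
pc2_pc3 = scores[:, 1:3]            # the two axes shown in panel a
```

Dropping the first column mirrors the caption's choice to plot PC2 and PC3 when PC1 mainly isolates a single outlying model.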