Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

ArXiv logoLink to ArXiv
[Preprint]. 2024 Oct 17:arXiv:2410.10899v2. [Version 2]

GPTON: Generative Pre-trained Transformers enhanced with Ontology Narration for accurate annotation of biological data

Rongbin Li, Wenbo Chen, Jinbo Li, Hanwen Xing, Hua Xu, Zhao Li, W Jim Zheng
PMCID: PMC11527096  PMID: 39483354

Abstract

By leveraging GPT-4 for ontology narration, we developed GPTON to infuse structured knowledge into LLMs through verbalized ontology terms, achieving accurate text and ontology annotations for over 68% of gene sets in the top five predictions. Manual evaluations confirm GPTON's robustness, highlighting its potential to harness LLMs and structured knowledge to significantly advance biomedical research beyond gene set annotation.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.

25 pages, 6 figures


Articles from ArXiv are provided here courtesy of arXiv

RESOURCES