Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
editorial
. 2024 Dec 10;53(D1):D1–D9. doi: 10.1093/nar/gkae1220

The 2025 Nucleic Acids Research database issue and the online molecular biology database collection

Daniel J Rigden 1,, Xosé M Fernández 2
PMCID: PMC11701706  PMID: 39658041

Abstract

The 2025 Nucleic Acids Research database issue contains 185 papers spanning biology and related areas. Seventy three new databases are covered, while resources previously described in the issue account for 101 update articles. Databases most recently published elsewhere account for a further 11 papers. Nucleic acid databases include EXPRESSO for multi-omics of 3D genome structure (this issue’s chosen Breakthrough Resource and Article) and NAIRDB for Fourier transform infrared data. New protein databases include structure predictions for human isoforms at ASpdb and for viral proteins at BFVD. UniProt, Pfam and InterPro have all provided updates: metabolism and signalling are covered by new descriptions of STRING, KEGG and CAZy, while updated microbe-oriented databases include Enterobase, VFDB and PHI-base. Biomedical research is supported, among others, by ClinVar, PubChem and DrugMAP. Genomics-related resources include Ensembl, UCSC Genome Browser and dbSNP. New plant databases cover the Solanaceae (SolR) and Asteraceae (AMIR) families while an update from NCBI Taxonomy also features. The Database Issue is freely available on the Nucleic Acids Research website (https://academic.oup.com/nar). At the NAR online Molecular Biology Database Collection (http://www.oxfordjournals.org/nar/database/c/), 932 entries have been reviewed in the last year, 74 new resources added and 226 discontinued URLs eliminated bringing the current total to 2236 databases.

New and updated databases

The 32nd Nucleic Acids Research Database issue once again spans all of biology with a total of 185 papers. This year sees 73 new databases feature (Table 1) alongside 101 updated descriptions of resources previously covered in the issue. The final 11 are updates from databases most recently published elsewhere (Table 2). In customary fashion, annual overviews from the major database providers at the European Bioinformatics Institute (EBI), the U.S. National Center for Biotechnology Information (NCBI), and the National Genomics Data Center in China (1–3) lead off. The following papers are then grouped in the usual way: (i) nucleic acid sequence and structure, transcriptional regulation; (ii) protein sequence and structure, proteomics; (iii) metabolic and signalling pathways, enzymes and networks; (iv) genomics of viruses, bacteria, protozoa and fungi; (v) genomics of human and model organisms plus comparative genomics; (vi) human genomic variation, diseases and drugs; (vii) plants and (viii) other topics. As many databases range broadly, some papers defy easy categorisation and readers are encouraged to browse the full list.

Table 1.

Descriptions of new databases in the 2025 NAR Database issue

Database name URL Short description
AMIR https://yanglab.hzau.edu.cn/AMIR/ Asteraceae multi-omics information resource
ASpdb https://biodataai.uth.edu/ASpdb/ Structures of human protein isoforms
BAR https://bar.utoronto.ca Bio-Analytic Resource for Plant Biology
BFVD https://bfvd.foldseek.com/ AlphaFold-predicted structures of viral proteins
BGC Atlas https://bgc-atlas.cs.uni-tuebingen.de Biosynthetic gene clusters from metagenomes
CELLxGENE https://cellxgene.cziscience.com/ Curated, standardized single-cell transcriptomic data
Chem(Pro)2 https://idrblab.org/chemprosquare/ Chemoproteomic probes labelling human proteins
CircaKB https://cdsic.njau.edu.cn/CircaKB Circadian genes
CRISPRepi http://crisprepi.maolab.org/ CRISPR-mediated epigenome editing
CRISPRoffT https://ccsm.uth.edu/CRISPRoffT/ CRISPR/Cas off-target data
CVD Atlas https://ngdc.cncb.ac.cn/cvd Cardiovascular diseases
dog10k http://dog10k.kiz.ac.cn/ Dog omics
DIONYSUS http://www.dsimb.inserm.fr/DIONYSUS/ Protein-carbohydrate interfaces
DMRdb http://www.inbirg.com/DMRdb/ Mendelian randomization-based causal relationships
ECBD https://ecbd.eu European chemical biology database
EXPRESSO https://expresso.sustech.edu.cn The 3D genome, the epigenome and gene expression
GDP https://biogdp.com/ Biomedical graphics
GoFCards http://www.genemed.tech/gofcards Known and predicted gain-of-function variants
GTO http://www.inbirg.com/gto Gene therapy omnibus
GutMetaNet https://gutmetanet.deepomics.org/ Horizontal gene transfer and functional redundancy in the human gut microbiome
GWAShug http://www.gwashug.com Diverse GWAS methods
HSCGD http://www.hscgd.com Human single-cell genomes
Immunosenescence Inventory https://ngdc.cncb.ac.cn/iaa/home Multi-omics of immune aging
LncPepAtlas http://www.cnitbiotool.net/LncPepAtlas/ Long non-coding RNA Peptide Atlas
MAPbrain http://bigdata.ibp.ac.cn/mapBRAIN/ Multi-omics Atlas of the Primate Brain
MDRepo https://mdrepo.org Molecular dynamics simulations
metsDB https://relab.xidian.edu.cn/metsDB/ Cancer metastasis with bulk, single-cell and spatial transcriptomics
MicrobiomeNet https://www.microbiomenet.com/ Microbial communities and their metabolism
MicroEpitope http://bio-bigdata.hrbmu.edu.cn/MicroEpitope Immune epitopes derived from cancer microbiome
MolBiC https://idrblab.org/MolBiC/ Molecular bioactivities in cells
mOTUs-db https://motus-db.org Prokaryotic (metagenome-assembled) genomes
NAIRDB https://nairdb.genesilico.pl Nucleic Acid InfraRed Data Bank
NASA OSDR https://osdr.nasa.gov/bio/ NASA Open Science Data Repository
ncPlantDB https://bis.zju.edu.cn/ncPlantDB Plant ncRNAs and potential encoded peptides
OncoSexome https://idrblab.org/OncoSexome/ Sex-based differences in cancer risk, biomarkers and responses
OrgXenomics https://idrblab.org/orgxenomics Proteomics of organoids and xenografts
Pairpot http://123.207.205.5 Paired spatial and single-cell transcriptomics
PanKB http://pankb.org Microbial pangenomes
PDCdb http://47.99.36.124/PDCdb/ Peptide-drug conjugates
PerturBase http://www.perturbase.cn/ Single-cell perturbation data
PerturbAtlas https://perturbatlas.kratoss.site Bulk genetic perturbation data
PerturbDB http://research.gzsys.org.cn/perturbdb Perturb-seq data for gene function and drug target identification
PharmFreq https://pharmfreq.com/ Global distribution of pharmacogenomic allele frequencies
PGxDB https://pgx-db.org/ Pharmacogenomics
PhageDive https://phagedive.dsmz.de Bacteriophages and archaeal viruses
PIPdb https://nmdc.cn/pipdb/ Plasmids in pathogens database
PLAbDab-nano https://opig.stats.ox.ac.uk/webapps/plabdab-nano/ Nanobody sequences taken from patents and literature
PlantCircRNA https://plant.deepbiology.cn/PlantCircRNA/ Plant Circular RNAs
PlasmidScope https://plasmid.deepomics.org/ Plasmids with rich annotations
PWAS https://pwas.huji.ac.il/ Proteome-wide association studies
RDBSB https://www.biosino.org/rdbsb/ Registry and database of bioparts for synthetic biology
RodentGMPOmics Atlas http://210.22.121.250:8888/Rodent/index/homePage Rodent genome and pathogen microbiome database
sc2GWAS https://bio.liclab.net/sc2GWAS/ Relationships between GWAS traits and individual cells
scCancerExplorer https://bianlab.cn/scCancerExplorer Single-cell multi-omics data of human cancer
scDrugAct http://bio-bigdata.hrbmu.edu.cn/scDrugAct/ Single-cell Drug Action Atlas
Sci-ModoM https://scimodom.dieterichlab.org/ Transcriptome-wide high-throughput RNA modification sites
scImmOmics https://bio.liclab.net/scImmOmics/home Single-cell multi-omics immune database
scLTdb https://scltdb.com Single-cell lineage tracing database
scMMO-atlas https://www.biosino.org/scMMO-atlas/ Single-cell multimodal omics
scProAtlas https://relab.xidian.edu.cn/scProAtlas/#/ Single-cell spatial proteomics imaging in human tissues
ScRAPdb https://www.evomicslab.org/db/ScRAPdb/ Saccharomyces cerevisiae reference assembly panel
scTML http://sctml.xglab.tech/ Single-cell tumor mutation landscape
scTWASAtlas https://ngdc.cncb.ac.cn/sctwas/ Single-cell transcriptome-wide association studies
SMC https://smc.jgi.doe.gov Secondary metabolism collaboratory
SoIR http://soir.bio2db.com Solanaceae information resource
SPathDB http://bio-bigdata.hrbmu.edu.cn/SPathDB/ Heterogeneity of functional activity from spatial transcriptomics
SpatialRef http://www.liclab.net/spatialref/ Manually curated spatial omics
stSNV http://bio-bigdata.hrbmu.edu.cn/stSNV/index.jsp Spatial SNVs in human
SubCELL https://idrblab.org/subcell Subcellular compartment-specific interactions
SV4GD http://bio-computing.hrbmu.edu.cn/SV4GD/ Structural variation for genetic diseases
TranscriptDB https://transcriptdb.cobius.usherbrooke.ca Eukaryotic transcript conservation and evolution
VDGE https://ngdc.cncb.ac.cn/vdge Variation database of gene-edited animals
xQTLatlas http://www.hitxqtl.org.cn/ xQTL at cellular resolution

Table 2.

Updated descriptions of databases most recently published elsewhere

Database name URL Short description
COCONUT https://coconut.naturalproducts.net Natural products
Enterobase https://enterobase.warwick.ac.uk, https://enterobase.dsmz.de Pathogen genomic analyses
FunCoup https://funcoup.org Functional association networks
Harmonizome https://maayanlab.cloud/Harmonizome/ Harmonized gene-attribute associations
MFIB https://mfib.pbrg.hu Mutual folding induced by binding
NCBI Taxonomy https://www.ncbi.nlm.nih.gov/taxonomy, https://www.ncbi.nlm.nih.gov/datasets/ Curated taxonomy across all domains of life
oriTDB https://bioinfo-mml.sjtu.edu.cn/oriTDB2/ Origin-of-transfer regions of bacterial mobile genetic elements
PMN https://plantcyc.org Plant metabolic network
PTMD https://ptmd.biocuckoo.cn/ Post-translational modifications linked to disease
SSBD https://ssbd.riken.jp Bioimaging data

The ‘Nucleic acid databases’ section contains this Issue’s chosen Breakthrough article, describing the new database for exploring 3D human genomics information, EXPRESSO (4). This collects large amounts of data across 46 tissues and cross-links it to epigenomics and transcriptomics information. Novel visualization tools map this information to architectural features such as compartments and stripes. Options for programmatic access will certainly enhance the database’s reach and the authors plan to extend the database to other species, incorporate new data modalities and allow processing of user data. Other interesting new arrivals include NAIRDB (5), which records Fourier transform infrared data for nucleic acids, a methodology that sheds valuable light on structure and interactions, especially conformational transitions; and Sci-ModoM, a standardized and integrated pan-modification database for RNA epitranscriptomic markers (6).

The standout update of the section comes from the cornerstone resource Refseq, which celebrates 25 years with a paper reflecting on its massive growth of its curated reference gene, transcript and protein sequences (7). The paper describes how genomes are selected for annotation and the different pipelines applied to different kingdoms; and reports on collaborations with many other resources familiar to the Database Issue readership. Also returning after a long hiatus, the VISTA Enhancer Browser reports (8) a 20-fold increase in data including manually curated comparisons between allele data that shed light on enhancers’ roles in human disease. Rfam also contributes a multi-faceted update (9) describing numerous improvements and expansions, from exploiting the latest RNA structure prediction methods to better annotate secondary structure to entries, and better mapping of families to Gene Ontology terms (10), to completion of a process to align Rfam content with the microRNA database miRBase (11).

Among other updates, microRNAs (miRNAs) are particularly well-served by a quintet of returning databases. MirGeneDB (12) catalogues families of miRNAs across species, now covering five new phyla, enabling valuable efforts towards a uniform nomenclature. miRNATissueAtlas (13) captures the expression of miRNAs in human and mouse and reports a seven-fold increase in tissues as well as interesting cross-species comparisons with potential relevance to miRNA therapeutic development. miRStart (14) covers miRNA transcription start sites and now integrates regulation of miRNA expression by transcription factors (TFs). The remaining miRNA databases cover the regulatory functions of the molecules: miRTarBase (15) reports a near-doubling of miRNA-target interactions and integration of further databases to introduce coverage, for example, of miRNA links to drug resistance (16), and miRNAs as biomarkers (17); while TransMir (18) significantly expands in content and introduces more disease-specific TF-miRNA networks and novel sex-specific networks.

Finally, plasmids are covered by the returning PLSDB—which doubles in size, now covers Archaea, and offers more detailed information on host, ecosystem and encoded genes (19)—and two new databases PIPdb (20) and PlasmidScope (21). The former focuses on plasmids in pathogens to help address questions around host range, transmission of antibiotic resistance genes and so on; while the later majors on rich annotation, up to and including predicted protein structures, of almost a million plasmids.

The protein section contains updates from four pillars of bioinformatics research hosted by the EBI. The UniProt paper (22) offers a fascinating overview of how different streams of information feed into manual curation, from the curators’ own targeted papers and other database content [Rhea (23) is a recent focus] to community-contributed batches of papers. The Pfam paper (24) reports especially on the use of AI tools to expand coverage and refine domain boundaries, both using structure predictions from the AlphaFold Protein Structure Database [AFDB; (25)]; and with the Pfam-N approach, where deep learning adds to HMM-based domain detection, and so significantly extends sequence coverage. Interpro (26), which encompasses Pfam and many other domain and family databases, describes how, like many other databases, it is exploring the possibilities of large language models to automatically generate text descriptions of entries. Selection and visualization of new ‘Representative Domains’ is also a small but significant improvement to usability, condensing the sometimes complex multi-track displays for large proteins. Finally from the EBI, the PRIDE database reaches 20 years: its contribution (27) reports increased efforts to reuse, reanalyse and disseminate information to other well-known databases. AI also appears in the form of a chatbot to facilitate users’ navigation of the PRIDE documentation. Elsewhere there are updates too from COG (28), another InterPro member, where secretion systems have been a special focus, alongside RNA modification and multi-domain signal transduction protein families; and from the RCSB (29) which makes its motif search and structural similarity tools accessible to users’ own structures.

The AlphaFold 2 (AF2) revolution continues to be reflected heavily throughout this Issue. The AFDB is now digested by the popular general databases of protein structural domains ECOD (30) and CATH (31), and by the specialist resource RepeatsDB (32). ECOD now classifies domains from over a million proteins and showcases changes to family classification resulting from collaboration with Pfam and initial efforts to include domain-ligand interactions. CATH expands numbers of folds and superfamilies, mainly through additions from the AFDB-derived Encyclopedia of Domains (TED) resource (33): a visually appealing figure of selected new architectures enhances the paper. RepeatsDB, meanwhile, has tripled its content of curated repeat proteins and made improvements to its website. AF2 predictions are a key part of the new database ASpdb (34), which covers human protein isoform structures, using predictions to plug the many gaps in our experimental knowledge. Another important new database in the area, BFVD (35), covers AF2 predictions of viral protein structures, a conspicuous absence in the AFDB. Finally, immune-related proteins are covered by three database updates and a new resource. The updates are from the immune epitope database (36), which celebrates 20 years with custom 3D molecular viewers and improved links to other databases; IPD-MHC (37), which now includes predicted 3D protein structures (and comparisons) and coverage that now includes cetaceans; and TCR3D (38), which showcases a range of improvements such as inclusion of TCR mimic antibodies, of considerable therapeutic interest. The new database, PLAbDab-nano mines the literature and patents to comprehensively cover nanobodies (39).

In the section for metabolism, signalling and enzymes, there are important updates from two foundational databases. STRING (40), for protein associations, now allows users to access functional, physical and regulatory networks separately, and offers upgraded facilities for gene set enrichment analysis; while the KEGG (41) database of pathways and systems describes improvements in coverage of viruses, tools for genome alignment using KEGG-assigned genes and better taxonomic mapping. Equally exciting is an update from CAZY (42), the database of carbohydrate-active enzymes reporting on CAZac (standing for CAZyme activity descriptor), a new framework to systematically describe function which offers fresh insights into the evolution of substrate specificity and mechanisms of CAZymes. Elsewhere, major themes are biosynthetic gene clusters (BGCs) and secondary metabolism; and the natural products that result from their activity. The popular MIBiG resource for BGCs reports the results of a huge community effort that has improved its coverage, accuracy and completeness (43); and is joined by both the BGC Atlas (44) focussing on BGCs in MGnify (45) metagenomics datasets, and by the Secondary Metabolism Collaboratory (46) which adopts a tool-agnostic approach to define 13 million BGCs, encouraging community efforts towards their annotation. The natural products themselves are covered by a trio of updates. NP-MRD (47) focuses on their NMR characterization and reports a seven-fold increase in content; the Natural Products Atlas (48) reports new content, including chemically novel structures, and careful reassignment/correction of many entries; and COCONUT [(49); appearing in NAR for the first time] has an expanded array of data sources resulting in a collection of nearly 700 000 unique structures.

The next section covers viruses and microbes. A new resource, PhageDIVE [sister to BacDIVE, that also updates in this Issue (50)] covers bacteriophages and archaeal viruses (51). New to NAR, and now driven by an international consortium, is the popular Enterobase (52), which covers the genomic epidemiology of bacterial pathogens and now holds over a million genome sequences. Microbiomes, especially the gut microbiome are well covered by an update paper and two interesting new resources. The update comes from gutMGene (53), whose content carefully distinguishes causal and correlative associations between microbes, metabolites and host genes; and which now includes associations derived from metabolic reconstitution. The new databases are MicrobiomeNet (54), which contains around 6 million microbial associations and genome-scale metabolic models for thousands of microbes; and GutMetaNet (55), which looks at horizontal gene transfer and functional redundancy in the human gut microbiome. Also in the section are updates from two important databases in host-pathogen interaction: PHI-base (56) reports the results of an exercise in analysing use case scenarios, resulting in seven distinct modes; while VFDB (57) now expands to include coverage of 900 anti-virulence compounds and their associations with virulence factors.

In the human, model organism and comparative genomics section, arguably the most notable update comes from the major resource dbSNP (58), which celebrates 25 years and where SNPs now number in the billions. SNPs in miRNAs are the focus of the popular miRNASNP (59), where a major expansion away from the original human-only focus sees a further 16 animals covered. For the interpretation of genetic variability, the major resource GWAS Catalog (60) now captures data from 7000 papers; and is joined by GWAShug (61), which offers a notably comprehensive range of analytical and visualization options. The popular WebTWAS (62) also publishes an update reporting significantly increased content and a wider range of algorithmic options. In a similar space, the new xQTLatlas (63) covers an impressive range of omics modalities and analyses tailored to specific cell types and -states.

Two stalwarts of the genomics space, Ensembl (64) and UCSC Genome Browser (63) each publish their usual updates. A major focus for Ensembl is keeping pace with annotation of outputs from biodiversity projects such as the Darwin Tree of Life (65) with a resulting doubling of content on the Rapid Release site. The Browser has 25 new annotation tracks and an array of usability improvements. Feeding into both of these is GENCODE, for human and mouse reference gene annotations, which reaches 20 years with an update (66) discussing current challenges such as distinguishing protein-coding genes from pseudogenes and reliable detection of protein coding potential for short ORFs. Elsewhere there is a huge focus on single-cell and spatial data. Single cell transcriptomes are the focus of the new CELLxGENE platform (67), which collects curated and standardised single-cell transcriptomic data for 93 million cells; and of the returning DISCO resource (68), which has similarly impressive data content and tools, as well as a cell type ontology and associated set of differentially expressed gene signatures. In the same territory, Bgee (69) adds data from 23 more species including those of agronomic and veterinary importance, and nonhuman primates. Paired multi-modal single-cell omics is the focus of scMMO-atlas (70), which illustrates how its content can identify novel cell subsets and developmental states; and scImmOmics (71), which focuses specifically on immune cells and contains data, including multi-modal omics data, for 2.9 million cells. Single-cell genomes are usefully covered by the new HSCGD resource (72), which contains 74 000 genomes and offers useful visualization of copy number variation and single nucleotide variants. Rounding off this section, the Issue contains no fewer than four new databases in the spatial omics space. SpatialRef (73) aggregates millions of manually annotated spots across 17 species and offers abundant spatial transcriptomics analyses, while SPathDB (74) similarly aims to characterize the spatial distribution of pathway function. Finally, stSNV (75) focuses on single nucleotide variants in the human spatial transcriptome, while Pairpot (76) offers paired single-cell and spatial transcriptomics.

Cancer has its usual strong presence in the section on human genomic variation, diseases and drugs. New databases focusing on distinct aspects include metsDB (77) focussed on metastasis from a (single, spatial) gene expression perspective; scDrugAct (78) looking at the influence of tumour microenvironment on drug action using single-cell expression data; and OncoSexome (79), which comprehensively covers sex-based differences in cancer incidence, drug response, etc. Cardiovascular diseases also get their own dedicated database in the form of CVD Atlas (80), which compiles multi-omics information and offers an interactive graph of disease-gene associations alongside other user-friendly browsing and analytical tools. Several other databases look at how genetic variation is linked to disease. Prominent among them, ClinVar (81) contributes an update that describes how it now seeks to expand coverage to somatic cancer variants. CAUSALdb, for linking causal variants and traits, has a strong disease focus and its update (82) reports on a significant increase in content and more sophisticated analytical tools.

Elsewhere, the new, manually curated SV4GD (83) covers the relationships between genome structural variations and genetic disease. Covering the chemical space from which drugs are drawn are both the returning PubChem (84), which now draws on >1000 data sources and contains >100 million compounds; and the new European chemical biology database (85), which includes screening assay data for over 100 000 compounds and offers a wide range of data visualization options. The popular DrugMAP database, which contributes our cover image, now includes information on new aspects such as combinatorial drug treatments, repurposed drugs and off-target drug effects (86). Other updating databases cover specific chemical/drug categories, which are all the focus of intense research. CovalentInDB (87) covers covalent inhibitors and reports the inclusion of cocrystal structures, a profiling of potential covalent binding sites across the high-resolution human structural proteome, and a library of >2 million commercially available chemicals for virtual screening. PROTAC-DB (88) reports a near-doubling of PROTAC number and the addition of pharmacokinetic data where available. Elsewhere in the section, the Comparative Toxicogenomics Database celebrates its 20th anniversary with a suitably reflective paper (89) but one also reporting significant improvements in options to interrogate and visualize its 94 million toxicogenomic associations between chemicals, genes/proteins, phenotypes and diseases. Pharmacogenomics are covered both by the comprehensive knowledge graph approach of PGxDB (90) and by PharmFreq which looks at the variation in pharmacogenes across ethnicities and geography (91).

The plant database section mainly comprises reports on new databases. Sibling databases from the same authors cover different families of plants: SolR (92) features diverse data on 81 species of Solanaceae, among them potato, tomato and pepper, while AMIR (93) offers a similarly multi-dimensional view of Asteraceae such as lettuce, sunflower and chrysanthemum. Two further databases cover classes of RNA in plants: ncPlantDB (94) covers plant non-coding RNAs, their regulatory interactions, and their (contradictorily) encoded peptides; while PlantCircRNA (95) focuses on plant circular RNAs, drawing from the literature, from RNA-seq data and from foregoing more specialized resources. There are also update papers from the venerable Rice Genome Annotation Project (96), which reports a range of enhancements, such as the inclusion of variants from pan-genome projects and genomes from near relatives; and from the PMN resource of plant metabolic pathways, publishing in NAR for the first time, which now covers 155 species and well over a million enzymes (97). The final section features a pleasing variety of databases not readily categorized above. The heavily used NCBI Taxonomy database has an update (98) that focuses particularly on integration into the Datasets portal. Other popular returning databases cover small molecules—Bitterdb reports a major expansion to cover 66 species (99), while mVOC covers volatile organic compounds (100)—or their binding to proteins, with BindingDB describing significant expansion, mainly driven by information mined from patents (101).

NAR online molecular biology database collection

As part of our ongoing efforts, we have continued updating the NAR online Molecular Biology Database Collection (freely available at http://www.oxfordjournals.org/nar/database/c/) by monitoring unresponsive databases. Over the year we have reviewed and updated 932 entries. 74 new resources listed in this issue were added to the database and 226 entries have been removed. Further curation efforts were made reviewing recent updates and user feedback bringing the total to 2236 databases in this new update. We encourage authors to submit their updates to XMF at xose.m.fernandez@gmail.com in plain text, ideally according to the template found in http://www.oxfordjournals.org/nar/database/summary/1.

Acknowledgements

We thank Dr Martine Bernardes-Silva, especially, and the rest of the Oxford University Press team led by Rhiannon Meaden for their help in compiling this issue.

Contributor Information

Daniel J Rigden, Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.

Xosé M Fernández, IQVIA Ltd., The Point, 37 North Wharf Road, London W2 1AF, UK.

Funding

Funding for open access charge: Oxford University Press.

Conflict of interest statement. The authors’ opinions do not necessarily reflect the views of their respective institutions.

References

  • 1. Thakur M., Brooksbank C., Gurwitz K.T., Finn R.D., Firth H.V., Foreman J., Freeberg M., Perova Z., Harrison M., Sarkans U.et al.. EMBL’s European Bioinformatics Institute (EMBL-EBI) in 2024. Nucleic Acids Res. 2024; 10.1093/nar/gkae1089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Sayers E.W., Beck J., Bolton E.E., Brister J.R., Chan J., Connor R., Feldgarden M., Fine A.M., Funk K., Hoffman J.et al.. Database resources of the National Center for Biotechnology Information in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. CNCB-NGDC Members and Partners Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Cai L., Qiao J., Zhou R., Wang X., Li Y., Jiang L., Zhou Q., Li G., Xu T., Feng Y.. EXPRESSO: a multi-omics database to explore multi-layered 3D genomic organization. Nucleic Acids Res. 2024; 10.1093/nar/gkae999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Balduzzi E., Geinguenaud F., Sordyl D., Maiti S., Farsani M.A., Nikolaev G., Arluison V., Bujnicki J.M.. NAIRDB: a database of fourier transform infrared (FTIR) data for nucleic acids. Nucleic Acids Res. 2024; 10.1093/nar/gkae885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Boileau E., Wilhelmi H., Busch A., Cappannini A., Hildebrand A., Bujnicki J.M., Dieterich C.. Sci-ModoM: a quantitative database of transcriptome-wide high-throughput RNA modification sites. Nucleic Acids Res. 2024; 10.1093/nar/gkae972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Goldfarb T., Kodali V.K., Pujar S., Brover V., Robbertse B., Farrell C.M., Oh D.H., Astashyn A., Ermolaeva O., Haddad D.et al.. NCBI RefSeq: reference sequence standards through 25 years of curation and annotation. Nucleic Acids Res. 2024; 10.1093/nar/gkae1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Kosicki M., Baltoumas F.A., Kelman G., Boverhof J., Ong Y., Cook L.E., Dickel D.E., Pavlopoulos G.A., Pennacchio L.A., Visel A.. VISTA Enhancer Browser: an updated database of tissue-specific developmental enhancers. Nucleic Acids Res. 2024; 10.1093/nar/gkae940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Ontiveros-Palacios N., Cooke E., Nawrocki E.P., Triebel S., Marz M., Rivas E., Griffiths-Jones S., Petrov A.I., Bateman A., Sweeney B.. Rfam 15: RNA families database in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. The Gene Ontology Consortium “The Gene Ontology resource: enriching a GOld mine”. Nucleic Acids Res. 2021; 49:D325–D334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kozomara A., Birgaoanu M., Griffiths-Jones S.. miRBase: from microRNA sequences to function. Nucleic Acids Res. 2019; 47:D155–D162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Clarke A.W., Høye E., Hembrom A.A., Paynter V.M., Vinther J., Wyrożemski Ł., Biryukova I., Formaggioni A., Ovchinnikov V., Herlyn H.et al.. MirGeneDB 3.0: improved taxonomic sampling, uniform nomenclature of novel conserved microRNA families, and updated covariance models. Nucleic Acids Res. 2024; 10.1093/nar/gkae1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Rishik S., Hirsch P., Grandke F., Fehlmann T., Keller A.. miRNATissueAtlas 2025: an update to the uniformly processed and annotated human and mouse non-coding RNA tissue atlas. Nucleic Acids Res. 2024; 10.1093/nar/gkae1036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Xu J., Wan J., Huang H.Y., Chen Y., Huang Y., Huang J., Zhang Z., Su C., Zhou Y., Lin X.et al.. miRStart 2.0: enhancing miRNA regulatory insights through deep learning-based TSS identification. Nucleic Acids Res. 2024; 10.1093/nar/gkae1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Cui S., Yu S., Huang H.Y., Lin Y.C., Huang Y., Zhang B., Xiao J., Zuo H., Wang J., Li Z.et al.. miRTarBase 2025: updates to the collection of experimentally validated microRNA-target interactions. Nucleic Acids Res. 2024; 10.1093/nar/gkae1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Cao X., Zhou X., Hou F., Huang Y.E., Yuan M., Long M., Chen S., Lei W., Zhu J., Chen J.et al.. ncRNADrug: a database for validated and predicted ncRNAs associated with drug resistance and targeted by drugs. Nucleic Acids Res. 2024; 52:D1393–D1399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Zhang Y., Zhou Y., Zhou Y., Yu X., Shen X., Hong Y., Zhang Y., Wang S., Mou M., Zhang J.et al.. TheMarker: a comprehensive database of therapeutic biomarkers. Nucleic Acids Res. 2024; 52:D1450–D1464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Liang M., Zhang C., Yang Y., Cui Q., Zhang J., Cui C.. TransmiR v3.0: an updated transcription factor-microRNA regulation database. Nucleic Acids Res. 2024; 10.1093/nar/gkae1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Molano L.G., Hirsch P., Hannig M., Müller R., Keller A.. The PLSDB 2025 update: enhanced annotations and improved functionality for comprehensive plasmid research. Nucleic Acids Res. 2024; 10.1093/nar/gkae1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Zhu Q., Chen Q., Gao S., Li Z., Zhou H., Cui Z., Fan G., Liu X., Wu X., Ma J.et al.. PIPdb: a comprehensive plasmid sequence resource for tracking the horizontal transfer of pathogenic factors and antimicrobial resistance genes. Nucleic Acids Res. 2024; 10.1093/nar/gkae952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Li Y., Feng X., Chen X., Yang S., Zhao Z., Chen Y., Li S.C.. PlasmidScope: a comprehensive plasmid database with rich annotations and online analytical tools. Nucleic Acids Res. 2024; 10.1093/nar/gkae930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. UniProt Consortium UniProt: the Universal protein knowledgebase in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Bansal P., Morgat A., Axelsen K.B., Muthukrishnan V., Coudert E., Aimo L., Hyka-Nouspikel N., Gasteiger E., Kerhornou A., Neto T.B.et al.. Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res. 2022; 50:D693–D700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Paysan-Lafosse T., Andreeva A., Blum M., Chuguransky S.R., Grego T., Pinto B.L., Salazar G.A., Bileschi M.L., Llinares-López F., Meng-Papaxanthos L.et al.. The Pfam protein families database: embracing AI/ML. Nucleic Acids Res. 2024; 10.1093/nar/gkae997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Varadi M., Bertoni D., Magana P., Paramval U., Pidruchna I., Radhakrishnan M., Tsenkov M., Nair S., Mirdita M., Yeo J.et al.. AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. 2024; 52:D368–D375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Blum M., Andreeva A., Florentino L.C., Chuguransky S.R., Grego T., Hobbs E., Pinto B.L., Orr A., Paysan-Lafosse T., Ponamareva I.et al.. InterPro: the protein sequence classification resource in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Perez-Riverol Y., Bandla C., Kundu D.J., Kamatchinathan S., Bai J., Hewapathirana S., John N.S., Prakash A., Walzer M., Wang S.et al.. The PRIDE database at 20 years: 2025 update. Nucleic Acids Res. 2024; 10.1093/nar/gkae1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Galperin M.Y., Vera Alvarez R., Karamycheva S., Makarova K.S., Wolf Y.I., Landsman D., Koonin E.V.. COG database update 2024. Nucleic Acids Res. 2024; 10.1093/nar/gkae983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Burley S.K., Bhatt R., Biester A., Bhikadiya C., Bi C., Biswas P., Bittrich S., Blaumann S., Brown R., Chao H.et al.. Updated resources for exploring experimental PDB structures and computed structure models at the RCSB Protein Data Bank. Nucleic Acids Res. 2024; 10.1093/nar/gkae1091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Schaeffer R.D., Medvedev K.E., Andreeva A., Chuguransky S.R., Pinto B.L., Zhang J., Cong Q., Bateman A., Grishin N.V.. ECOD: integrating classifications of protein domains from experimental and predicted structures. Nucleic Acids Res. 2024; 10.1093/nar/gkae1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Waman V.P., Bordin N., Lau A., Kandathil S., Wells J., Miller D., Velankar S., Jones D.T., Sillitoe I., Orengo C.. CATH v4.4: major expansion of CATH by experimental and predicted structural data. Nucleic Acids Res. 2024; gkae1087. 10.1093/nar/gkae1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Clementel D., Arrías P.N., Mozaffari S., Osmanli Z., Castro X.A., curators R.D.B., Ferrari C., Kajava A.V., Tosatto S.C.E., Monzon A.M. RepeatsDB in 2025: expanding annotations of structured tandem repeats proteins on AlphaFoldDB. Nucleic Acids Res. 2024; 10.1093/nar/gkae965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Lau A.M., Bordin N., Kandathil S.M., Sillitoe I., Waman V.P., Wells J., Orengo C.A., Jones D.T.. Exploring structural diversity across the protein universe with the Encyclopedia of Domains. Science. 2024; 386:eadq4946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Yang Y., Kumar H., Xie Y., Li Z., Li R., Chen W., Diala C.S., Ali M.A., Xu Y., Wu A.et al.. ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures. Nucleic Acids Res. 2024; 10.1093/nar/gkae1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Kim R.S., Levy Karin E., Mirdita M., Chikhi R., Steinegger M.. BFVD-a large repository of predicted viral protein structures. Nucleic Acids Res. 2024; 10.1093/nar/gkae1119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Vita R., Blazeska N., Marrama D.IEDB Curation Team Members IEDB Curation Team Members Duesing S., Bennett J., Greenbaum J., De Almeida Mendes M., Mahita J., Wheeler D.K.et al.. The Immune Epitope Database (IEDB): 2024 update. Nucleic Acids Res. 2024; 10.1093/nar/gkae1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Maccari G., Robinson J., Barker D.J., Yates A.D., Hammond J.A., Marsh S.G.E.. The 2024 IPD-MHC database update: a comprehensive resource for major histocompatibility complex studies. Nucleic Acids Res. 2024; 10.1093/nar/gkae932. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Lin V., Cheung M., Gowthaman R., Eisenberg M., Baker B.M., Pierce B.G.. TCR3d 2.0: expanding the T cell receptor structure database with new structures, tools and interactions. Nucleic Acids Res. 2024; 10.1093/nar/gkae840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Gordon G.L., Greenshields-Watson A., Agarwal P., Wong A., Boyles F., Hummer A., Lujan Hernandez A.G., Deane C.M.. PLAbDab-nano: a database of camelid and shark nanobodies from patents and literature. Nucleic Acids Res. 2024; 10.1093/nar/gkae881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Szklarczyk D., Nastou K., Koutrouli M., Kirsch R., Mehryary F., Hachilif R., Hu D., Peluso M.E., Huang Q., Fang T.et al.. The STRING database in 2025: protein networks with directionality of regulation. Nucleic Acids Res. 2024; 10.1093/nar/gkae1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Kanehisa M., Furumichi M., Sato Y., Matsuura Y., Ishiguro-Watanabe M.. KEGG: biological systems database as a model of the real world. Nucleic Acids Res. 2024; 10.1093/nar/gkae909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Lombard V., Henrissat B., Garron M.L.. CAZac: an activity descriptor for carbohydrate-active enzymes. Nucleic Acids Res. 2024; 10.1093/nar/gkae1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zdouc M.M., Blin K., Louwen N.L.L., Navarro J., Loureiro C., Bader C.D., Bailey C.B., Barra L., Booth T.J., Bozhüyük K.A.J.et al.. MIBiG 4.0: advancing biosynthetic gene cluster curation through global collaboration. Nucleic Acids Res. 2024; 10.1093/nar/gkae1115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Bağcı C., Nuhamunada M., Goyat H., Ladanyi C., Sehnal L., Blin K., Kautsar S.A., Tagirdzhanov A., Gurevich A., Mantri S.et al.. BGC Atlas: a web resource for exploring the global chemical diversity encoded in bacterial genomes. Nucleic Acids Res. 2024; 10.1093/nar/gkae953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Richardson L., Allen B., Baldi G., Beracochea M., Bileschi M.L., Burdett T., Burgin J., Caballero-Pérez J., Cochrane G., Colwell L.J.et al.. MGnify: the microbiome sequence data analysis resource in 2023. Nucleic Acids Res. 2023; 51:D753–D759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Udwary D.W., Doering D.T., Foster B., Smirnova T., Kautsar S.A., Mouncey N.J.. The secondary metabolism collaboratory: a database and web discussion portal for secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. 2024; 10.1093/nar/gkae1060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Wishart D.S., Sajed T., Pin M., Poynton E.F., Goel B., Lee B.L., Guo A.C., Saha S., Sayeeda Z., Han S.et al.. The Natural Products Magnetic Resonance Database (NP-MRD) for 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Poynton E.F., van Santen J.A., Pin M., Contreras M.M., McMann E., Parra J., Showalter B., Zaroubi L., Duncan K.R., Linington R.G.. The Natural Products Atlas 3.0: extending the database of microbially derived natural products. Nucleic Acids Res. 2024; 10.1093/nar/gkae1093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Chandrasekhar V., Rajan K., Kanakam S.R.S., Sharma N., Weißenborn V., Schaub J., Steinbeck C.. COCONUT 2.0: a comprehensive overhaul and curation of the collection of open natural products database. Nucleic Acids Res. 2024; 10.1093/nar/gkae1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Schober I., Koblitz J., Sardà Carbasse J., Ebeling C., Schmidt M.L., Podstawka A., Gupta R., Ilangovan V., Chamanara J., Overmann J.et al.. BacDive in 2025: the core database for prokaryotic strain data. Nucleic Acids Res. 2024; 10.1093/nar/gkae959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Rolland C., Wittmann J., Reimer L.C., Sardà Carbasse J., Schober I., Dudek C.A., Ebeling C., Koblitz J., Bunk B., Overmann J.. PhageDive: the comprehensive strain database of prokaryotic viral diversity. Nucleic Acids Res. 2024; 10.1093/nar/gkae878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Dyer N.P., Päuker B., Baxter L., Gupta A., Bunk B., Overmann J., Diricks M., Dreyer V., Niemann S., Holt K.E.et al.. EnteroBase in 2025: exploring the genomic epidemiology of bacterial pathogens. Nucleic Acids Res. 2024; 10.1093/nar/gkae902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Qi C., He G., Qian K., Guan S., Li Z., Liang S., Liu J., Ke X., Zhang S., Lu M.et al.. gutMGene v2.0: an updated comprehensive database for target genes of gut microbes and microbial metabolites. Nucleic Acids Res. 2024; 10.1093/nar/gkae1002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Lu Y., Hui F., Zhou G., Xia J.. MicrobiomeNet: exploring microbial associations and metabolic profiles for mechanistic insights. Nucleic Acids Res. 2024; 10.1093/nar/gkae944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Jiang Y., Wang Y., Che L., Yang S., Zhang X., Lin Y., Shi Y., Zou N., Wang S., Zhang Y.et al.. GutMetaNet: an integrated database for exploring horizontal gene transfer and functional redundancy in the human gut microbiome. Nucleic Acids Res. 2024; 10.1093/nar/gkae1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Urban M., Cuzick A., Seager J., Nonavinakere N., Sahoo J., Sahu P., Iyer V.L., Khamari L., Martinez M.C., Hammond-Kosack K.E.. PHI-base - the multi-species pathogen-host interaction database in 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Zhou S., Liu B., Zheng D., Chen L., Yang J.. VFDB 2025: an integrated resource for exploring anti-virulence compounds. Nucleic Acids Res. 2024; 10.1093/nar/gkae968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Phan L., Zhang H., Wang Q., Villamarin R., Hefferon T., Ramanathan A., Kattman B.. The evolution of dbSNP: 25 years of impact in genomic research. Nucleic Acids Res. 2024; 10.1093/nar/gkae977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Cao W., He J., Feng J., Wu X., Wu T., Wang D., Min C., Niu X., Gao Z., Guo A.Y.et al.. miRNASNP-v4: a comprehensive database for miRNA-related SNPs across 17 species. Nucleic Acids Res. 2024; 10.1093/nar/gkae888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Cerezo M., Sollis E., Ji Y., Lewis E., Abid A., Bircan K.O., Hall P., Hayhurst J., John S., Mosaku A.et al.. The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity. Nucleic Acids Res. 2024; 10.1093/nar/gkae1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Cao C., Tian M., Li Z., Zhu W., Huang P., Yang S.. GWAShug: a comprehensive platform for decoding the shared genetic basis between complex traits based on summary statistics. Nucleic Acids Res. 2024; 10.1093/nar/gkae873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Cao C., Shao M., Wang J., Li Z., Chen H., You T., Li M.J., Ding Y., Zou Q.. webTWAS 2.0: update platform for identifying complex disease susceptibility genes through transcriptome-wide association study. Nucleic Acids Res. 2024; 10.1093/nar/gkae1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Jia Y., Dong H., Li L., Wang F., Juan L., Wang Y., Guo H., Zhao T.. xQTLatlas: a comprehensive resource for human cellular-resolution multi-omics genetic regulatory landscape. Nucleic Acids Res. 2024; 10.1093/nar/gkae837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Dyer S.C., Austine-Orimoloye O., Azov A.G., Barba M., Barnes I., Barrera-Enriquez V.P., Becker A., Bennett R., Beracochea M., Berry A.et al.. Ensembl 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Darwin Tree of Life Project C Sequence locally, think globally: the Darwin Tree of Life Project. Proc. Natl Acad. Sci. U.S.A. 2022; 119:e2115642118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Mudge J.M., Carbonell-Sala S., Diekhans M., Martinez J.G., Hunt T., Jungreis I., Loveland J.E., Arnan C., Barnes I., Bennett R.et al.. GENCODE 2025: reference gene annotation for human and mouse. Nucleic Acids Res. 2024; 10.1093/nar/gkae1078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. CZI Cell Science Program Abdulla S., Aevermann B., Assis P., Badajoz S., Bell S.M., Bezzi E., Cakir B., Chaffer J., Chambers S.et al.. CZ CELLxGENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data. Nucleic Acids Res. 2024; 10.1093/nar/gkae1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Li M., Ang K.S., Teo B., Rom U., Nguyen M.N., Maurer-Stroh S., Chen J.. Rediscovering publicly available single-cell data with the DISCO platform. Nucleic Acids Res. 2024; 10.1093/nar/gkae1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Bastian F.B., Cammarata A.B., Carsanaro S., Detering H., Huang W., Joye S., Niknejad A., Nyamari M., Farias T.M.d., Moretti S.et al.. Bgee in 2024: focus on curated single-cell RNA-seq datasets, and query tools. Nucleic Acids Res. 2024; 10.1093/nar/gkae1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Cheng W., Yin C., Yu S., Chen X., Hong N., Jin W.. scMMO-atlas: a single cell multimodal omics atlas and portal for exploring fine cell heterogeneity and cell dynamics. Nucleic Acids Res. 2024; 10.1093/nar/gkae821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Li Y.Y., Zhou L.W., Qian F.C., Fang Q.L., Yu Z.M., Cui T., Dong F.J., Cai F.H., Yu T.T., Li L.D.et al.. scImmOmics: a manually curated resource of single-cell multi-omics immune data. Nucleic Acids Res. 2024; 10.1093/nar/gkae985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Fu J., He S., Yang Y., Chen Z., Qiao Y., Lu N., Lu Z., Tu J.. HSCGD: a comprehensive database of single-cell whole-genome data and metadata. Nucleic Acids Res. 2024; 10.1093/nar/gkae971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Cui T., Li Y.Y., Li B.L., Zhang H., Yu T.T., Zhang J.N., Qian F.C., Yin M.X., Fang Q.L., Hu Z.H.et al.. SpatialRef: a reference of spatial omics with known spot annotation. Nucleic Acids Res. 2024; 10.1093/nar/gkae892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Li F., Song X., Fan W., Pei L., Liu J., Zhao R., Zhang Y., Li M., Song K., Sun Y.et al.. SPathDB: a comprehensive database of spatial pathway activity atlas. Nucleic Acids Res. 2024; 10.1093/nar/gkae1041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Yang C., Liu Y., Wang X., Jia Q., Fan Y., Lu Z., Shi J., Liu Z., Chen G., Li J.et al.. stSNV: a comprehensive resource of SNVs in spatial transcriptome. Nucleic Acids Res. 2024; 10.1093/nar/gkae945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Ruan Z., Lin F., Zhang Z., Cao J., Xiang W., Wei X., Liu J.. Pairpot: a database with real-time lasso-based analysis tailored for paired single-cell and spatial transcriptomics. Nucleic Acids Res. 2024; 10.1093/nar/gkae986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Wu S., Zhang J., Wang Y., Qin X., Zhang Z., Lu Z., Kim P., Zhou X., Huang L.. metsDB: a knowledgebase of cancer metastasis at bulk, single-cell and spatial levels. Nucleic Acids Res. 2024; 10.1093/nar/gkae916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Xu Y., Zhang Y., Song K., Liu J., Zhao R., Zhang X., Pei L., Li M., Chen Z., Zhang C.et al.. ScDrugAct: a comprehensive database to dissect tumor microenvironment cell heterogeneity contributing to drug action and resistance across human cancers. Nucleic Acids Res. 2024; 10.1093/nar/gkae994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Shen X., Zhang Y., Li J., Zhou Y., Butensky S.D., Zhang Y., Cai Z., DeWan A.T., Khan S.A., Yan H.et al.. OncoSexome: the landscape of sex-based differences in oncologic diseases. Nucleic Acids Res. 2024; gkae1003. 10.1093/nar/gkae1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Qian Q., Xue R., Xu C., Wang F., Zeng J., Xiao J.. CVD Atlas: a multi-omics database of cardiovascular disease. Nucleic Acids Res. 2024; 10.1093/nar/gkae848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Landrum M.J., Chitipiralla S., Kaur K., Brown G., Chen C., Hart J., Hoffman D., Jang W., Liu C., Maddipatla Z.et al.. ClinVar: updates to support classifications of both germline and somatic variants. Nucleic Acids Res. 2024; 10.1093/nar/gkae1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Wang J., Ouyang L., You T., Yang N., Xu X., Zhang W., Yang H., Yi X., Huang D., Zhou W.et al.. CAUSALdb2: an updated database for causal variants of complex traits. Nucleic Acids Res. 2024; 10.1093/nar/gkae1096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Shi L., Zhang S., Li Y., Li H., Wang X., Du M., Zhang M., Ke L., Zhang Y., Xu C.et al.. SV4GD: a comprehensive structural variation database specially for genetic diseases. Nucleic Acids Res. 2024; 10.1093/nar/gkae1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Kim S., Chen J., Cheng T., Gindulyte A., He J., He S., Li Q., Shoemaker B.A., Thiessen P.A., Yu B.et al.. PubChem 2025 update. Nucleic Acids Res. 2024; 10.1093/nar/gkae1059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Škuta C., Müller T., Voršilák M., Popr M., Epp T., Skopelitou K.E., Rossella F., Stechmann B., Gribbon P., Bartůněk P.. ECBD: European chemical biology database. Nucleic Acids Res. 2024; 10.1093/nar/gkae904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Li F., Mou M., Li X., Xu W., Yin J., Zhang Y., Zhu F.. DrugMAP 2.0: molecular atlas and pharma-information of all drugs. Nucleic Acids Res. 2024; 10.1093/nar/gkae791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Du H., Zhang X., Wu Z., Zhang O., Gu S., Wang M., Zhu F., Li D., Hou T., Pan P.. CovalentInDB 2.0: an updated comprehensive database for structure-based and ligand-based covalent inhibitor design and screening. Nucleic Acids Res. 2024; 10.1093/nar/gkae946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Ge J., Li S., Weng G., Wang H., Fang M., Sun H., Deng Y., Hsieh C.Y., Li D., Hou T.. PROTAC-DB 3.0: an updated database of PROTACs with extended pharmacokinetic parameters. Nucleic Acids Res. 2024; 10.1093/nar/gkae768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Davis A.P., Wiegers T.C., Sciaky D., Barkalow F., Strong M., Wyatt B., Wiegers J., McMorran R., Abrar S., Mattingly C.J.. Comparative toxicogenomics database's 20th anniversary: update 2025. Nucleic Acids Res. 2024; 10.1093/nar/gkae883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Duong Nguyen T.T., Tanoli Z., Hassan S., Özcan U.O., Caroli J., Kooistra A.J., Gloriam D.E., Hauser A.S.. PGxDB: an interactive web-platform for pharmacogenomics research. Nucleic Acids Res. 2024; 10.1093/nar/gkae1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Tremmel R., Zhou Y., Camara M.D., Laarif S., Eliasson E., Lauschke V.M.. PharmFreq: a comprehensive atlas of ethnogeographic allelic variation in clinically important pharmacogenes. Nucleic Acids Res. 2024; 10.1093/nar/gkae1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92. Liu Z., Shen S., Li C., Zhang C., Chen X., Fu Y., Yu T., Zhou R., Liu D., Yang Q.Y.et al.. SoIR: a comprehensive Solanaceae information resource for comparative and functional genomic study. Nucleic Acids Res. 2024; 10.1093/nar/gkae1040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Liu D., Luo C., Dai R., Huang X., Chen X., He L., Mao H., Li J., Zhang L., Yang Q.Y.et al.. AMIR: a multi-omics data platform for Asteraceae plants genetics and breeding research. Nucleic Acids Res. 2024; 10.1093/nar/gkae833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Liu L., Liu E., Hu Y., Li S., Zhang S., Chao H., Hu Y., Zhu Y., Chen Y., Xie L.et al.. ncPlantDB: a plant ncRNA database with potential ncPEP information and cell type-specific interaction. Nucleic Acids Res. 2024; 10.1093/nar/gkae1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. He S., Bing J., Zhong Y., Zheng X., Zhou Z., Wang Y., Hu J., Sun X.. PlantCircRNA: a comprehensive database for plant circular RNAs. Nucleic Acids Res. 2024; 10.1093/nar/gkae709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Hamilton J.P., Li C., Buell C.R.. The rice genome annotation project: an updated database for mining the rice genome. Nucleic Acids Res. 2024; 10.1093/nar/gkae1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Hawkins C., Xue B., Farida Y., Wyatt G., Zerbe P., Rhee S.Y.. Plant Metabolic Network 16: expansion of underrepresented plant groups and experimentally supported enzyme data. Nucleic Acids Res. 2024; 10.1093/nar/gkae991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Cox E., Tsuchiya M.T.N., Ciufo S., Torcivia J., Falk R., Anderson W.R., Holmes J.B., Hem V., Breen L., Davis E.et al.. NCBI taxonomy: enhanced access via NCBI datasets. Nucleic Acids Res. 2024; 10.1093/nar/gkae967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Ziaikin E., David M., Uspenskaya S., Niv M.Y.. BitterDB: 2024 update on bitter ligands and taste receptors. Nucleic Acids Res. 2024; 10.1093/nar/gkae1044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Kemmler E., Lemfack M.C., Goede A., Gallo K., Toguem S.M.T., Ahmed W., Millberg I., Preissner S., Piechulla B., Preissner R.. mVOC 4.0: a database of microbial volatiles. Nucleic Acids Res. 2024; 10.1093/nar/gkae961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Liu T., Hwang L., Burley S.K., Nitsche C.I., Southan C., Walters W.P., Gilson M.K.. BindingDB in 2024: a FAIR knowledgebase of protein-small molecule binding data. Nucleic Acids Res. 2024; 10.1093/nar/gkae1075. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES