The COVID-19 pandemic has driven demand for integrated genomics, resulting in fast-tracked development of AusTrakka, Australia’s pathogen genomics platform. This facilitated rapid data sharing, democratised access to computational and bioinformatic resources and expertise, and achieved national real-time genomic surveillance.
Subject terms: Infectious diseases, Policy and public health in microbiology, Comparative genomics, SARS-CoV-2, Viral epidemiology
The COVID-19 pandemic has accelerated the demand for near real-time analysis and dissemination of pathogen genomic data. In this comment, the authors describe how Australia has developed and rolled out its SARS-CoV-2 genomics platform, AusTrakka, and used it to support public health action.
The COVID-19 pandemic has highlighted the power of pathogen genomics to enhance infectious diseases surveillance and inform targeted public health responses1. Australia is a federation of states and territories, and public health laboratory activities have often been siloed between jurisdictional boundaries, hampering rapid genomic data sharing. This represents a major “missed opportunity” as pathogens do not respect borders and sharing of genomic information identifies and informs more rapid responses to infectious disease outbreaks2,3.
Recognising the significant opportunities afforded by pathogen genomics, there has been major government investments in the development and implementation of whole-genome sequencing (WGS), predominantly in high-income settings. The United Kingdom rapidly established the COVID-19 Genomics UK Consortium (COG-UK) and the United States government invested US$1.7 billion to expand genomics capacity for COVID-194. In Australia, the pandemic also highlighted the need for an integrated public health genomics surveillance system.
International examples of a genomics analysis platform
In 2010, the GenomeTrakr network, established by the US Food and Drug Administration, was the first large-scale experiment that facilitated real-time genomic data sharing directly between laboratories from foodborne pathogens on a national level5. Its successes in earlier detection of outbreaks demonstrated that genomic data sharing could inform public health interventions5.
In the United Kingdom, Public Health England (PHE) invested heavily in centralised sequencing capacity, launching the PHE Pathogen Genomics Service in 20146. The national scope of activities covered by PHE allowed for national level integration of genomic and epidemiological data which meant challenges were around harmonisation of information systems and rapid reporting of data to referral laboratories.
Canada, like Australia, has a complex health system whereby each Canadian Province has its own public health laboratory. To facilitate real-time data sharing the Integrated Infectious Disease Analysis (IRIDA) project (https://irida.ca/) was developed. The IRIDA software operates independently for each public health laboratory with an interface to communicate between IRIDA database. The open-source nature of this platform enabled laboratories to establish genomic epidemiology capacity, however, the ability for laboratories to operate the software independently meant that there was no guarantee for real-time data sharing.
Demand for national genomics surveillance in Australia
In 2015, the Communicable Diseases Genomics Network (cdgn.org.au), comprised of representatives from public health laboratories across all jurisdictions in Australia and New Zealand, was established with an overarching aim to implement pathogen genomics in public health. AusTrakka was first proposed by the CDGN in 2016 as a system that would facilitate consistency in genomic data sharing and reporting to improve public health surveillance and response. The AusTrakka platform was developed and deployed by the Microbiological Diagnostic Unit Public Health Laboratory and governed by the CDGN. This platform was built by public health laboratories, for public health laboratories, ensuring that the architecture and the bioinformatics pipelines and analysis protocols were created within a National Association of Testing Authorities (NATA)/International Organization for Standardization (ISO) accredited public health laboratory environment.
The COVID-19 pandemic was the catalyst for further development and deployment of AusTrakka in 2020, with key factors leading to fast-tracking including: (i) national funding for COVID-19 genomics sequencing and analysis through CDGN and AusTrakka; (ii) expert advocacy that led to Commonwealth recognition of the importance of real-time genomics data for surveillance; (iii) early demonstrations of the utility of genomics for COVID-19 from some jurisdictions - landmark events including the genomic identification of a hotel quarantine breach being the single source of a large second wave in Victoria, Australia, which catapulted genomics into the public domain;1,7,8 (iv) demand for investigation into ‘mystery cases’ detected despite restricted international travel, hotel quarantine policies and internal border closures; and (v) a strong desire for rapid detection and reporting of SARS-CoV-2 Variants of Concern in a nationally consistent way to inform public health control measures such as length of hotel quarantine and lockdowns. Subsequently, public and political demand for genomic investigation into essentially every case nationwide became an expectation that could only be met by rapid and open data sharing between health departments and public health laboratories. AusTrakka played a crucial role in facilitating genomic data sharing and providing national surveillance and ‘fine grain’ analyses, particularly into ‘mystery’ cases, revealing transmission within and between jurisdictions.
AusTrakka: Australia’s national genomics platform
AusTrakka is used by all public health laboratories in every jurisdiction in Australia and has extended to New Zealand. As public health laboratories upload new sequence data, the phylogenomic tree including all available sequences is updated. For SARS-CoV-2 a Maximum Likelihood tree is visualised, however the platform is able to visualise trees using any method. This tree is immediately accessible to public health laboratories to view their sequences in the national context. Given the sensitivity of this information, a specific framework for SARS-CoV-2 data sharing and analysis using AusTrakka was developed, formalising the endorsement from all jurisdictions to rapidly upload all SARS-CoV-2 sequence data to the AusTrakka platform. A multi-disciplinary and multi-jurisdictional AusTrakka team comprised of bioinformaticians, software engineers and genomic epidemiologists was established to support the development and operation (Fig. 1).
Public health laboratories had varying degrees of sequencing and bioinformatics capacity, however, through the deployment of AusTrakka it became clear that equitable access was important to ensure a comprehensive national dataset and reduce potential biases in the data. To facilitate this, the dedicated AusTrakka team provides analysis, interpretation, and reporting support to jurisdictions with limited capacity. Implementing this centralised and collaborative system ensured harmonisation of analyses, consistent national reporting, and provides all public health laboratories with the same computational resources and capability. At a national level, AusTrakka provides a focal point for integrated SARS-CoV-2 genomic surveillance information for public health agencies: outputs include reporting against genomics surveillance indicators as part of the Australian Government Communicable Diseases Intelligence Report, weekly reporting of Variants of Concern and investigation into multi-jurisdictional outbreaks (Fig. 1).
The AusTrakka development process revealed significant gaps and opportunities in the national public health system, reflecting Australia’s status as a federation. Public health in Australia is a complex multi-levelled system, with each jurisdiction largely responsible for their own public health service delivery, creating barriers to rapid genomic data sharing. Genomic data is generated by the public health laboratories and epidemiological metadata is collected by the health departments. Through the deployment of AusTrakka the significant benefits of integrated genomic data sharing were highlighted, however some limitations were also revealed, including the lack of interoperability between surveillance systems at all levels and the fact custodianship of genomic and epidemiological data is managed by separate systems.
A crucial element in the success of AusTrakka was its endorsement by, and compliance with, jurisdictional and federal governance structures for rapid data sharing between states and territories through a centralised platform. On 1 October 2020, AusTrakka was endorsed by the Australian Health Protection Principal Committee (AHPPC), the key decision-making committee for public health emergencies, however a key feature of this endorsement was ensuring access to the data was restricted to approved public health laboratories and agencies.
AusTrakka streamlines genomic and epidemiological data sharing and enables fine-grained access controls of data between public health laboratories. Within 12 months from endorsement, more than 33,000 high-quality SARS-CoV-2 sequences had been uploaded to AusTrakka, equal to approximately 50% of all COVID-19 cases in Australia, from over 80 users across all public health laboratories. AusTrakka has informed public health decision making throught demonstration of inter-jurisdictional transmissions resulting in internal border closures1, detection of Variants of Concern in real-time and in silico testing of primers and probes to identify target failures that may impact diagnostic tests (Fig. 1). The next stages of AusTrakka sees the real-time sharing and enhanced integration of epidemiological data on a national level and developing interoperability within existing national systems.
Conclusion
AusTrakka has transformed the way multi-jurisdictional infectious diseases outbreaks are investigated and managed in Australia, demonstrating that barriers to genomics data sharing and centralisation could be overcome. As genomics capacity continues to develop globally, AusTrakka has provided a roadmap for integrating pathogen genomics into public health systems. There are, however, significant differences in global access to genomics technologies, though the principles of robust data governance, and the need for systems for rapid data sharing and analysis, and equitable access to computational capacity to enable real-time pathogen genomics surveillance, universally applies. So far, AusTrakka has been used as a core public health tool, although it has potential to be a valuable resource for advancing genomics research that will be explored in future.
It is imperative that the learnings and achievements experienced during the pandemic are sustained for the ongoing modernisation of infectious diseases surveillance and control. AusTrakka is adapting the approach to COVID-19 for other pathogens for broader public health impact, similar to that of the COG-UK, who have plans to forward the experience to the UK Health Security Agency to inform the development of a national pathogen genomics service. Sustained investment and engagement are critical to take full advantage of this enhanced genomics surveillance capability for other nationally notifiable pathogens that require close monitoring and targeted responses to reduce the public health, social and economic impacts of infectious diseases.
Acknowledgements
The authors gratefully acknowledge the contributions from all Communicable Diseases Genomics Network (CDGN) laboratory members, the broader AusTrakka team in Victoria, New South Wales and Queensland and the AusTrakka development team. This work is funded by the Australian Government Department of Health (B.P.H.), the Victorian Government Department of Health (B.P.H.), a National Health and Medical Research Council Fellowship (GNT1196103) (B.P.H.) and the Medical Research Future Fund (FSPGN000049) (A.V.J., D.A.W., T.S. and B.P.H.). Extended COVID-19 Laboratory Members of the Communicable Diseases Genomics Network (CDGN) include: Microbiological Diagnostic Unit Public Health Laboratory, The University of Melbourne at the Peter Doherty Institute for Infection and Immunity; The Victorian Infectious Diseases Reference Laboratory, Royal Melbourne Hospital at the Peter Doherty Institute for Infection and Immunity; Centre for Infectious Diseases and Microbiology Laboratory Services, Institute of Clinical Pathology and Medical Research (ICPMR), NSW Health Pathology, Westmead Hospital, Westmead, New South Wales, Australia; Virology Research Laboratory, Serology and Virology Division, NSW Health Pathology, Prince of Wales Hospital, Randwick, NSW, Australia; Department of Infectious Disease and Microbiology, Royal Prince Alfred Hospital, Sydney, NSW, Australia; Public Health Microbiology and Virology, Forensic and Scientific Services, Queensland Department of Health, Brisbane, QLD, Australia; Public Health Laboratory, Microbiology and Infectious Diseases, SA Pathology, Adelaide, SA, Australia; Department of Microbiology, PathWest Laboratory Medicine, QEII Medical Centre, Perth, WA, Australia; Royal Hobart Hospital, Hobart, TAS, Australia; ACT Pathology, ACT, Australia; Research Biology School, Australian National University, Canberra, ACT, Australia; Pathology, Royal Darwin Hospital, Darwin, NT, Australia; The Institute of Environmental Science and Research, New Zealand.
Author contributions
T.S., A.GdS. and B.P.H. conceptualised the AusTrakka platform. T.H., A.GdS., A.V.J., D.A.W., B.P.H. and T.S. contributed to implementation and governance. T.H., T.S. and B.P.H. drafted the manuscript which was reviewed and updated by A.GdS., D.A.W. and A.V.J.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.
Data availability
No relevant data is available for this publication.
Code availability
No relevant code is available for this publication.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Benjamin P. Howden, Torsten Seemann.
References
- 1.Lane CR, et al. Genomics-informed responses in the elimination of COVID-19 in Victoria, Australia: an observational, genomic epidemiological study. Lancet Public Health. 2021;6:e547–e556. doi: 10.1016/S2468-2667(21)00133-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Ladner JT, Grubaugh ND, Pybus OG, Andersen KG. Precision epidemiology for infectious disease control. Nat. Med. 2019;25:206–11.. doi: 10.1038/s41591-019-0345-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Deng X, et al. Genomic surveillance reveals multiple introductions of SARS-CoV-2 into Northern California. Science. 2020;369:582. doi: 10.1126/science.abb9263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.The White House. Fact Sheet: Biden Administration Announces $1.7 Billion Investment to Fight COVID-19 Variantshttps://www.whitehouse.gov/briefing-room/statements-releases/2021/04/16/fact-sheet-biden-administration-announces-1-7-billion-investment-to-fight-covid-19-variants/ 16 April 2021.
- 5.Allard MW, et al. Practical value of food pathogen traceability through building a whole-genome sequencing network and database. J. Clin. Microbiol. 2016;54:1975–1983. doi: 10.1128/JCM.00081-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chattaway M. A. et al. The transformation of reference microbiology methods and surveillance for Salmonella with the use of whole genome sequencing in England and Wales. Front. Public Health7, 317 (2019). [DOI] [PMC free article] [PubMed]
- 7.Seemann T, et al. Tracking the COVID-19 pandemic in Australia using genomics. Nat. Commun. 2020;11:4376. doi: 10.1038/s41467-020-18314-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rockett RJ, et al. Revealing COVID-19 transmission in Australia by SARS-CoV-2 genome sequencing and agent-based modeling. Nat. Med. 2020;26:1398–404.. doi: 10.1038/s41591-020-1000-7. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No relevant data is available for this publication.
No relevant code is available for this publication.