Abstract
Background and methods
Systematic reviews, i.e., research summaries that address focused questions in a structured and reproducible manner, are a cornerstone of evidence-based medicine and research. However, certain steps in systematic reviews, such as data extraction, are labour-intensive, which hampers their feasibility, especially with the rapidly expanding body of biomedical literature. To bridge this gap, we aimed to develop a data mining tool in the R programming environment to automate data extraction from neuroscience in vivo publications. The function was trained on a literature corpus (n = 45 publications) of animal motor neuron disease studies and tested in two validation corpora (motor neuron diseases, n = 31 publications; multiple sclerosis, n = 244 publications).
Results
Our data mining tool, STEED (STructured Extraction of Experimental Data), successfully extracted key experimental parameters such as animal models and species, as well as risk of bias items like randomization or blinding, from in vivo studies. Sensitivity and specificity were over 85% and 80%, respectively, for most items in both validation corpora. Accuracy was above 90% and the F1-score above 0.9 for most items in the validation corpora. Time savings were above 99%.
Conclusions
Our text mining tool, STEED, can extract key experimental parameters and risk of bias items from the neuroscience in vivo literature. This enables the tool’s deployment for probing a field in a research improvement context or replacing one human reader during data extraction, resulting in substantial time savings and contributing towards the automation of systematic reviews.
Introduction
Synthesising evidence is an essential part of scientific progress [1]. To this end, systematic reviews—i.e. the rigorous identification, appraisal, and integration of all available evidence on a specific research question—have become a default tool in clinical research [2, 3]. Yet, they are also increasingly employed for preclinical in vivo research [4–7].
Systematic reviews allow the identification of trends that may be missed when reviewing individual, smaller studies, and add soundness to one’s conclusions. For this reason, the use of systematic reviews in animal research is an acknowledged aid to implementing the reduction, replacement, and refinement of animal experiments [8], e.g., by gaining knowledge without the use of new animal experiments or by improving the ethical position of animal research by increasing the value and reliability of research findings [9].
The process of manual evidence synthesis is highly laborious [10]. This problem is further compounded by the skyrocketing number of publications in the biomedical field [11], a number set to increase still further in the near future [12]. As a result, it becomes increasingly difficult to keep abreast of the published evidence, which in turn precludes evidence-based research [13]. Consequently, automation of the labour-intensive steps of a systematic review is warranted to optimize the value of published data in the age of information overload. One particularly labour-intensive systematic review task that would profit from automation is data extraction [14, 15], i.e., the manual retrieval of specific data from publications. To address these shortcomings, we set out to develop a text mining tool to automatically extract key study parameters from publications of animal research modelling motor neuron diseases and multiple sclerosis. Our endeavour focused on two key domains of experimental science: 1) disease model parameters such as animal models and species, and 2) risk of bias measures such as randomization or blinding.
Methods
Study protocol
The development of the text mining tool was part of a systematic review on neuroimaging findings in motor neuron disease animal models registered as prospective study protocol in the International Prospective Register of Systematic Reviews (PROSPERO, CRD42022373146).
Literature corpora
Three literature corpora were included in this study: one for training the text mining toolbox and two for its validation. The training corpus was identified by searching Medline via PubMed for animal motor neuron disease models using the search string: "motor neuron disease" OR motor neuron diseases [MeSH] OR "amyotrophic lateral sclerosis" OR "ALS" OR "MND" OR "SOD" and limiting the search to the publication year 2021. The two validation corpora were derived from two in-house systematic reviews: a systematic review on neuroimaging findings in motor neuron disease animal models [16] and a systematic review on neuroimaging findings in multiple sclerosis animal models [17].
Parameters to extract and development of text mining tool
We defined a priori the items of interest to extract, which fall into two domains. First, experimental parameters, including 1) animal species, 2) animal sex, 3) model disease, 4) number of experimental animals used, and 5–7) experimental outcomes, i.e., whether a respective study assessed behavioural, histological, or neuroimaging outcomes. Second, risk of bias items, including 1) implementation of any measure of randomization in the experimental setup, 2) any measure of blinding, 3) prior sample size calculation (power calculation), 4) a statement of whether the conducted animal experiments are in accordance with local animal welfare guidelines, 5) a statement of potential conflicts of interest, and 6) accordance with the ARRIVE guidelines [18]. This second domain also includes an item for the data availability statement, i.e., a statement of whether and where primary study data are available.
For each item of interest, we developed a library of regular expressions (RegEx) in the R programming environment. RegEx are character patterns that define specific text matches. This library was built by methodically gathering relevant words and phrases from the training corpus. Notably, only one study in our training corpus reported neuroimaging outcomes, prompting us to enrich our RegEx library with terms from another, unpublished animal systematic review. We aimed to minimize overfitting by avoiding hard-coded expressions, yet some unique terms were essential to include in the RegEx libraries.
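To illustrate, a minimal sketch of such a library in R follows; the patterns shown are simplified assumptions for illustration, not the actual STEED expressions (the full libraries are available in our repository):

```r
# Illustrative sketch of a RegEx library; patterns are simplified
# assumptions, not the actual STEED expressions.
regex_library <- list(
  randomization = "randomi[sz](ed|ation)|randomly (assign|allocat)",
  blinding      = "blind(ed|ing)|masked (assessment|observer|investigator)",
  species       = "\\b(mice|mouse|rats?|zebrafish|macaques?)\\b"
)
```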
Using the RegEx libraries, we created an R function to extract data from scientific papers. This process starts by converting PDFs to text using the ‘pdftools’ package and then applying the ‘stringr’ package to identify relevant RegEx patterns. The function segments each paper into sections (such as methods or results), strips the ‘references’ section, searches for matching RegEx patterns, and then aggregates these data into a dataframe. Each paper corresponds to one row in the dataframe, with columns representing the different extracted data points.
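A minimal sketch of this pipeline, assuming the hypothetical `regex_library` above, could look as follows; the published function additionally handles text cleaning and section parsing:

```r
library(pdftools)
library(stringr)

# Sketch of the extraction pipeline: PDF -> text -> RegEx matches -> dataframe.
extract_items <- function(pdf_paths, regex_library) {
  rows <- lapply(pdf_paths, function(path) {
    text <- paste(pdf_text(path), collapse = " ")  # all pages as one string
    hits <- vapply(regex_library, function(pattern) {
      str_detect(text, regex(pattern, ignore_case = TRUE))
    }, logical(1))
    data.frame(file = basename(path), t(hits))     # one row per paper
  })
  do.call(rbind, rows)
}
```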
The RegEx libraries and the R function were iteratively improved to maximize performance, based on a pre-defined threshold (see below). Both our RegEx libraries and the R function are available at: https://github.com/Ineichen-Group/Auto-STEED or on the Open Science Framework (OSF): https://osf.io/n8dz7/.
Assessment of text mining tool performance
Performance of our text mining function was gauged using the following metrics, with TP, TN, FP, and FN denoting true positives, true negatives, false positives, and false negatives, respectively:

$$\text{Sensitivity} = \frac{TP}{TP + FN}, \quad \text{Specificity} = \frac{TN}{TN + FP}, \quad \text{Precision} = \frac{TP}{TP + FP}$$

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \quad F_1 = \frac{2 \cdot \text{Precision} \cdot \text{Sensitivity}}{\text{Precision} + \text{Sensitivity}}$$

We used R to calculate these performance metrics.
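As a sketch, these metrics can be computed in R directly from the four counts; the function below simply implements the standard definitions given above:

```r
# Compute performance metrics from TP, TN, FP, and FN counts.
performance_metrics <- function(tp, tn, fp, fn) {
  sensitivity <- tp / (tp + fn)
  specificity <- tn / (tn + fp)
  precision   <- tp / (tp + fp)
  accuracy    <- (tp + tn) / (tp + tn + fp + fn)
  f1          <- 2 * precision * sensitivity / (precision + sensitivity)
  c(sensitivity = sensitivity, specificity = specificity,
    precision = precision, accuracy = accuracy, f1 = f1)
}
```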
All included literature corpora underwent dual and independent manual extraction of these parameters (WEZ, AEC, BVI), constituting the ‘gold standard’ for data extraction. We measured the mean extraction time for both the human and the automated extraction to gauge the time savings achieved by automation. As defined in the protocol, for development of the text mining function in the training set, automated extraction of an individual item was considered sufficiently accurate if it attained a sensitivity of at least 85% and a specificity of at least 80% (i.e., with a slightly higher sensitivity, as per recommendation by the ‘Systematic Living Information Machine’ [SLIM] consortium).
Results
General characteristics of literature corpora
We included three literature corpora with manual annotation by two trained and independent reviewers. The training corpus comprised 45 individual publications on motor neuron disease animal models from 2021. The validation sets included 31 publications on neuroimaging in motor neuron disease animal models and 244 publications on neuroimaging in multiple sclerosis animal models, with median publication years of 2014 and 2009, respectively (see S1 File).
The median reporting prevalence for experimental parameters was 85%, 95%, and 93% in the training corpus and the two validation corpora, respectively. Similarly, the median reporting prevalence for risk of bias items was 58%, 19%, and 20%, respectively. A detailed summary of the characteristics and reporting prevalence of the literature corpora is presented in Table 1.
Table 1. Characteristics of included literature corpora and reporting prevalence for parameters to extract.
| | Training corpus | Validation corpus 1 | Validation corpus 2 |
|---|---|---|---|
| Characteristics of eligible publications | | | |
| Topic | Motor neuron disease animal models | Neuroimaging in motor neuron disease animal models | Neuroimaging in multiple sclerosis animal models |
| Number of publications | 45 | 31 | 244 |
| Publication year, median (range) | 2021 (2021–2021) | 2014 (2004–2020) | 2009 (1985–2017) |
| Number of different journals | 35 | 22 | 72 |
| Reporting prevalence | | | |
| Experimental parameters: | | | |
| Species | 100% | 100% | 100% |
| Sex | 87% | 61% | 78% |
| Model | 100% | 100% | >99% |
| Outcome histology | 82% | 90% | 85% |
| Outcome behaviour | 73% | 42% | 61% |
| Outcome imaging | 2% | 100% | 100% |
| Risk of bias items: | | | |
| Randomization | 58% | 23% | 20% |
| Blinding | 47% | 19% | 32% |
| Animal welfare | 98% | 90% | 78% |
| Conflict of interest | 96% | 58% | 25% |
| Sample size calculation | 27% | 10% | 1% |
| ARRIVE guidelines | 29% | 0% | 1% |
| Data availability | 69% | 19% | 2% |
The interrater agreement was 85–95% for experimental parameters and 81–100% for risk of bias items in the training and validation corpora.
Architecture of text mining tool
Due to copyright restrictions on data mining from HTML, the tool was developed to extract data at the PDF level. Initially, the text mining function reads in PDFs of the relevant publications and converts them to text. This text is then cleaned of certain keywords, such as ‘random primer’, to reduce false positives for items we aim to extract, like randomization. Subsequently, the manuscript body is parsed into different sections (e.g., abstract, introduction, materials and methods) based on the appearance of specific RegEx, such as the heading ‘materials and methods’. Then, specific sections of the paper are mined for relevant regular expressions, using RegEx libraries tailored to each item to be extracted. More concretely, the function extracts the experimental parameters as well as some risk of bias items (randomization, blinding, and animal welfare statement) from the methods section, and the other risk of bias items from the entire manuscript (excluding the ‘references’ section). The mining pipeline is depicted in Fig 1. The tool can be accessed directly on GitHub at https://github.com/Ineichen-Group/Auto-STEED.
Fig 1. PDFs of full texts are imported into the R environment, converted to text, and cleaned. Subsequently, the text is parsed into different sections such as ‘materials and methods’ or ‘results’. Then, individual items to mine are extracted using custom-made RegEx libraries, and a data frame with the extracted items is created.
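To illustrate the cleaning and section-splitting steps, a simplified sketch follows; the ‘random primer’ filter and the heading patterns follow the description above, while all other details are assumptions:

```r
library(stringr)

# Remove keywords that cause false positives (e.g., for randomization).
clean_text <- function(text) {
  str_remove_all(text, regex("random primer", ignore_case = TRUE))
}

# Extract the methods section: text between a methods heading and the
# next major heading (simplified; real layouts vary across journals).
get_methods_section <- function(text) {
  str_extract(
    text,
    regex("(materials and methods|methods).*?(results|discussion|references)",
          ignore_case = TRUE, dotall = TRUE)
  )
}
```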
Performance metrics of STEED
In the training set, the text mining function was tuned toward a sensitivity of 85% and a specificity of 80% for each individual item. The specificity threshold was not attained for the items ‘sample size calculation’, ‘sex’, and ‘outcome behaviour’, which reached only 78%, 67%, and 50%, respectively, albeit with above-threshold sensitivity. Some items, such as accordance with the ARRIVE guidelines or whether a conflict-of-interest statement was included, reached a sensitivity close to 100%. Accuracy was above 90% and the F1-score above 0.9 for most items (Table 2).
Table 2. Summary of performance measures of STEED compared with manual human ascertainment.
| | Specificity | Sensitivity | Precision | Accuracy | F1-score |
|---|---|---|---|---|---|
| Training corpus (motor neuron diseases, n = 45) | | | | | |
| Species | NA | 96 | 100 | 96 | 0.98 |
| Sex | 67 | 85 | 94 | 82 | 0.89 |
| Disease model | NA | 96 | 100 | 96 | 0.98 |
| Outcome histology | 89 | 92 | 97 | 91 | 0.94 |
| Outcome behaviour | 50 | 97 | 84 | 84 | 0.90 |
| Outcome imaging | 96 | NA | NA | 96 | NA |
| Randomization | 84 | 96 | 89 | 91 | 0.93 |
| Blinding | 95 | 92 | 96 | 93 | 0.94 |
| Animal welfare | NA | 86 | 97 | 84 | 0.92 |
| Conflict of interest | 100 | 98 | 100 | 97 | 0.99 |
| Sample size calculation | 78 | 92 | 63 | 82 | 0.75 |
| ARRIVE guidelines | 100 | 100 | 100 | 100 | 1.00 |
| Data availability | 85 | 94 | 94 | 91 | 0.94 |
| Validation corpus 1 (motor neuron diseases, n = 31) | | | | | |
| Species | NA | 100 | 100 | 100 | 1.00 |
| Sex | 100 | 74 | 100 | 84 | 0.85 |
| Disease model | NA | 90 | 100 | 90 | 0.95 |
| Outcome histology | 100 | 96 | 100 | 97 | 0.98 |
| Outcome behaviour | 78 | 85 | 76 | 81 | 0.79 |
| Outcome imaging | NA | 100 | 100 | 100 | 1.00 |
| Randomization | 100 | 86 | 100 | 97 | 0.92 |
| Blinding | 100 | 89 | 100 | 97 | 0.94 |
| Animal welfare | 100 | 89 | 100 | 90 | 0.94 |
| Conflict of interest | 92 | 94 | 94 | 94 | 0.94 |
| Sample size calculation | 81 | 80 | 44 | 81 | 0.57 |
| ARRIVE guidelines | 100 | NA | NA | 100 | NA |
| Data availability | 96 | 83 | 83 | 94 | 0.83 |
| Validation corpus 2 (multiple sclerosis, n = 244) | | | | | |
| Species | NA | 75 | 100 | 75 | 0.86 |
| Sex | 76 | 83 | 93 | 82 | 0.88 |
| Disease model | NA | 87 | 100 | 88 | 0.93 |
| Outcome histology | 64 | 96 | 93 | 91 | 0.95 |
| Outcome behaviour | 66 | 91 | 81 | 82 | 0.86 |
| Outcome imaging | NA | 94 | 100 | 94 | 0.97 |
| Randomization | 93 | 81 | 75 | 90 | 0.78 |
| Blinding | 98 | 85 | 96 | 93 | 0.90 |
| Animal welfare | 86 | 80 | 95 | 82 | 0.87 |
| Conflict of interest | 96 | 97 | 90 | 97 | 0.93 |
| Sample size calculation | 94 | 100 | 27 | 97 | 0.43 |
| ARRIVE guidelines | 100 | 100 | 100 | 100 | 1.00 |
| Data availability | 100 | 80 | 80 | 100 | 0.80 |
Specificity, sensitivity, precision, and accuracy are denoted in percentages. For details regarding these measures, please see the Methods section. Pre-defined thresholds were a sensitivity of 85% and a specificity of 80%.
The mining function performed well on both validation corpora. In the motor neuron disease corpus, it achieved above-threshold specificity and sensitivity for most items, except for ‘outcome behaviour’ with slightly below-threshold specificity and ‘data availability’, ‘sample size calculation’, and ‘sex’ with slightly below-threshold sensitivity. In the multiple sclerosis validation corpus, additional items did not reach the specificity and sensitivity thresholds. However, accuracy and F1-score were above 90% and 0.9 for most items in the motor neuron disease validation corpus, and above 80% and 0.8 in the multiple sclerosis corpus, respectively (Table 2).
Time savings automated versus manual extraction
Mean time for manual extraction was 12 (± standard deviation: 8), 13 (± 7), and 15 (± 11) minutes per publication and per human reader for the training corpus and the two validation corpora, respectively. This amounts to totals of 540, 403, and 3660 minutes for one reader for the three corpora, respectively. In contrast, the mining function required 0.3 seconds per record, amounting to 0.23, 0.15, and 1.22 minutes for the three corpora. Thus, the text mining function provides time savings above 99%.
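As a worked example for the training corpus, the reported saving follows directly from these numbers:

```r
# Training corpus: 45 papers, 12 min per paper manually vs. 0.3 s automated.
manual_min    <- 45 * 12        # 540 minutes
automated_min <- 45 * 0.3 / 60  # 0.225 minutes (~0.23 as reported)
(1 - automated_min / manual_min) * 100  # time saving in percent, > 99%
```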
Reporting of items on abstract versus full text level
For the experimental parameters, we quantified how commonly the respective items were reported in the abstract in addition to the full text. Disease models and species, as well as outcome measures, were commonly reported at the abstract level in all three literature corpora, with reporting frequencies between 95% and 100%. However, animal sex was only rarely reported, with reporting frequencies between 0% and 5%.
Discussion
Main findings
We developed STEED (STructured Extraction of Experimental Data), an R-based text mining tool designed to automatically extract key experimental details, such as animal models and species, and risk of bias factors like randomization or blinding, from preclinical in vivo studies. The tool demonstrated high sensitivity, specificity, and accuracy for extracting most items across two validation literature corpora. These corpora included one in a field similar to the training set (motor neuron diseases) and another in a different area (multiple sclerosis), both encompassing older publications as well. The use of STEED substantially reduced the time required to extract these data.
Findings in the context of existing evidence
STEED performed well on literature corpora outside the field in which it was developed, as well as on corpora with older publication years; i.e., it was developed on a corpus covering the motor neuron disease literature and performed well on a corpus of the multiple sclerosis literature. Thus, our function could be applied to literature bodies of other research fields. However, adapting STEED to new disciplines requires some consideration: while the tool has shown flexibility across related fields, creating discipline-specific versions may necessitate refining the underlying RegEx libraries to accurately capture the distinct experimental parameters pertinent to each field. This process would involve collaborative efforts with domain experts to ensure the tool’s precision, followed by validation [19]. Consequently, while separate packages for each discipline are conceivable, they would require some adaptation effort to maintain STEED’s standards of accuracy and utility.
Although STEED showed relatively high performance, it is not yet ready for evaluating individual publications and cannot completely replace manual data extraction. Nevertheless, this automated tool has two practical applications: first, it can be applied to large reference libraries (over 1000 records) to survey specific fields for experimental parameters and potential biases [20]. Second, STEED can replace one human reviewer during data extraction, e.g., for a systematic review, which would still yield substantial labour savings [15, 21]. Any discrepancies between human and machine analysis can then be manually reviewed for accuracy.
Similar approaches have been leveraged to extract specific information—such as the study population, intervention, outcomes measured, and risk of bias—from abstracts [22] or full texts [20, 23]. Bahor and colleagues developed a text mining function on a literature body of stroke animal models able to extract certain risk of bias items, including randomization, blinding, and sample size calculation [24]. The achieved accuracy was between 67–86% for randomization (our approach: 90–97%), 91–94% for blinding (our approach: 93–97%), and 96–100% for sample size calculation (our approach: 81–97%). Thus, our tool shows similar performance metrics and complements this former tool by extracting additional risk of bias items, such as a conflict-of-interest statement, accordance with local animal welfare regulations, a data availability statement, and accordance with the ARRIVE guidelines [18]. Another text mining toolbox, based on natural language processing (NLP), was developed by Zeiss and colleagues [22]: it extracts data such as species, model, genes, or outcomes from PubMed abstracts with F1-scores between 0.75 and 0.95.
For many tasks, NLP models seem to outperform RegEx-based text mining [11, 25]. Yet they are more complex and labour-intensive to develop and deploy, and thus only warrant application in more complex extraction tasks. Wang and colleagues tested the performance of a variety of models, such as convolutional neural networks, for extracting risk of bias items from preclinical studies [20]. These models outperformed RegEx-based methods for four risk of bias items, with F1-scores between 0.47 and 0.91. The validity of NLP for such tasks has also been corroborated by SciScore—a proprietary NLP tool that automatically evaluates the compliance of publications with six rigour items taken from the MDAR framework and other guidelines [23]. These items mostly relate to risk of bias, including compliance with animal welfare regulations, blinding/randomisation, and prior sample size calculation, as well as other items such as organism or animal sex. SciScore was developed on a training corpus of PubMed open access articles. In contrast, our approach was developed on preclinical neuroscience corpora and is thus more tailored to this field. Additionally, techniques involving generative large language models like GPT have been explored to automate data extraction for systematic reviews [26]. While these methods show promise, they require further evaluation to establish reliability. Current findings indicate that such models may extract incorrect data [27]. Furthermore, these models often face challenges in extracting key information and tend to be more prone to errors, especially when summarizing extensive text.
While our original plan included extracting the number of animals used in studies, we had to abandon this objective due to the highly heterogeneous ways these numbers are reported—such as in the methods/results sections, tables, figure legends, or graphs, or only separately for experimental and control groups. A possible approach to address this issue could be to treat it as an NLP categorization task, classifying studies into small (for instance, fewer than 10 animals), medium (10–50 animals), and large (more than 100 animals) groups.
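As a hypothetical sketch of such a categorization step, binning extracted animal numbers is straightforward once they are available; the cut-offs below are illustrative assumptions based on the examples above:

```r
# Bin extracted animal numbers into small/medium/large groups
# (illustrative cut-offs; the text suggests <10, 10-50, and >100).
n_animals <- c(8, 25, 120)
cut(n_animals, breaks = c(0, 10, 50, Inf),
    labels = c("small", "medium", "large"))
```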
Limitations
Firstly, our method was developed and tested specifically for preclinical neuroscience research. Its effectiveness in other areas, such as in vivo cancer studies, is yet to be determined. Secondly, our tool relies on full-text PDFs for data extraction. While extracting data from online versions of publications (HTML format) could solve problems related to PDF conversion, such as inconsistent layouts and varying journal formats, current copyright regulations and the need for costly licenses make this challenging [28]. Lastly, while our automated approach offers substantial time savings compared to manual data extraction, this does not take into account the time needed to verify the results of the automated process.
Conclusions
Our developed text mining tool STEED is able to extract key risk of bias items and experimental parameters from the neuroscience in vivo literature. Accelerating the usually labour-intensive data extraction during a systematic review contributes towards automation of systematic reviews.
Supporting information
Acknowledgments
We thank Robert Wyatt from Matching Mole for help with data analysis.
Data Availability
The datasets generated and/or analysed during the current study are available on the Open Science Framework (OSF): https://osf.io/n8dz7/. All included publications to develop and validate the data mining tool are reported in the supplementary reference list.
Funding Statement
This work was supported by grants of the Swiss National Science Foundation (No. 407940_206504, to BVI), the UZH Alumni (to BVI), and the Intramural Research Program of NINDS. We thank all our funders for their support. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
1. Nakagawa S, Dunn AG, Lagisz M, Bannach-Brown A, Grames EM, Sánchez-Tójar A, et al. A new ecosystem for evidence synthesis. Nature Ecology & Evolution. 2020;4(4):498–501. doi: 10.1038/s41559-020-1153-2
2. Egger M, Higgins JP, Smith GD. Systematic reviews in health research: Meta-analysis in context. John Wiley & Sons; 2022.
3. Higgins JP, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, et al. Cochrane handbook for systematic reviews of interventions. John Wiley & Sons; 2019.
4. Soliman N, Rice AS, Vollert J. A practical guide to preclinical systematic review and meta-analysis. Pain. 2020;161(9):1949. doi: 10.1097/j.pain.0000000000001974
5. Ritskes-Hoitinga M, Pound P. The role of systematic reviews in identifying the limitations of preclinical animal research, 2000–2022: part 1. Journal of the Royal Society of Medicine. 2022;115(5):186–92. doi: 10.1177/01410768221093551
6. Ioannidis JP. Systematic reviews for basic scientists: a different beast. Physiological Reviews. 2022;103(1):1–5. doi: 10.1152/physrev.00028.2022
7. Bahor Z, Liao J, Currie G, Ayder C, Macleod M, McCann SK, et al. Development and uptake of an online systematic review platform: the early years of the CAMARADES Systematic Review Facility (SyRF). BMJ Open Science. 2021;5(1):e100103. doi: 10.1136/bmjos-2020-100103
8. Ritskes-Hoitinga M, van Luijk J. How can systematic reviews teach us more about the implementation of the 3Rs and animal welfare? Animals. 2019;9(12):1163. doi: 10.3390/ani9121163
9. Macleod M, Mohan S. Reproducibility and rigor in animal-based research. ILAR Journal. 2019;60(1):17–23. doi: 10.1093/ilar/ilz015
10. Borah R, Brown AW, Capers PL, Kaiser KA. Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open. 2017;7(2):e012545. doi: 10.1136/bmjopen-2016-012545
11. Ineichen BV, Rosso M, Macleod MR. From data deluge to publomics: How AI can transform animal research. Lab Anim (NY). 2023;52(10):213–4. doi: 10.1038/s41684-023-01256-4
12. Bornmann L, Mutz R. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. Journal of the Association for Information Science and Technology. 2015;66(11):2215–22.
13. Ioannidis JP. Extrapolating from animals to humans. Science Translational Medicine. 2012;4(151):151ps15. doi: 10.1126/scitranslmed.3004631
14. Bannach-Brown A, Hair K, Bahor Z, Soliman N, Macleod M, Liao J. Technological advances in preclinical meta-research. BMJ Open Science. 2021;5(1):e100131. doi: 10.1136/bmjos-2020-100131
15. Marshall IJ, Johnson BT, Wang Z, Rajasekaran S, Wallace BC. Semi-automated evidence synthesis in health psychology: current methods and future prospects. Health Psychology Review. 2020:1–35. doi: 10.1080/17437199.2020.1716198
16. Cannon AE, Zürrer WE, Zejlon C, Kulcsar Z, Lewandowski S, Piehl F, et al. Neuroimaging findings in preclinical amyotrophic lateral sclerosis models—How well do they mimic the clinical phenotype? A systematic review. Frontiers in Veterinary Science. 2023;10:1135282. doi: 10.3389/fvets.2023.1135282
17. Ineichen BV, Sati P, Granberg T, Absinta M, Lee NJ, Lefeuvre JA, et al. Magnetic resonance imaging in multiple sclerosis animal models: A systematic review, meta-analysis, and white paper. NeuroImage: Clinical. 2020:102371. doi: 10.1016/j.nicl.2020.102371
18. Percie du Sert N, Hurst V, Ahluwalia A, Alam S, Avey MT, Baker M, et al. The ARRIVE guidelines 2.0: Updated guidelines for reporting animal research. Journal of Cerebral Blood Flow & Metabolism. 2020;40(9):1769–77.
19. Mohammadi E, Karami A. Exploring research trends in big data across disciplines: A text mining analysis. Journal of Information Science. 2022;48(1):44–56.
20. Wang Q, Liao J, Lapata M, Macleod M. Risk of bias assessment in preclinical literature using natural language processing. Research Synthesis Methods. 2021. doi: 10.1002/jrsm.1533
21. Marshall IJ, Wallace BC. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Systematic Reviews. 2019;8(1):163. doi: 10.1186/s13643-019-1074-9
22. Zeiss CJ, Shin D, Vander Wyk B, Beck AP, Zatz N, Sneiderman CA, et al. Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research. PLoS ONE. 2019;14(12):e0226176. doi: 10.1371/journal.pone.0226176
23. Menke J, Roelandse M, Ozyurt B, Martone M, Bandrowski A. The Rigor and Transparency Index quality metric for assessing biological and medical science methods. iScience. 2020;23(11):101698. doi: 10.1016/j.isci.2020.101698
24. Bahor Z, Liao J, Macleod MR, Bannach-Brown A, McCann SK, Wever KE, et al. Risk of bias reporting in the recent animal focal cerebral ischaemia literature. Clinical Science. 2017;131(20):2525–32. doi: 10.1042/CS20160722
25. Wang Q, Hair K, Macleod MR, Currie G, Bahor Z, Sena E, et al. Protocol for an analysis of in vivo reporting standards by journal, institution and funder. OSF preprint. 2021. https://osf.io/preprints/metaarxiv/cjxtf/
26. Khraisha Q, Put S, Kappenberg J, Warraitch A, Hadfield K. Can large language models replace humans in the systematic review process? Evaluating GPT-4’s efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages. arXiv preprint arXiv:2310.17526. 2023.
27. Tang L, Sun Z, Idnay B, Nestor JG, Soroush A, Elias PA, et al. Evaluating large language models on medical evidence summarization. npj Digital Medicine. 2023;6(1):158.
28. Daily briefing: Space-junk spear, depression drug and the EU’s digital copyright. Nature. 2019. doi: 10.1038/d41586-019-00614-y