Abstract
In this editorial, we first summarize the 2016 International Conference on Intelligent Biology and Medicine (ICIBM 2016) held on December 8–10, 2016 in Houston, Texas, USA, and then briefly introduce the ten research articles included in this supplement issue. At ICIBM 2016, a special theme, “Medical Informatics and Big Data,” was dedicated to the recent advances of data science in the medical domain. After peer review, ten articles were selected for this special issue, covering topics such as Knowledge and Data Personalization, Social Media Applications to Healthcare, Clinical Natural Language Processing, Patient Safety Analyses, and Data Mining Using Electronic Health Records.
Introduction
The 2016 International Conference on Intelligent Biology and Medicine (ICIBM 2016) was held December 8–10, 2016 at the Texas Medical Center, Houston, Texas, USA. ICIBM 2016 included four keynote lectures, four conference invited talks, four workshops/tutorials, eight scientific sessions, and a poster session. We received 77 original manuscripts and 53 abstracts, all of which had undergone peer-review. The scientific sessions, papers and abstracts covered a great variety of research topics in bioinformatics, systems biology, big data science, biomedical informatics, pharmacogenomics, and intelligent computing. A total of 22 trainees from diverse backgrounds across the USA and international institutes were selected for travel awards and fully participated in the conference. The conference also provided a venue to allow trainees and junior investigators to meet with established scientists and, through in-depth face-to-face interactions, exchange ideas and discoveries in their research.
Biomedical science is undergoing a big data revolution. Biomedical informatics plays an essential role in the full life cycle of data science, including data harvesting, extraction, integration, normalization, and analyses. At ICIBM 2016, a special theme, “Medical Informatics and Big Data,” was dedicated to the recent advances of data science in the medical domain. It was especially attractive to conference participants, as judged from paper/abstract submission and onsite discussions during the conference. Here, we present a summary of the 10 research articles that were selected for the ICIBM 2016 special session on Medical Informatics and Big Data. All papers underwent peer-review and revision before acceptance.
Summary of selected papers in the thematic issue
The 10 papers selected for this thematic issue were presented at ICIBM 2016. These papers cover a wide range of topics including Knowledge and Data Personalization [1, 2], Social Media Applications to Healthcare [3, 4], Clinical Natural Language Processing [5, 6], Patient Safety Analyses [7, 8], and Data Mining Using Electronic Health Records [9, 10].
As we are entering the big data era, it is important to enable data and knowledge personalization by facilitating the expression of user need, access to the relevant data based on this need, and interpretation of the data based on the user’s background [11]. Ru et al. evaluated clinician’s reading preferences regarding scientific literature, which can serve as useful information for developing a personalized literature-recommender system for clinicians who work at the front-line of patient care [1]. While Ru et al. focused on clinician users, Amith et al. focused on patients. Amith et al. developed an ontology-based approach for representing knowledge about HPV-causing cancer for patients [2]. Although focusing on cancer related knowledge, their approach can presumably be applied to any other domain for creating ontologies.
With the advent of social media and the rapidly growing volume of user-generated data on social media, there is an emergent interest in using social media data for healthcare related studies [12]. Du et al. developed a machine learning based sentiment analysis system to extract public opinions towards HPV vaccines from Twitter [3]. They further conducted time trend analyses on related tweets, which could potentially provide feedback to public health professionals to monitor online public responses, examine the effectiveness of their HPV vaccination promotion strategies, and adjust their promotion plans. Zhang et al. surveyed how the public use WeChat, the most widely and frequently used social media in China, to obtain health information [4]. This study predicted that WeChat could play on a central role in individualized health education in China.
Entity recognition is one of the most basic steps for natural language processing. Chen et al. developed a novel active learning (AL) algorithm and the first AL-enabled annotation system for clinical named entity recognition (NER) [5]. In contrast to simulation experiments, the user study here revealed that AL did not always save annotation time when compared with random sampling, indicating the need for developing querying algorithms that model the actual annotation time. In Liu et al., the authors leveraged recurrent neural network (RNN), one of the deep learning methods, for clinical entity recognition and protected health information recognition [6]. Their results show that deep learning outperforms traditional machine learning methods that suffer from fuzzy feature engineering.
Patient safety research endeavors to prevent and reduce harm to patients during the process of health care. Kang et al. developed a similarity-searching module that can dynamically measure the similarities of events in patient safety event reporting systems [7]. The similarity analysis of events is key to learning from retrospective analysis, promoting proactive identification and remediating safety events. The work presented by Cai et al. investigated the temporal variation of adverse effects of vaccines that provides a new angle for examining the data from adverse event reporting systems [8]. The application of informatics approaches to innovative design patient safety event reporting system holds promise in advancing safety event data analytics.
Exploiting the rich information in electronic health records is a mainstay of health informatics aimed at improving the quality of health care. The project detecting and quantifying new and redundant information over longitudinal clinical notes presented by Zhang et al. discussed an automated technique with statistical language models to improve data quality [9]. Kaewprag et al. [10] explored a predictive analysis of pressure ulcer in intensive care unit patient records, which helps clinicians identify relationships between risk factors associated with pressure ulcers.
Discussion and conclusion
In the big data era, the bottleneck in biomedical research has shifted from data collection to data management and analysis. Medical informatics, an interdisciplinary field that studies the effective uses of data, information, and knowledge for solving medical problems, plays an essential role in biomedical data science. As shown in this workshop, diverse types of data including social media, clinical documents, safety reports, and patient records have been studied. Moreover, new methods and algorithms (e.g., those involving NLP and predictive modeling) have been developed to process, integrate, and analyze these data into knowledge and actionable wisdom, thus to facilitate clinical research and to improve clinical practice. We envision rapid growth of the medical informatics field and hope to organize similar future events to promote medical informatics research in the big data era.
Acknowledgements
We thank the National Science Foundation (NSF grant IIS-1645823) for the financial support of ICIBM 2016.
Funding
Funding for the publication of this introduction came from the National Science Foundation (NSF grant IIS-1645823).
Authors’ contributions
All the authors contributed to the writing of the manuscript and have read and approved the final manuscript.
Competing interests
The authors declare that they have no competing interests.
About this supplement
This article has been published as part of BMC Medical Informatics and Decision Making Volume 17 Supplement 2, 2017: Selected articles from the International Conference on Intelligent Biology and Medicine (ICIBM) 2016: medical informatics and decision making. The full contents of the supplement are available online at https://bmcmedinformdecismak.biomedcentral.com/articles/supplements/volume-17-supplement-2.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Cui Tao, Email: Cui.Tao@uth.tmc.edu.
Yang Gong, Email: Yang.Gong@uth.tmc.edu.
Hua Xu, Email: Hua.Xu@uth.tmc.edu.
Zhongming Zhao, Email: Zhongming.Zhao@uth.tmc.edu.
References
- 1.Ru B, Wang X, Yao L. Evaluation of the informatician perspective: determining types of research papers preferred by clinicians. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0463-z. [DOI] [PMC free article] [PubMed]
- 2.Amith M, Song H-Y, Zhang Y, Tao C. Lightweight predicate extraction for patient-level cancer information and ontology development. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0465-x. [DOI] [PMC free article] [PubMed]
- 3.Du J, Xu J, Song H-Y, Tao C. Leveraging machine learning based approaches to assess human papillomavirus vaccination sentiment trends with twitter data. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0469-6. [DOI] [PMC free article] [PubMed]
- 4.Zhang X, Wen D, Lei J. How the public uses social media WeChat to obtain health information in china: a survey study. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0470-0. [DOI] [PMC free article] [PubMed]
- 5.Chen Y, Lasko T, Mei Q, et al. An active learning-enabled annotation system for clinical named entity recognition. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0466-9. [DOI] [PMC free article] [PubMed]
- 6.Liu Z, Wang X, Chen Q, Tang B. Entity recognition from clinical texts via recurrent neural network. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0468-7. [DOI] [PMC free article] [PubMed]
- 7.Kang H, Gong Y. Developing a Similarity Searching Module for Patient Safety Event Reporting System. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0467-8. [DOI] [PMC free article] [PubMed]
- 8.Cai Y, Du J, Huang J, Tao C, Chen Y. A signal detection method for temporal variation of adverse effect with vaccine adverse event reporting system data. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0472-y. [DOI] [PMC free article] [PubMed]
- 9.Zhang R, Pakhomov S, Arsoniadis E, Lee J, Wang Y, Melton G. Detecting clinically relevant new information in clinical notes across specialties and settings. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0464-y. [DOI] [PMC free article] [PubMed]
- 10.Kaewprag P, Newton C, Vermillion B, Hyun S, Huang K, Machiraju R. Predictive models for pressure ulcers from intensive care unit electronic health records using Bayesian networks. BMC Med Inform Decis Mak. 2017;17(S2). doi:10.1186/s12911-017-0471-z. [DOI] [PMC free article] [PubMed]
- 11.Koutrika G. Data Personalization. Data Management in Pervasive Systems: Switzerland: Springer; 2015:213–34.
- 12.Ventola CL. Social media and health care professionals: benefits, risks, and best practices. P T. 2014;39(7):491–520. [PMC free article] [PubMed] [Google Scholar]