Skip to main content
BMC Bioinformatics logoLink to BMC Bioinformatics
. 2011 Nov 30;12(Suppl 13):S1. doi: 10.1186/1471-2105-12-S13-S1

Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference

Shoba Ranganathan 1,2,, Christian Schönbach 3, Janet Kelso 4, Burkhard Rost 5, Sheila Nathan 6,7, Tin Wee Tan 2
PMCID: PMC3278825  PMID: 22372736

Abstract

The 2011 International Conference on Bioinformatics (InCoB) conference, which is the annual scientific conference of the Asia-Pacific Bioinformatics Network (APBioNet), is hosted by Kuala Lumpur, Malaysia, is co-organized with the first ISCB-Asia conference of the International Society for Computational Biology (ISCB). InCoB and the sequencing of the human genome are both celebrating their tenth anniversaries and InCoB’s goalposts for the next decade, implementing standards in bioinformatics and globally distributed computational networks, will be discussed and adopted at this conference. Of the 49 manuscripts (selected from 104 submissions) accepted to BMC Genomics and BMC Bioinformatics conference supplements, 24 are featured in this issue, covering software tools, genome/proteome analysis, systems biology (networks, pathways, bioimaging) and drug discovery and design.

Introduction

InCoB (International Conference on Bioinformatics), the official conference of the Asia-Pacific Bioinformatics Network (APBioNet) [1] is celebrating its 10th anniversary this year, as a joint conference with the first ISCB-Asia meeting of the International Society for Computational Biology (ISCB) [2], at Kuala Lumpur, Malaysia. Since the first 2002 meeting in Bangkok, Thailand, InCoB serves as one of the largest bioinformatics conferences in the Asia-Pacific region, publishing submissions as research papers in conference supplements of international PubMed-indexed open-access impact factor journals, since 2006.

As InCoB’s 10th anniversary coincides with that of the human genome, the results of the systematic genome-wide sequencing applied to medical purposes, are appearing in the literature according to Collins [3], although Venter [4] believes we still have a long way to go before genome sequencing reaches its full potential. While genomic data is reaching tsunami proportions [5], its clinical applications are seen as a “slowly rising tide” [6]. Perhaps, as Trelles et al. [7] suggest, we are not yet ready for Big Data science. As succinctly summarized by Pennisi [8], while sequencing technologies have become more and more affordable, the challenges of storing, comparing and analyzing the data appear to persist, despite computational solutions proposed by Schadt et al. [5] and Zhou et al. [9].

We see these issues as challenges for the next decade, with cloud [10] and grid computing (reviewed in the first InCoB2006 publication [11]), gearing up for the data deluge, and data interchange standards becoming better established and adopted. In the Asia-Pacific, large scientific consortia are addressing personal genomic questions of local interest, such as the Pan Asian SNP initiative [12], which has provided a possible route for human migration into Asia [13]. We have to figure out how to build the resources for hosting Big Data in our own regions, with well organized and structured access to this Big Data, as a first step. Concurrently, with pre-existing computational resources are already available to our researchers, we need to motivate our researchers to ask the right questions of this Big Data and generate meaningful results.

For InCoB/ISCB-Asia 2011, we have therefore introduced dedicated sessions in Standards in Bioinformatics, following a keynote address on Biocuration by Gaudet and BioCloud/Grid Computing for Sharing Bioinformatics Resources. From APBioNet, we will present the Minimum Information About a Bioinformatics Investigation initiative (MIABi) [14] as well as a status update on BioDB100, the 100 MIABI-compliant BioDatabases initiative. We will also launch our BioSW100, the 100 MIABi-compliant BioSoftware initiative and invite the community to contribute to these ongoing projects, for provide Big Data in standardized format for developing distributed workflows that are grid- and cloud-enabled, to bring “bioinformatics to the bedside” a step closer to reality. We also noticed that since InCoB2008, accepted papers have focused on identifying target disease genes using networks, pathways and systems biology approaches as well as drug design and discovery, enabling translational bioinformatics.

Submissions and review for InCoB/ISCB-Asia 2011

Of the 104 submissions received this year, we accepted 24 articles for BMC Bioinformatics (this issue), 25 for BMC Genomics [15] and four for Immunome Research [16], an independent bioinformatics-driven immunology journal. Details of the reviewing process are presented in the BMC Genomics introduction article [15], with at least three reviews for each submission (see Additional File 1 of ref. [15] for a list of reviewers) and in the majority of the acceptable papers going through two rounds of reviews. The submitted articles originated from 19 countries with East Asia, South-East Asia and South Asia accounting for 83% of the submissions and 82% of the acceptances (details in Additional File 2 of ref. [15]), reinforcing the strong regional support for InCoB and ICSB-Asia from the region.

The challenges of developing bioinformatics research tools and applying them to the areas of genome and proteome analysis, systems biology (networks, pathways and bioimaging) and structure-based drug design and discovery are presented in this issue.

Software tools

Firdaus-Raih et al. [17] have a novel graph theoretical method to identify highly stable base triplets in RNA structures, while Benso et al. [18] have proposed simple decision rules in R, to classify gene expression data. PTIGS-IdIt [19] provides a novel approach for plant species identification using DNA barcoding technology, with HabiSign [20] for habitat-specific metagenome analysis. A webserver for predicting dinucleotide-specific RNA-binding sites is presented by Fernandez et al. [21], while PB1-F2 Finder by DeLuca et al. [22] can scan influenza viral sequences for specific RNA encoding regions. Protein analysis methods include support vector machine (SVM) models to predict RNA-binding residues (Choi and Han [23]) and to differentiate between carboxylation and non-carboxylation sites (Lee et al. [24]) while Nair et al. [25] have combined several machine learning approaches to predict amyloidogenic regions.

Genome and proteome analysis

Kim et al. [26] have evaluated the performance of several matrix factorization methods for clustering gene expression data, while Mallek et al. [27] have compared the efficacy of four predictive models for estimating chlorophyll-a concentrations in environmental samples. Choi et al. [28] have used molecular dynamics to predict the functionality of a hypothetical pathogen protein.

Systems biology: pathways, networks and imaging

Networks of biomolecules and pathways provide a deep understanding of the mode of action of biological systems. Poirel et al. [29] report a network approach to function enrichment. Soh et al. [30] have identified disease subnetworks from gene expression data. While Hsu et al. [31] have explored consistency in gene interaction networks, Rajapakse and Mundra [32] have proposed models for estimating the stability of gene interaction networks. Lee et al. [33] present an application of protein interaction networks to neurological disorders and Liu et al. [34] have proposed a possible initiation of the Wnt signaling pathway using a conformational simulation approach.

Bioimaging captures biological processes in real-time. Du et al. [35] have demonstrated that automated cell cycle phase classification can be applied to monitor in vivo cellular processes, while Veronika et al. [36] have correlated membrane dynamics with cell motility.

Structure-based drug design and discovery

With the availability of 3D structures for drug targets, Grover et al. [37] have proposed a possible mechanism for the action of the herbal drug, withaferin A, used in the treatment of herpes simplex virus, Tambunan et al. [38] explored modifications improve the efficacy of a known histone deacetylase inhibitor of the oncogenic human papilloma virus, while Lim et al. [39] have used virtual screening to identify candidate drug molecules for dengue virus methyl transferase. Khanna and Ranganathan [40] have proposed a novel set of antiparasitic compounds using an SVM approach.

Conclusion

We are encouraged by the robust support for InCoB and ISCB-Asia, arising from the strong representation from the region in the accepted papers and posters. We believe the region is well poised to exploit the latest technological advances in high-throughput sequencing, data dissemination as well as computational analyses, to usher in an era of personalized medicine. To ensure that these activities are compliant with international standards, we have included biocuration and standards as a new initiative in InCoB/ISCB-Asia 2011 and will provide updates on APBioNet’s BioDB100 and BioSW100 projects at InCoB2012.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

SR and CS (Program Committee Co-chairs) wrote the introduction and managed the review and editorial processes. CS, SR, JK (Chair, ISCB Conferences Committee), BR, TWT and SN (Conference Chair) jointly contributed to the scientific program development and its implementation. TWT supported the post-acceptance manuscript processing.

Contributor Information

Shoba Ranganathan, Email: shoba.ranganathan@mq.edu.au.

Christian Schönbach, Email: schoen@bio.kyutech.ac.jp.

Janet Kelso, Email: kelso@eva.mpg.de.

Burkhard Rost, Email: rost@in.tum.de.

Sheila Nathan, Email: sheila@ukm.my.

Tin Wee Tan, Email: tinwee@bic.nus.edu.sg.

Acknowledgements

The Program Committee, Local Organizing Committee and additional reviewers have delivered an excellent conference, with their efforts and time. We gratefully acknowledge Prof. Rofina Yasmin Othman (Under Secretary, MOSTI), Dr. Amir Feisal Merican bin Aljunid Merican (MOSTI), Dr. Mohd Basyaruddin Bin Abdul Rahman (MOSTI), Dr. Suhaimi Napis (iDEC), Dr. M. Shahir Shamsir Omar (UTM) and Dr. M. Firdaus-Raih (UKM) for their support, Ms. BJ Morrison McKay (Executive Officer, ISCB) for her advice and conference promotion support, and Ms. Kalaivani Nadarajah for manning the conference secretariat. We thank the ISCB Board Members, Drs. Reinhard Schneider, Scott Markel and Paul Horton for their time and energy during the planning phase. CS, SR and SN acknowledge the support of Kyushu Institute of Technology, Macquarie University and Universiti Kebangsaan Malaysia, respectively. Last but not least, we are very grateful to BioMed Central for their continued publication and material support.

This article has been published as part of BMC Bioinformatics Volume 12 Supplement 13, 2011: Tenth International Conference on Bioinformatics – First ISCB Asia Joint Conference 2011 (InCoB/ISCB-Asia 2011): Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2105/12?issue=S13.

References

  1. The Asia-Pacific Bioinformatics Network. http://www.apbionet.org
  2. The International Society for Computational Biology. http://www.iscb.org
  3. Collins FS. Genome-sequencing anniversary. Faces of the genome. Science. 2011;331:546. doi: 10.1126/science.1202894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Venter JC. Genome-sequencing anniversary. The human genome at 10: successes and challenges. Science. 2011;331:546–7. doi: 10.1126/science.1202812. [DOI] [PubMed] [Google Scholar]
  5. Schadt EE, Linderman MD, Sorenson J, Lee L, Nolan GP. Computational solutions to large-scale data management and analysis. Nat Rev Genet. 2010;11:647–57. doi: 10.1038/nrg2857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Marshall E. Human genome 10th anniversary. Waiting for the revolution. Science. 2011;331:526–9. doi: 10.1126/science.331.6017.526. [DOI] [PubMed] [Google Scholar]
  7. Trelles O, Prins P, Snir M, Jansen RC. Big data, but are we ready? Nat Rev Genet. 2011;12:224. doi: 10.1038/nrg2857-c1. [DOI] [PubMed] [Google Scholar]
  8. Pennisi E. Human genome 10th anniversary. Will computers crash genomics? Science. 2011;331:666–8. doi: 10.1126/science.331.6018.666. [DOI] [PubMed] [Google Scholar]
  9. Zhou Y, Liepe J, Sheng X, Stumpf MP, Barnes C. GPU accelerated biochemical network simulation. Bioinformatics. 2011;27:874–6. doi: 10.1093/bioinformatics/btr015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Schadt EE, Linderman MD, Sorenson J, Lee L, Nolan GP. Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology. Nat Rev Genet. 2011;12:224. doi: 10.1038/nrg2857-c2. [DOI] [PubMed] [Google Scholar]
  11. Konagaya A. Trends in life science grid: from computing grid to knowledge grid. BMC Bioinformatics. 2006;7(Suppl 5):S10. doi: 10.1186/1471-2105-7-S5-S10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ngamphiw C, Assawamakin A, Xu S, Shaw PJ, Yang JO, Ghang H, Bhak J, Liu E, Tongsima S. HUGO Pan-Asian SNP Consortium. PanSNPdb: the Pan-Asian SNP genotyping database. PLoS One. 2011;6(6):e21451. doi: 10.1371/journal.pone.0021451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. The HUGO Pan-Asian SNP Consortium. Mapping human genetic diversity in Asia. Science. 2009;326:1541–5. doi: 10.1126/science.1177074. [DOI] [PubMed] [Google Scholar]
  14. Tan TW, Tong JC, De Silva M, Lim KS, Ranganathan S. Advancing standards for bioinformatics activities: persistence, reproducibility, disambiguation and Minimum Information about a Bioinformatics Investigation (MIABi) BMC Genomics. 2010;11(Suppl 4):S27. doi: 10.1186/1471-2164-11-S4-S27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Schönbach C, Nathan S, Tan TW, Ranganathan S. InCoB celebrates its tenth anniversary as first joint conference with ISCB-Asia. BMC Genomics. 2011;12(Suppl 3):S1. doi: 10.1186/1471-2164-12-S3-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Immunome Research. http://immunome-research.net/
  17. Firdaus-Raih M, Harrison AM, Willett P, Artymiuk PJ. Novel base triples in RNA structures revealed by graph theoretical searching methods. BMC Bioinformatics. 2011;12(Suppl 13):S2. doi: 10.1186/1471-2105-12-S13-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Benso A, Di Carlo S, Politano G, Savino A, Hafeezurrehman H. Building gene expression profile classifiers with a simple and efficient rejection option in R. BMC Bioinformatics. 2011;12(Suppl 13):S3. doi: 10.1186/1471-2105-12-S13-S3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Liu C, Liang D, Gao T, Pang X, Song J, Yao H, Han J, Liu Z, Guan X, Jiang K, Li H, Chen S. PTIGS-IdIt, a system for species identification by DNA sequences of the psbA-trnH intergenic spacer region. BMC Bioinformatics. 2011;12(Suppl 13):S4. doi: 10.1186/1471-2105-12-S13-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ghosh TS, Mohammed MH, Rajasingh H, Chadaram S, Mande SS. HabiSign: a novel approach for comparison of metagenomes and rapid identification of habitat-specific sequences. BMC Bioinformatics. 2011;12(Suppl 13):S9. doi: 10.1186/1471-2105-12-S13-S9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fernandez M, Kumagai Y, Standley DM, Sarai A, Mizuguchi K, Ahmad S. Prediction of dinucleotide-specific RNA-binding sites in proteins. BMC Bioinformatics. 2011;12(Suppl 13):S5. doi: 10.1186/1471-2105-12-S13-S5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. DeLuca DS, Keskin DB, Zhang GL, Reinherz EL, Brusic V. PB1-F2 Finder: scanning influenza sequences for PB1-F2 encoding RNA segments. BMC Bioinformatics. 2011;12(Suppl 13):S6. doi: 10.1186/1471-2105-12-S13-S6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Choi S, Han K. Prediction of RNA-binding amino acids from protein and RNA sequences. BMC Bioinformatics. 2011;12(Suppl 13):S7. doi: 10.1186/1471-2105-12-S13-S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Lee TY, Lu CT, Chen SA, Bretaña NA, Cheng TH, Su MG, Huang KY. Investigation and identification of protein γ-glutamyl carboxylation sites. BMC Bioinformatics. 2011;12(Suppl 13):S10. doi: 10.1186/1471-2105-12-S13-S10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Nair SSK, Reddy NVS, Hareesha KS. Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic. BMC Bioinformatics. 2011;12(Suppl 13):S21. doi: 10.1186/1471-2105-12-S13-S21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kim MH, Seo HJ, Joung JG, Kim JH. Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data. BMC Bioinformatics. 2011;12(Suppl 13):S8. doi: 10.1186/1471-2105-12-S13-S8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Malek S, Ahmad SMS, Singh SKK, Milow P, Salleh A. Assessment of predictive models for chlorophyll-a concentration of a tropical lake. BMC Bioinformatics. 2011;12(Suppl 13):S12. doi: 10.1186/1471-2105-12-S13-S12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Choi SB, Normi YM, Wahab HA. Revealing the functionality of the hypothetical protein KPN00728 from Klebsiella pneumoniae MGH78578: molecular dynamics simulation approaches. BMC Bioinformatics. 2011;12(Suppl 13):S11. doi: 10.1186/1471-2105-12-S13-S11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Poirel CL, Owens CC III, Murali TM. Network-based functional enrichment. BMC Bioinformatics. 2011;12(Suppl 13):S14. doi: 10.1186/1471-2105-12-S13-S14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Soh D, Dong D, Guo Y, Wong L. Finding consistent disease subnetworks across microarray datasets. BMC Bioinformatics. 2011;12(Suppl 13):S15. doi: 10.1186/1471-2105-12-S13-S15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hsu CH, Wang TY, Chu HT, Kao CY, Chen KC. A quantitative analysis of monochromaticity in genetic interaction networks. BMC Bioinformatics. 2011;12(Suppl 13):S16. doi: 10.1186/1471-2105-12-S13-S16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rajapakse JC, Mundra PA. Stability of building gene regulatory networks with sparse autoregressive models. BMC Bioinformatics. 2011;12(Suppl 13):S17. doi: 10.1186/1471-2105-12-S13-S17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Lee SA, Tsao TTH, Yang KC, Lin H, Kuo YL, Hsu CH, Lee WK, Huang KC, Kao CY. Construction and analysis of the protein-protein interaction networks for schizophrenia, bipolar disorder, and major depression. BMC Bioinformatics. 2011;12(Suppl 13):S20. doi: 10.1186/1471-2105-12-S13-S20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Liu C, Yao M, Hogue CWV. Near-membrane ensemble elongation in the proline-rich LRP6 intracellular domain may explain the mysterious initiation of the Wnt signaling pathway. BMC Bioinformatics. 2011;12(Suppl 13):S13. doi: 10.1186/1471-2105-12-S13-S13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Du TH, Puah WC, Wasser M. Cell cycle phase classification in 3D in vivo microscopy of Drosophila embryogenesis. BMC Bioinformatics. 2011;12(Suppl 13):S18. doi: 10.1186/1471-2105-12-S13-S18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Veronika M, Welsch R, Ng A, Matsudaira P, Rajapakse JC. Correlation of cell membrane dynamics and cell motility. BMC Bioinformatics. 2011;12(Suppl 13):S19. doi: 10.1186/1471-2105-12-S13-S19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Grover A, Agrawal V, Shandilya A, Bisaria VS, Sundar D. Non-nucleosidic inhibition of Herpes simplex virus DNA polymerase: mechanistic insights into the anti-herpetic mode of action of herbal drug withaferin A. BMC Bioinformatics. 2011;12(Suppl 13):S22. doi: 10.1186/1471-2105-12-S13-S22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Tambunan USF, Bramantya N, Parikesit AA. In silico modification of suberoylanilide hydroxamic acid (SAHA) as a potential inhibitor for class II histone deacetylase (HDAC) BMC Bioinformatics. 2011;12(Suppl 13):S23. doi: 10.1186/1471-2105-12-S13-S23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lim SV, Rahman MBA, Tejo BA. Structure-based and ligand-based virtual screening of novel methyltransferase inhibitors of the dengue virus. BMC Bioinformatics. 2011;12(Suppl 13):S24. doi: 10.1186/1471-2105-12-S13-S24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Khanna V, Ranganathan S. In silico approach to screen compounds active against parasitic nematodes of major socio-economic importance. BMC Bioinformatics. 2011;12(Suppl 13):S25. doi: 10.1186/1471-2105-12-S13-S25. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from BMC Bioinformatics are provided here courtesy of BMC

RESOURCES