Abstract
The essence of computer applications is to store things in the real world into computer systems in the form of data, i.e., it is a process of producing data. Some data are the records related to culture and society, and others are the descriptions of phenomena of universe and life. The large scale of data is rapidly generated and stored in computer systems, which is called data explosion. Data explosion forms data nature in computer systems. To explore data nature, new theories and methods are required. In this paper, we present the concept of data nature and introduce the problems arising from data nature, and then we define a new discipline named dataology (also called data science or science of data), which is an umbrella of theories, methods and technologies for studying data nature. The research issues and framework of dataology are proposed.
Keywords: Cloud Computing, Real Nature, Data Nature, Brain Data, Internet Data
Contributor Information
Ning Zhong, Email: zhong@maebashi-it.ac.jp.
Kuncheng Li, Email: likuncheng1955@yahoo.com.cn.
Shengfu Lu, Email: lusf@bjut.edu.cn.
Lin Chen, Email: cl@cogsci.ibp.ac.cn.
Yangyong Zhu, Email: yyzhu@fudan.edu.cn.
Ning Zhong, Email: zhong@maebashi-it.ac.jp.
Yun Xiong, Email: yunx@fudan.edu.cn.
References
- 1.http://www.emc.com/collateral/demos/microsites/idc-digital-universe/iview.htm (accessed May 2009)
- 2.Krawetz, S., Misener, S. (eds.): Bioinformatics Methods and Protocols. Humana Press (2000)
- 3.Zhong N., Liu J., Yao Y., Wu J., Lu S., Li K., editors. Web Intelligence Meets Brain Informatics. Heidelberg: Springer; 2007. pp. 1–31. [Google Scholar]
- 4.Cao, L.B.: Behavior Informatics and Analytics: Let Behavior Talk. In: Proceedings of the 2008 IEEE International Conference on Data Mining Workshops (2008)
- 5.Berman F., Fox G., Hey A., editors. Grid Computing: Making the Global Infrastructure a Reality. Chichester: John Wiley & Sons; 2003. [Google Scholar]
- 6.Hayes B. Cloud Computing. Communications of the ACM. 2008;51(7):9–11. doi: 10.1145/1364782.1364786. [DOI] [Google Scholar]
- 7.Zhong N., Liu J., Yao Y.Y. Envisioning Intelligent Information Technologies through the Prism of Web Intelligence. Communications of the ACM. 2007;50(3):89–94. doi: 10.1145/1226736.1226741. [DOI] [Google Scholar]
- 8.Chapman G., Marc R. The National Information Infrastructure: A Public Interest Opportunity. Computer Professionals for Social Responsibility. 1993;11(2):13–15. [Google Scholar]
- 9.Collins F.S., Patrinos A., Jordan E., et al. New Goals for the U.S. Human Genome Project: 1998-2003. Science. 1998;282(5389):682–689. doi: 10.1126/science.282.5389.682. [DOI] [PubMed] [Google Scholar]
- 10.Freeston, M.: The Alexandria Digital Library and the Alexandria Digital Earth Prototype. In: Proceedings of the 4th ACM/IEEE-CS joint Conference on Digital Libraries (2004)
- 11.Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: an overview. In: Advances in Knowledge Discovery and Data Mining (1996)
