Abstract
The intelligent research assistant, VERA, supports inquiry-based modeling by supplying contextualized large-scale domain knowledge in the Encyclopedia of Life. Learners can use VERA to construct conceptual models of ecological phenomena, run them as simulations, and review their predictions. A study on the use of VERA by college-level students indicates that providing access to large scale but contextualized knowledge helped students build more complex models and generate more hypotheses in problem-solving.
Keywords: Agent-based modeling, Inquiry-based modeling, Ecology, College-level education, Science education
Introduction
Research on learning about scientific modeling has revealed the need for cognitive assistance of several kinds [2, 4, 9, 10]. In particular, scientific modeling requires domain knowledge, e.g., relationships between variables describing the system being modeled, as well as mathematical skills [9]. Thus, the question our research addresses is how can we scaffold the acquisition of domain knowledge involved in scientific modeling?
Of course, large amounts of knowledge about many domains are now readily accessible on the internet. However, much of this general-purpose knowledge is not particular to any specific task and thus difficult to comprehend by many learners. Our research hypothesis is that contextualized acquisition of this knowledge may help students achieve deeper understanding about the domain and generate richer models. The Virtual Ecological Research Assistant (VERA) supports scientific modeling in the domain of ecology using large scale domain knowledge through Smithsonian’s Encyclopedia of Life (EOL; [7]). Preliminary results from the experiment indicate that contextualized access to ecological knowledge from EOL helped the students build more richer models in problem-solving.
VERA: A Research Assistant for Ecological Modeling
VERA is a web-based system intended for large-scale use and supports scientific modeling in three ways [1]. First, it provides a visual language with a well-defined semantics to represent conceptual models clearly. Second, it automatically translates a conceptual model into an agent-based simulation suitable for the ecological domain without requiring any programming skills or mathematical expertise. Third, it provides access to large scale biological knowledge through EOL to help the students construct the conceptual models and set the simulation parameters.
While other recent modeling systems (Co-Lab and PROMETHEUS) use equation-based modeling that consists of a set of equations and executions to evaluate them [3, 5], VERA uses a visual language to specify conceptual models that automatically generate agent-based simulations and leverages contextualized domain knowledge to assist the process of construction of the models of ecological phenomena.
VERA is built on our previous work [6] that integrated Component-Mechanism-Phenomenon (CMP) models and their agent-based simulations. VERA contains three types of components: biotic abiotic, and habitat. VERA’s taxonomy of interactions among biotic components is based on the ontology of the interactions used by EOL, in particular, Global Biotic Interactions (GloBI) [8]. The specific interactions it uses are produces, consumes, becomes on death, and affects. It uses an off-the-shelf agent-based simulation system called NetLogo [11] because agent-based simulations are especially well suited for ecological modeling. Running the simulation enables the user to observe how system variables change over time and to refine their models through a generate-evaluate-revise loop.
Contextualization of Domain Knowledge
EOL is the world’s largest aggregated and curated database of species data with almost two million species and eleven million attribute records in the biological domain. VERA enables a learner to access EOL to find species of interest and automatically populate simulation parameters. VERA currently uses the following parameters specific to ecology from EOL: lifespan, body mass, carbon biomass, respiratory rate, photosynthesis rate, assimilation efficiency, reproductive maturity, reproductive interval, and offspring count.
Illustrative Example of Inquiry-Based Modeling Using VERA
In the following scenario, a learner wants to create a model of an observed food web to explore the predator-prey relationship between sheep and wolves. The learner begins by placing a biotic component into the conceptual model canvas, naming it “sheep,” and clicking on “Lookup species on EOL.” The system queries EOL for all matches to the scientific or common name “sheep” and checks for the existence of attribute records for each found species. The learner selects “domestic sheep,” and VERA extracts the species attributes from EOL that are relevant to the agent-based simulation (see Fig. 1). This provides the learner with valuable data that a student would be hard-pressed to locate and make sense of, reducing the cognitive load in model creation.
Fig. 1.
Automatic filling of simulation parameters retrieved from EOL.
The learner carries on with the model construction to add a predator (wolf) and food source (grass) along with adding consumption interactions between the populations, leveraging EOL lookups with each component and interaction. In this way, our intrepid novice scientist has constructed a partial food web model revolving around the species of interest. VERA automatically spawns the simulation and displays the results as a set of graphs, for example, a graph indicating the changes in populations of various species over time (see Fig. 2). The learner may now experiment with different simulation parameters, revise the conceptual model, or generate an alternative hypothesis.
Fig. 2.
A conceptual model of the relationships between species (left) and the simulation results generated from the conceptual model creating predator-prey cycle (right).
Lab Experiment
The goal of the study was to see if providing access to domain knowledge helps learners build richer models in problem solving. Fifteen self-selected students (N = 15) were recruited from a college-level general biology introductory course taught in Fall 2018. During the study, the students were asked to use VERA to explore multiple hypotheses to explain the decline in the sheep population. The students were also encouraged to actively use EOL when they needed information about a given species and later asked how often they used EOL while modeling.
Results
The students in our study developed models in VERA to evaluate their hypotheses and used EOL to get information about the species being modeled. The complexity of a model was calculated by adding the number of components in the conceptual model and the total number of relationships among the components. We found that access to ecological data from EOL helped students build more complex models and generate more hypotheses. The students who answered that they used the EOL frequently were found to come up with multiple hypotheses and build more complex models (Pearson product-moment correlation coefficients; r = 0.38; r = 0.26). We conjecture that information about the relationships between predator, prey, and competitors in the EOL knowledge-base may have led to the construction of more complex models. Interestingly, building complex models was associated with generating more hypotheses (r = 0.66). This means that the students who build more complex models are likely to have generated more hypotheses.
Conclusion
The research question in this work is how might we scaffold the acquisition of domain knowledge for students engaged in scientific modeling? The research hypothesis is that the contextualized acquisition of domain knowledge will help students build richer models. VERA contextualizes EOL’s large scale domain knowledge to support modeling of ecological systems. The study with college-level students using VERA confirmed that contextualized acquisition of domain knowledge helped them construct more complex models and more explanatory hypotheses.
Acknowledgements
This research is supported by an US NSF grant #1636848 (Big Data Spokes: Collaborative: Using Big Data for Environmental Sustainability: Big Data + AI Technology = Accessible, Usable, Useful Knowledge!) and Georgia Tech seed grants through the Brooke Byers Institute for Sustainable Systems.
Contributor Information
Ig Ibert Bittencourt, Email: ig.ibert@ic.ufal.br.
Mutlu Cukurova, Email: m.cukurova@ucl.ac.uk.
Kasia Muldner, Email: kasia.muldner@carleton.ca.
Rose Luckin, Email: r.luckin@ucl.ac.uk.
Eva Millán, Email: eva@lcc.uma.es.
Sungeun An, Email: sungeun.an@gatech.edu.
References
- 1.An S, Bates R, Hammock J, Rugaber S, Goel A, et al. VERA: popularizing science through AI. In: Penstein Rosé C, et al., editors. Artificial Intelligence in Education; Cham: Springer; 2018. pp. 31–35. [Google Scholar]
- 2.Bransford JD, Brown AL, Cocking RR, et al. How People Learn. Washington, DC: National Academy Press; 2000. [Google Scholar]
- 3.Bridewell W, Sánchez JN, Langley P, Billman D. An interactive environment for the modeling and discovery of scientific knowledge. Int. J. Hum Comput Stud. 2006;64(11):1099–1114. doi: 10.1016/j.ijhcs.2006.06.006. [DOI] [Google Scholar]
- 4.Hogan K, Thomas D. Cognitive comparisons of students’ systems modeling in ecology. J. Sci. Educ. Technol. 2001;10(4):319–345. doi: 10.1023/A:1012243102249. [DOI] [Google Scholar]
- 5.van Joolingen WR, de Jong T, Lazonder AW, Savelsbergh ER, Manlove S. Co-lab: research and development of an online learning environment for collaborative scientific discovery learning. Comput. Hum. Behav. 2005;21(4):671–688. doi: 10.1016/j.chb.2004.10.039. [DOI] [Google Scholar]
- 6.Joyner, D.A., Goel, A.K., Papin, N.M.: MILA-S: generation of agent-based simulations from conceptual models of complex systems. In: Proceedings of the 19th International Conference on Intelligent User Interfaces, pp. 289–298 (2014)
- 7.Parr, C.S., et al.: The encyclopedia of life v2: providing global access to knowledge about life on earth. Biodiv. Data J. (2), e1079 (2014) [DOI] [PMC free article] [PubMed]
- 8.Poelen JH, Simons JD, Mungall CJ. Global biotic interactions: an open infrastructure to share and analyze species-interaction datasets. Ecol. Inform. 2014;24:148–159. doi: 10.1016/j.ecoinf.2014.08.005. [DOI] [Google Scholar]
- 9.Sins PH, Savelsbergh ER, van Joolingen WR. The difficult process of scientific modelling: an analysis of novices’ reasoning during computer-based modelling. Int. J. Sci. Educ. 2005;27(14):1695–1721. doi: 10.1080/09500690500206408. [DOI] [Google Scholar]
- 10.VanLehn K. Model construction as a learning activity: a design space and review. Interact. Learn. Environ. 2013;21(4):371–413. doi: 10.1080/10494820.2013.803125. [DOI] [Google Scholar]
- 11.Wilensky U, Resnick M. Thinking in levels: a dynamic systems approach to making sense of the world. J. Sci. Educ. Technol. 1999;8(1):3–19. doi: 10.1023/A:1009421303064. [DOI] [Google Scholar]


