Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Oct 2;176:2069–2076. doi: 10.1016/j.procs.2020.09.243

Named Entities and Their Role in Creating Context Information

Kurt Englmeier 1
PMCID: PMC7531913  PMID: 33042306

Abstract

Context encompasses the classification of a certain environment by its key attributes that take the role of semantic markers. It is an abstract representation of a certain data environment. In texts, the context classifies and represents a piece of text in a generalized form. Context can be a recursive construct when summarizing information on a more coarse-grained level. This paper presents identification and standardization of context on different levels of granularity that finally supports faster and more precise information retrieval. The prototypical system presented here applies supervised learning for a semi-automatic approach to extract, distil, and standardize data from text. The approach is based on named-entity recognition and simple ontologies for identification and disambiguation of context. Even though the prototype shown here still represents work in progress, it already demonstrates its potential for mining texts on different levels of context granularity. The paper presents the design of the Contexter system that supports identification and classification of misinformation and fake news around the topic Covid-19.

Keywords: Information Extraction, Named-Entity Recognition, Bag of Words, Information Summarization, Entity Linking, Semantic Markers, Context-Awareness, Economic Information

References

  1. Jacobs S., Wächter K., Coates L. "First Major German City Under Curfew, will Berlin follow soon?". Der Tagesspiegel. 2020 https://www.tagesspiegel.de/berlin/coronavirus-outbreak-first-major-german-city-under-curfew-will-berlin-follow-soon/25664744.html, March 20, 2020. Retrieved at, on May 11, 2020. [Google Scholar]
  2. 2.Rubin, V. L., Chen, Y., Conroy, N. J., (2015) “Deception Detection for News: Three Types of Fakes”. Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community: 1-4.
  3. 3.Pérez-Rosas, V., Kleinberg, B., Lefevre, A., Mihalcea, R. (2019) “Automatic Detection of Fake News”. Proceedings of the 27th International Conference on Computational Linguistics, 27: 3391-3401.
  4. 4.Juola, P. (2012) “Detecting stylistic deception”. Proceedings of the Workshop on Computational Approaches to Deception Detection: 91-96.
  5. 5.Levitan, S.I., An, G., Wang, M., Mendels, G., Hirschberg, J., Levine, M., Rosenberg, A. (2015) “Cross-Cultural Production and Detection of Deception from Speech”. Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection: 1-8
  6. Gupta M., Han J. "Heterogeneous Network-Based Trust Analysis: A Survey". ACM SIGKDD Explorations Newsletter. 2011;13(1):54–71. [Google Scholar]
  7. 7.Zhou, X., Zafarani, R. (2019) “Fake News Detection: An Interdisciplinary Research”. Companion Proceedings of the 2019 World Wide Web Conference: 1292.
  8. 8.Zafarani, R., Zhou, X., Shu, K., Liu, H. (2019) “Fake News Research:Theories, Detection Strategies, and Open Problems”. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining: 3207-3208.
  9. Cha M., Gao W., Li C.-T. "Detecting Fake News in Social Media: An Asia-Pacific Perspective". Communications of the ACM. 2020;63(4):68–71. [Google Scholar]
  10. 10.Katsaros, D., Stavropoulos, G., Papakostas, D. (2019) “Which machine learning paradigm for fake news detection?”. IEEE/WIC/ACM International Conference on Web Intelligence: 383-387.
  11. Shu K., Sliva A., Wang S., Tang J., Liu H. "Fake News Detection on Social Media:A Data Mining Perspective". ACM SIGKDD Explorations Newsletter. 2017;19(1):22–36. [Google Scholar]
  12. 12.Wynne, H.E., Wint, Z.Z. (2019) “Content Based Fake News Detection Using N-Gram Models”. Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services (iiWAS2019): 669-673.
  13. Woods W.A. "Context-Sensitive Parsing". Communications of the ACM. 1970;13(7):413–445. [Google Scholar]
  14. Cowie J., Lehnert W. "Information Extraction". Communications of the ACM. 1996;39(1):80–91. [Google Scholar]
  15. Salton G., Allan J., Buckely Ch., Singhal A. Karen Sparck Jones and Peter Willett. Readings in Information Retrieval; San Francisco: 1997. "Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts"; pp. 478–483. [Google Scholar]
  16. 16.Jancsary, J., Neubarth, F., Schreitter, S., Trost, H. (2011) “Towards a context-sensitive online newspaper”. Proceedings of the 2011 Workshop on Context-awareness in Retrieval and Recommendation: 2-9.
  17. 17.Calvo Martinez, J. (2018) “Event Mining over Distributed Text Streams”. Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining: 745-746.
  18. Schneider P. "Language usage and social action in the psychoanalytic encounter: discourse analysis of a therapy session fragment". Language and Psychoanalysis. 2013;2(1):4–19. [Google Scholar]
  19. Murtagh F. "Mathematical Representations of Matte Blanco’s Bi-Logic, based on Metric Space and Ultrametric or Hierarchical Topology: Towards Practical Application.". Language and Psychanalysis. 2014;3(2):40–63. [Google Scholar]
  20. 20.Sha, F., Pereira, F. (2003) “Shallow Parsing with Conditional Random Fields”. Proceedings of the HLT-NAACL conference: 134-141.
  21. 21.Freitag, D., McCallum, A. (2000) “Information Extraction with HMM Structures Learned by Stochastic Optimization”. Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence: 584-589.
  22. 22.Dang, V., Croft, B.W. (2010) “Query Reformulation Using Anchor Text”. Proceedings of the Third ACM International Conference on Web Search and Data Mining: 41-50.

Articles from Procedia Computer Science are provided here courtesy of Elsevier

RESOURCES