skip to main content
10.1145/3546118.3546149acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

Ontology-driven detection of redundancy in short texts and its visualization

Authors Info & Claims
Published:14 August 2022Publication History

ABSTRACT

Nowadays, a lot of content is being moved to digital form, which allows its better processing and analysis. However, in some cases, loose redundancy or duplicity can be a problem. In this paper, redundancy even duplicity of short texts based on ontology-related approaches and their visualization is presented. The paper presents several attempts even forms of visualization used to display information about possible redundancy and duplicity of content plus specially in the domain of educational content engineering.

References

  1. Muhammad Ahasanuzzaman, Muhammad Asaduzzaman, Chanchal K. Roy, and Kevin A. Schneider. 2016. Mining Duplicate Questions in Stack Overflow. In Proceedings of the 13th International Conference on Mining Software Repositories (Austin, Texas) (MSR ’16). Association for Computing Machinery, New York, NY, USA, 402–412. https://doi.org/10.1145/2901739.2901770Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Marek Dudáš, Steffen Lohmann, Vojtěch Svátek, and Dmitry Pavlov. 2018. Ontology visualization methods and tools: a survey of the state of the art. The Knowledge Engineering Review 33 (07 2018). https://doi.org/10.1017/S0269888918000073Google ScholarGoogle Scholar
  3. Mathias Ellmann. 2019. Same-Same But Different: On Understanding Duplicates in Stack Overflow. Informatik Spektrum 42 (08 2019). https://doi.org/10.1007/s00287-019-01185-yGoogle ScholarGoogle Scholar
  4. Jun Fang, Lei Guo, and Yue Niu. 2010. Documents classification by using ontology reasoning and similarity measure. In 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery, Vol. 4. 1535–1539. https://doi.org/10.1109/FSKD.2010.5569338Google ScholarGoogle ScholarCross RefCross Ref
  5. Wael Gomaa and Aly Fahmy. 2013. A Survey of Text Similarity Approaches. international journal of Computer Applications 68 (04 2013). https://doi.org/10.5120/11638-7118Google ScholarGoogle Scholar
  6. Vasyl Lytvyn, Victoria Vysotska, Oleh Veres, Ihor Rishnyak, and Halya Rishnyak. 2017. Classification Methods of Text Documents Using Ontology Based Approach. Vol. 512. 229–240. https://doi.org/10.1007/978-3-319-45991-2_15Google ScholarGoogle Scholar
  7. Vijaymeena M K and Kavitha K. 2016. A Survey on Similarity Measures in Text Mining. Machine Learning and Applications: An International Journal 3 (03 2016), 19–28. https://doi.org/10.5121/mlaij.2016.3103Google ScholarGoogle Scholar
  8. Didik Prasetya, Aji Wibawa, and Tsukasa Hirashima. 2018. The performance of text similarity algorithms. International Journal of Advances in Intelligent Informatics 4 (05 2018). https://doi.org/10.26555/ijain.v4i1.152Google ScholarGoogle ScholarCross RefCross Ref
  9. Aijazahamed Qazi and R.H. Goudar. 2018. An Ontology-based Term Weighting Technique for Web Document Categorization. Procedia Computer Science 133 (01 2018), 75–81. https://doi.org/10.1016/j.procs.2018.07.010Google ScholarGoogle Scholar
  10. Arash Saghafi. 2016. Visualizing Ontologies – A Literature Survey. 204–221. https://doi.org/10.1007/978-3-319-40985-6_16Google ScholarGoogle Scholar
  11. Rodrigo F. G. Silva, Klérisson Paixão, and Marcelo de Almeida Maia. 2018. Duplicate question detection in stack overflow: A reproducibility study. In 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). 572–581. https://doi.org/10.1109/SANER.2018.8330262Google ScholarGoogle ScholarCross RefCross Ref
  12. Minghe Yu, Guoliang Li, Dong Deng, and Jianhua Feng. 2015. String similarity search and join: a survey. Frontiers of Computer Science 10, 3 (Nov. 2015), 399–417. https://doi.org/10.1007/s11704-015-5900-5Google ScholarGoogle Scholar
  13. Yun Zhang, David Lo, Xin Xia, and Jianling Sun. 2015. Multi-Factor Duplicate Question Detection in Stack Overflow. Journal of Computer Science and Technology 30 (09 2015), 981–997. https://doi.org/10.1007/s11390-015-1576-4Google ScholarGoogle Scholar

Index Terms

  1. Ontology-driven detection of redundancy in short texts and its visualization

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CompSysTech '22: Proceedings of the 23rd International Conference on Computer Systems and Technologies
      June 2022
      188 pages
      ISBN:9781450396448
      DOI:10.1145/3546118

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 August 2022

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate241of492submissions,49%
    • Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format