skip to main content
10.1145/3428757.3429136acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

A Patten Matcher for English Idioms on Web IndeX

Authors Info & Claims
Published:27 January 2021Publication History

ABSTRACT

Web Index (WIX in short) is a system that achieves joining information resources on the Web. WIX replaces keywords in Web documents hyperlinks to other web pages based on a WIX file that a user chose. WIX file is a kind of a dictionary that have a set of WIX entries (keyword and target URL). Using WIX, users can join any Web contents and arbitrary dictionaries. In conventional WIX, matching and linking are executed only for fixed character strings between the keyword set and the input text. However, when a user wants to search for phrases like idioms, this matching system is not sufficient because of the declension of words, change of the verb tense, and so on. Therefore, we propose a phrasal pattern matching mechanism on WIX. This helps users easily find idiom expressions in the text on the web and get more information.

References

  1. Dimitra Anastasiou. 2010. Idiom treatment experiments in machine translation. Cambridge Scholars Publishing.Google ScholarGoogle Scholar
  2. Masahiro Hayashi and Motomichi Toyama. 2011. Keio WIX System (1) User Interface. In DEIM'11 The 3rd Forum on Data Engineering and Information Management.Google ScholarGoogle Scholar
  3. F Ishizaki and M Toyama. 2012. An incremental update algorithm for large Aho-Corasick automaton. In Proceedings of the 4th Forum on Data Engineering and Information Management, F11-5. 1--6.Google ScholarGoogle Scholar
  4. Mitchell Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. (1993).Google ScholarGoogle Scholar
  5. Pablo N Mendes, Max Jakob, Andrés García-Silva, and Christian Bizer. 2011. DBpedia spotlight: shedding light on the web of documents. In Proceedings of the 7th international conference on semantic systems. 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Rada Mihalcea and Andras Csomai. 2007. Wikify!: Linking Documents to Encyclopedic Knowledge. In Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, CIKM '07. 233--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. David Milne and Ian H Witten. 2008. Learning to link with wikipedia. In Proceedings of the 17th ACM conference on Information and knowledge management. 509--518. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ryosuke Mori and Motomichi Toyama. 2011. Keio WIX System (2) Server Side Implementation. In DEIM'11 The 3rd Forum on Data Engineering and Information Management.Google ScholarGoogle Scholar
  9. W. Shen, J. Han, J. Wang, X. Yuan, and Z. Yang. 2018. SHINE+: A General Framework for Domain-Specific Entity Linking with Heterogeneous Information Networks. IEEE Transactions on Knowledge and Data Engineering 30, 2 (2018), 353--366.Google ScholarGoogle Scholar
  10. spacy.io. [n.d.]. spaCy · Industrial-strength Natural Language Processing in Python. https://spacy.io/Google ScholarGoogle Scholar
  11. Ikuya Yamada, Tomotaka Ito, Shinnosuke Usami, Shinsuke Takagi, Tomoya Toyoda, Hideaki Takeda, and Yoshiyasu Takefuji. 2014. Linkify: Enhanced reading experience by augmenting text using Linked Open Data. ISWC 2014 Semantic Web Challenge (2014).Google ScholarGoogle Scholar

Index Terms

  1. A Patten Matcher for English Idioms on Web IndeX

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      iiWAS '20: Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services
      November 2020
      492 pages

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 January 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader