skip to main content
10.1145/2184751.2184761acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

REV: extracting entity relations from world wide web

Authors Info & Claims
Published:20 February 2012Publication History

ABSTRACT

Quantities of valuable relation knowledge are contained in textual documents on the World Wide Web. However, those data are always organized in semi-structured text and cannot be used directly. We develop an automatic and effective approach to extract relations from World Wide Web, which just requires a few user specified seed instances as input. Those instances are used to generate extraction rules that in turn result in new instances. And in order to improve the reliability of results, an effective method is proposed to assess new extracted instances. This paper introduces the approach in details and the experimental results show that the approach achieves an average precision of 98.67% and can preferably complete the relation extraction task.

References

  1. Brin, S. 1998. "Extracting patterns and relations from the World Wide Web". In Proceedings of the 1998 International Work-shop on the Web and Databases (WebDB'98), March 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Freitag, D. and McCallum, A. 1999. "Information extraction with HMMs and shrinkage". In Proceedings of the AAAI-99 Workshop on Machine Learning for Information Extraction, 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Skounakis, M., Craven, M. and Ray, S. 2003. "Hierarchical hidden markov models for information extraction". In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Soderland, S. 1999. "Learning information extraction rules for semi-structured and free text". Machine Learning, 34(1-3):233--272, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. McCallum, A. 2003. "Efficiently inducing features or conditional random fields". In Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Che, W. X., Liu, T., Li, S. 2005. "Automatic Entity Relation Extraction". Journal of Chinese Information Processing, 2005, 19(2):1--6.Google ScholarGoogle Scholar
  7. Wang, Y., Xu, D. Z. and Chen, J. E. 2009. "Entity Relation Extraction for Complex Chinese Text". Computer Science, Vol.36, No.8, 2009.Google ScholarGoogle Scholar
  8. Jiang, J. F. and Wang, S. X. 2005. "A Bootstrapping Method for Acquisition of Bi-relations and Bi-relations Patterns". Journal of Chinese Information Processing, 2005, 19(2): 71--77.Google ScholarGoogle Scholar
  9. Agichtein, E. and Gravano, L. 2000. "Snowball: Extracting Relations from Large Plain-Text Collections". Proceedings of the 5th ACM International Conference on Digital Libraries, June 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Baidu, http://www.baidu.com.Google ScholarGoogle Scholar
  11. ICTCLAS, http://ictclas.org.Google ScholarGoogle Scholar
  12. Li, W. G., Liu, T. and Li, S. 2007. "Automated Entity Relation Tuple Extracting Using Web Mining". Acta Electronica Sinica, Vol. 35. No. 11, 2007.Google ScholarGoogle Scholar

Index Terms

  1. REV: extracting entity relations from world wide web

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICUIMC '12: Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
      February 2012
      852 pages
      ISBN:9781450311724
      DOI:10.1145/2184751

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 February 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate251of941submissions,27%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader