skip to main content
10.1145/1667780.1667853acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiucsConference Proceedingsconference-collections
research-article

How-to information search by lightweight analysis of web pages

Published:03 December 2009Publication History

ABSTRACT

We propose a method for searching for comprehensible how-to information on the Web. In our how-to information search, we use lightweight analysis of Web pages to extract how-to information from Web pages obtained by conventional Web search engines and rank them according to their easily-viewable-degree. In the extraction process, we focus on expressions in Web page text blocks that describe procedures. In the ranking process, we focus on images, the effect of letter string and the length of the how-to information.

References

  1. D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Extracting content structure for web pages based on visual representation. APWeb, pages 406--417, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Kokubo, S. Oyama, T. Yamada, Y. Kitamura, and T. Ishida. Keyword spice method for building domain-specific web search engines(in Japanese). IPSJ Journal, 43(6):1804--1813, 2002.Google ScholarGoogle Scholar
  3. S. Oyama, T. Kokubo, and T. Ishida. Domain-specific web search with keyword spices. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 16(1):17--27, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Takechi, T. Tokunoga, Y. Matsumoto, and H. Tanaka. Extracting lists of procedural expressions from web pages(in Japanese). IPSJ Journal, 44(5):1--13, 2003.Google ScholarGoogle Scholar

Index Terms

  1. How-to information search by lightweight analysis of web pages

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        IUCS '09: Proceedings of the 3rd International Universal Communication Symposium
        December 2009
        404 pages
        ISBN:9781605586410
        DOI:10.1145/1667780
        • General Chair:
        • Kazumasa Enami

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 3 December 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader