ABSTRACT
We propose a method for searching for comprehensible how-to information on the Web. In our how-to information search, we use lightweight analysis of Web pages to extract how-to information from Web pages obtained by conventional Web search engines and rank them according to their easily-viewable-degree. In the extraction process, we focus on expressions in Web page text blocks that describe procedures. In the ranking process, we focus on images, the effect of letter string and the length of the how-to information.
- D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Extracting content structure for web pages based on visual representation. APWeb, pages 406--417, 2003.Google ScholarDigital Library
- T. Kokubo, S. Oyama, T. Yamada, Y. Kitamura, and T. Ishida. Keyword spice method for building domain-specific web search engines(in Japanese). IPSJ Journal, 43(6):1804--1813, 2002.Google Scholar
- S. Oyama, T. Kokubo, and T. Ishida. Domain-specific web search with keyword spices. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 16(1):17--27, 2004. Google ScholarDigital Library
- M. Takechi, T. Tokunoga, Y. Matsumoto, and H. Tanaka. Extracting lists of procedural expressions from web pages(in Japanese). IPSJ Journal, 44(5):1--13, 2003.Google Scholar
Index Terms
- How-to information search by lightweight analysis of web pages
Recommendations
Clustering Search Engine Suggests by Modeling Topics of Web Pages collected with Suggests
IMCOM '16: Proceedings of the 10th International Conference on Ubiquitous Information Management and CommunicationIn this paper, we address the issue of how to overview the knowledge of a given query keyword. We especially focus on concerns of those who search for Web pages with a given query keyword, and study how to efficiently overview the whole list of Web ...
Comments