ABSTRACT
Web directories are repositories of Web pages organized in a hierarchy of topics and sub-topics. In this paper, we present DirectoryRank, a ranking framework that orders the pages within a given topic according to how informative they are about that topic. Our method works in three steps. First, it processes the Web pages within a topic to extract structures called lexical chains, which are then used to measure how informative each page is about the topic. Second, it measures the relative semantic similarity of the pages within the topic. Finally, it combines the two metrics to rank all the pages within the topic before presenting them to users.
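The final step, combining the two metrics into one ordering, can be illustrated with a minimal sketch. The abstract does not state how the metrics are combined, so the weighted sum used here, the `weight` parameter, the function name `rank_pages`, and the assumption that each page already has a single informativeness score and a single similarity score in [0, 1] are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def rank_pages(informativeness: Dict[str, float],
               similarity: Dict[str, float],
               weight: float = 0.5) -> List[str]:
    """Order page ids by a weighted sum of two per-page scores.

    ASSUMPTION: the weighted sum is an illustrative stand-in; the
    abstract does not specify the actual combination used by
    DirectoryRank.
    """
    combined = {
        page: weight * informativeness[page]
        + (1.0 - weight) * similarity[page]
        for page in informativeness
    }
    # Highest combined score first; ties are broken by page id.
    return sorted(combined, key=lambda page: (-combined[page], page))


# Hypothetical scores for three pages listed under one topic.
informativeness = {"a": 0.8, "b": 0.2, "c": 0.5}
similarity = {"a": 0.4, "b": 0.9, "c": 0.5}
ranking = rank_pages(informativeness, similarity)
```

Setting `weight` to 1.0 ranks by informativeness alone and 0.0 ranks by similarity alone, which makes it easy to see how much each metric contributes to the final order under this assumed scheme.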