skip to main content
10.1145/1097047.1097052acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

DirectoryRank: ordering pages in web directories

Published:04 November 2005Publication History

ABSTRACT

Web Directories are repositories of Web pages organized in a hierarchy of topics and sub-topics. In this paper, we present DirectoryRank, a ranking framework that orders the pages within a given topic according to how informative they are about the topic. Our method works in three steps: first, it processes Web pages within a topic in order to extract structures that are called lexical chains, which are then used for measuring how informative a page is for a particular topic. Then, it measures the relative semantic similarity of the pages within a topic. Finally, the two metrics are combined for ranking all the pages within a topic before presenting them to the users.

References

  1. Google Directory http://dir.google.com/.Google ScholarGoogle Scholar
  2. MultiWordNet Domains http://wndomains.itc.it/.Google ScholarGoogle Scholar
  3. Open Directory Project http://dmoz.com/.Google ScholarGoogle Scholar
  4. Sumo Ontology http://ontology.teknowledge.com/.Google ScholarGoogle Scholar
  5. WordNet 2.0 http://www.cogsci.princeton.edu/wn/.Google ScholarGoogle Scholar
  6. Barzilay R Lexical chains for text summarization. Master's Thesis, Ben-Gurion University, 1997.Google ScholarGoogle Scholar
  7. Bharat K and Mihaila G. Hilltop: a search engine based on expert documents: http://www.cs.toronto.edu/georgem/hilltop/.Google ScholarGoogle Scholar
  8. Kleinberg J. Authoritative sources in a hyperlinked environment. In Journal of the ACM, 46(5), 1999, 604--632. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Haveliwala T. Topic sensitive PageRank. In Proceedings of the 11th WWW Conference, 2002, 517--526. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ntoulas A., Cho J. and Olston Ch. What's new on the web? The evolution of the web from a search engine perspective. In Proceedings of the 13th WWW Conference, 2004, 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Page L., Brin S., Motwani R. and Winograd T. The PageRank citation ranking: Bringing order to the web. Available at http://dbpubs.stanford.edu:8090/pub/1999-66.Google ScholarGoogle Scholar
  12. Song Y.I., Han K.S. and Rim H.C. A term weighting method based on lexical chain for automatic summarization. In Proceedings of the 5 th CICLing Conference, 2004, 636--639.Google ScholarGoogle ScholarCross RefCross Ref
  13. Stamou S., Krikos V., Kokosis P., Ntoulas A. and Christodoulakis D. Web directory construction using lexical chains. In Proceedings of the 10 th NLDB Conference 2005, 138--149. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Wang Y. and DeWitt D. Computing PageRank in a distributed internet search system. In Proc. of the 30th VLDB Conf., 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. DirectoryRank: ordering pages in web directories

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            WIDM '05: Proceedings of the 7th annual ACM international workshop on Web information and data management
            November 2005
            96 pages
            ISBN:1595931945
            DOI:10.1145/1097047

            Copyright © 2005 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 4 November 2005

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Upcoming Conference

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader