skip to main content
10.1145/967900.968241acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

A learning-based approach for fetching pages in WebVigiL

Published:14 March 2004Publication History

ABSTRACT

The World Wide Web is an omni-present and an ever-expanding source of data. Data on the web is constantly increasing and changing. Many a times, users are interested in specific changes to the data on the web. Currently, in order to detect changes of interest, users have to poll the pages periodically and check for the changes of interest. WebVigiL is a general-purpose information monitoring and notification system. It handles the specification, intelligent fetch, detection, and propagation of changes as requested by a user while meeting the quality of service requirements. We use the active capability in the form of event-condition-action (ECA) rules, and a combination of push/pull paradigm for change monitoring. In this paper, we present an overview of the specification language and the run time management of sentinels. We discuss in detail the use of ECA rules for fetching and the adaptive learning algorithm used for fetching pages. We conclude with the implementation status of WebVigiL.

References

  1. Jacob, J, A. Sachde, and S. Chakravarthy, CX-Diff: A Change Detection Algorithm for XML Content and Change Presentation Issues in WebVigiL, in the Proc. of XSDM Workshop, Chicago, 2003, pp. 273--284.Google ScholarGoogle Scholar
  2. Jacob, J., et al., WebVigiL: An approach to Just-In-Time Information Propagation In Large Network-Centric Environments, in Web Dynamics Book. 2003, Springer-Verlag, 2004.Google ScholarGoogle Scholar
  3. Changedetection, http://www.changedetection.com.Google ScholarGoogle Scholar
  4. Douglis, F., et al., The AT&T Internet Difference Engine: Tracking and Viewing Changes on the Web, in World Wide Web. 1998, Baltzer Science Publishers. p. 27--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bhowmick, S., et al. Detecting and Representing Relevant Web Deltas Using Web Join. in the Proc of ICDCS, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Mind-it, http://www.netmind.com/.Google ScholarGoogle Scholar
  7. Liu, L., et al. Information Monitoring on the Web: A Scalable Solution. in the Int'l Journal of World Wide Web, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xyleme, http://www.xyleme.com/.Google ScholarGoogle Scholar
  9. Nguyen, B., et al. Monitoring XML Data on the Web. in the Proc. of ACM SIGMOD, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Chakravarthy, S. and D. Mishra, Snoop: An Expressive Event Specification Language for Active Databases. Data and Knowledge Engineering, 1994. 14(10): p. 1--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Anwar, E., L. Maugis, and S. Chakravarthy, A New Perspective on Rule Support for Object-Oriented Databases, in the Proc. of ACM SIGMOD, 1993, p. 99--108. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Pandrangi, N., et al. WebVigiL: User Profile-Based Change Detection for HTML/XML Documents, in the Proc. of BNCOD, Coventry, UK, 2003, pp. 38--55. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Deolasee, P, et al. Adaptive Push-Pull: Disseminating Dynamic Web Data. in the Proc. of WWW Conference. 2001. Hong Kong. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    SAC '04: Proceedings of the 2004 ACM symposium on Applied computing
    March 2004
    1733 pages
    ISBN:1581138121
    DOI:10.1145/967900

    Copyright © 2004 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 14 March 2004

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate1,650of6,669submissions,25%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader