Abstract
We introduce a new information system for organization of a Digital Library of news articles found on the Web, with automatic topic classification. We present our strategies to deal with different update frequencies of news Web sites, the classification methodology, the data model for storing news articles, measurements on the data retrieved and finally results of classification of this type of information.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bowman, C., Danzig, P., Hardy, D., Manber, U. and Schwartz, M.: The Harvest Information Discovery and Access System. Proceedings of the Second International WWW Conference. pp.763–771, 1994.
Dumais, S., Platt, J., Heckerman, D. and Sahami, M.: Inductive Learning Algorithms and Representations for Text Categorization. Proceedings of the Seventh International Conference on Information and Knowledge Management, 1998.
Joachims, T.: Making large-Scale SVM Learning Practical. Advances in Kernel Methods-Support Vector Learning, B. Schölkopf and C. Burges and A. Smola (ed.), MIT-Press, 1999.
Maria, N., Gaspar, P., Grilo, N., Ferreira, A. and Silva M. J.: ARIADNE-Digital Library Architecture. Proceedings of the 2nd European Conference on digital Libraries (ECDL’98), pages 667–668, 1998.
Maria, N. and Silva, M. J.: Theme-based Retrieval of Web News. Proceedings of the Third International Workshop on the Web and Databases (WebDB’2000). To be published as Springer LNCS.
Wiederhold G.: Mediators in the Architecture of Future Information Systems. IEEE Computer, pages 38–49, March 1992.
Yang, Y. and Liu X.. A re-examination of text categorization methods. Proceedings of the 22th Ann Int ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’99), pages 42–49, 1999.
Yang, Y., Carbonell, J., Brown, R., Pierce, T., Archibald B. T. and Liu X.. Learning approaches for Detecting and Tracking News Events. IEEE Intelligent Systems: Special Issue on Applications of Intelligent Information Retrieval, 14(4), July/August 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maria, N., Silva, M.J. (2000). Building a Digital Library of Web News. In: Borbinha, J., Baker, T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2000. Lecture Notes in Computer Science, vol 1923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45268-0_36
Download citation
DOI: https://doi.org/10.1007/3-540-45268-0_36
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41023-2
Online ISBN: 978-3-540-45268-3
eBook Packages: Springer Book Archive