Abstract
There is a wealth of information to be mined from the World Wide Web. Unfortunately, standard natural language processing (NLP) extraction techniques perform poorly on choppy, semi-structured information fragments, such as the sports results now commonly published on Web pages. In this paper, we present SportsFinder, an information agent that extracts sports scores from the World Wide Web, together with a knowledge discovery method that learns new expression patterns to improve the agent's performance.
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lu, H., Sterling, L., Wyatt, A. (1999). Knowledge Discovery in SportsFinder: An Agent to Extract Sports Results from the Web. In: Zhong, N., Zhou, L. (eds) Methodologies for Knowledge Discovery and Data Mining. PAKDD 1999. Lecture Notes in Computer Science, vol 1574. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48912-6_62
DOI: https://doi.org/10.1007/3-540-48912-6_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65866-5
Online ISBN: 978-3-540-48912-2
eBook Packages: Springer Book Archive