Abstract
It has long been a dream to build computer systems that learn automatically by reading text. This dream is generally considered infeasible, but some surprising developments in the US over the past three years have led to the funding of several short-term investigations into whether and how much the best current practices in Natural Language Processing and Knowledge Representation and Reasoning, when combined, actually enable this dream. This paper very briefly describes one of these efforts, the Learning by Reading project at ISI, which has converted a high school textbook of Chemistry into very shallow logical form and is investigating which semantic features can plausibly be added to support the kinds of inference required for answering standard high school text questions.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet Project. In: Proceedings of the COLING/ACL conference, pp. 86–90 (1998)
Berland, M., Charniak, E.: Finding Parts in Very Large Corpora. In: Proceedings of the conference of the Association for Computational Linguistics (ACL 1999), pp. 57–64 (1999)
Chalupsky, H., Russ, T.A.: WhyNot: Debugging Failed Queries in Large Knowledge Bases. In: Proceedings of the Fourteenth Innovative Applications of Artificial Intelligence conference (IAAI 2002), pp. 870–877 (2002)
Charniak, E.: A Maximum-Entropy-Inspired Parser. In: Proc. of NAACL 2000 conference (2000)
Dorr, B.J., Habash, N.: Lexical Conceptual Structure Lexicons. In: Calzolari, et al. (eds.) Survey of Major Approaches Towards Bilingual/Multilingual Lexicons (2001) ISLE-IST-1999-10647-WP2-WP3
Fellbaum, C. (ed.): WordNet: An On-line Lexical Database and Some of its Applications. MIT Press, Cambridge (1998)
Fleischmann, M., Hovy, E.H., Echihabi, A.: Offline Strategies for Online Question Answering: Answering Questions before They are Asked. In: Proceedings of the conference of the Association for Computational Linguistics (ACL 2003), pp. 1–7 (2003)
Friedland, N., et al.: (2003), http://www.projecthalo.com/halotempl.asp?cid=30
Girju, R., Baldulescu, A., Moldovan, D.: Learning Semantic Constraints for the Automatic Discovery of Part-hole Relations. In: Proceedings of HLT-NAACL 2003 conference, pp. 80–87 (2003)
Hearst, M.: Automatic Acquisition of Hyponyms from Large Text Corpora. In: Proceedings of the COLING 1992 conference, pp. 539–545 (1992)
Hobbs, J.R.: Overview of the TACITUS Project. Computational Linguistics 12(3) (1986)
Hobbs, J.R.: Discourse and Inference (forthcoming)
Hovy, E.H., Hermjakob, U., Lin, C.-Y., Ravichandran, D.: Using Knowledge to Facilitate Pinpointing of Factoid Answers. In: Proceedings of the COLING-2002 conference (2002), Available at: http://www.isi.edu/natural-language/projects/webclopedia/Taxonomy/taxonomy_toplevel.html
Hovy, E.H., Lin, C.-Y., Zhou, L., Fukumoto, J.: Automated Summarization Evaluation with Basic Elements. In: Proceedings of the LREC conference (2006)
Lin, D.: Principar — An Efficient, Broad-Coverage, Principle-Based Parser. In: Proceedings of the COLING/ACL conference, pp. 42–48 (1994)
Lucene: Open Source software (2005), http://lucene.apache.org/java/docs/
Mahesh, K.: Ontology Development for Machine Translation: Ideology and Methodology. New Mexico State University CRL report MCCS-96-292 (1996)
Palmer, M., Gildea, D., Kingsbury, P.: The Proposition Bank: A Corpus Annotated with Semantic Roles. Computational Linguistics 31(1) (2005)
Pantel, P., Ravichandran, D.: Automatically Labeling Semantic Classes. In: Proceedings of the HLT-NAACL 2004 conference (2004)
Pantel, P.: Inducing Ontological Co-occurrence Vectors. In: Proceedings of the conference of the Association for Computational Linguistics (ACL 2005) (2005)
Pantel, P., Pennacchiotti: Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations. In: Proceedings of the conference of the Association for Computational Linguistics (COLING/ACL 2006) (2006)
Pennacchiotti, M., Pantel, P.: A Bootstrapping Algorithm for Automatically Harvesting Semantic Relations. In: Proceedings of the ICOS 2006 conference (2006)
Philpot, A., Hovy, E.H., Pantel, P.: The Omega Ontology. In: Proceedings of the ONTOLEX Workshop at the International Joint Conference on NLP, Jeju Island, Korea (2005)
Ravichandran, D., Hovy, E.H.: Learning Surface Text Patterns for a Question Answering System. In: Proceedings of the conference of the Association for Computational Linguistics (ACL 2002), pp. 41–47 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hovy, E. (2006). Learning by Reading: An Experiment in Text Analysis. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_1
Download citation
DOI: https://doi.org/10.1007/11846406_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer ScienceComputer Science (R0)