Abstract
We present a new approach to content determination and discourse organization in Natural Language Generation (NLG). This approach relies on two decision-support oriented database technologies, OLAP and data mining, and it can be used for any NLG application involving the textual summarization of quantitative data. It improves on previous approaches to content planning for NLG in quantitative domains by providing: (1) application domain independence, (2) efficient, variable granularity insight search in high dimensionality data spaces, (3) automatic discovery of surprising, counter-intuitive data, and (4) tailoring of output text organization towards different, declaratively speci- fied, analytical perspectives on the input data.
HYpertext Summary System of On-line analytical Processing. On-Line Analytical Processing Multidimensional Analysis and Textual Reporting for Insight Knowledge Search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aït-Kaci, H. and Lincoln, P.: LIFE. A natural language for natural language. T.A. Informations, 30(1-2):37–67, Association pour le Traitement Automatique des Langues, Paris France (1989).
Bourbeau, L., Carcagno, D., Goldberg, E., Kittredge, R., Polguère, A.: Bilingual generation of weather forecast in an operational environment. COLING’90, Helsinki, (1990).
Carcagno, D., Iordanskaja, L.: Content determination and text structuring; two interrelated processes. In H. Horacek [ed.] New concepts in NLG: Planning, realization and systems. London: Pinter Publishers, pp 10–26, (1993).
Chen, Q.: Mining exceptions and quantitative association rules in OLAP data cube. M.Sc.Thesis. Department of CS, Simon Fraser University, B.C., Canada, July (1999).
Elhadad, M. and Robin, J.: An overview of SURGE: a re-usable comprehensive syntactic realization component. In Proceedings of the 8th International Workshop on Natural Language Generation (demonstration session). Brighton, UK (INLG’96). (1996)
Favero, E.L.: Generating hypertext summaries of data mining discoveries in multidimensional databases. Ph.D. thesis, CIn, UFPE, Recife, Brazil.(2000)
Favero, E.L. and Robin, J.: Um ambiente para desenvolvimento de gramáticas computacionais para o Português, Revista de Informática Teórica e Aplicada, Volume VI, Número 1, Julho, 1999 pp 49–76, Porto Alegre, (1999).
Han, J.: OLAP Mining: An integration of OLAP with Data Mining. In Proc. 1997 IFIP Conf. Data Semantics (DS-7), pp 1–11, Leysin, Switzerland, Oct. (1997).
Hovy, E.: Automated discourse generation using discourse structure relations. Artificial Intelligence, 63: 341–385, (1993).
Iordanskaja, L., Kim, M., Kittredge, R., Lavoie, B., Polguere, A.: Generation extended bilingual statistical reports. In Proc. of COLING 94, pp.1019–1023 (1994).
Kay, M. Functional grammar. In proceedings of the 5th Annual Meeting of the Berkeley Linguist Society, (1979).
Knott, A. Mellish, C. Oberlander, J. and O’Donnell, M.: Sources of flexibility in dynamic hypertext generation. In Proc. of the 8th International Workshop in NLG, Sussex, UK, (1996).
Kukich, K.: Knowledge-based Report Generation: A knowledge-engineering approach to NL Report Generation; Department of Information Science, University of Pittsburgh, Ph.D. thesis, (1983).
Mann, W C Thompson, S A.: Rhetorical structure theory: Toward a functional theory of text organization. Text, 8(3):243–281, (1988).
McKeown, K.: Text generation. Cambridge University Press, Cambridge, (1985).
McKeown, K., Kukich, K., Shaw, J.: Practical issues in automatic document generation. In Proc. of ANLP.94, pages 7–14, Stuttgart, October (1994).
Milosavljevic, M. Tulloch, A. and Dale, R. Text generation in a dynamic hypertext environment. In Proceeding of the 199th Australian Computer Science Conference, 417–426, Melbourne, Austria, (1996).
Passonneau, B., Kukich, K., Robin, J., Hatzivassiloglou, V., Lefkowitz, L. Jin, H.: Generating Summaries of Work Flow Diagrams. In Proc. of the Intern. Conference on NLP and Industrial Applications. Moncton, New Brunswick, Canada (NLP+IA’96). 7p. (1996).
Reiter, E., Dale, R.: Building applied natural language generation system. ANLPC Washington DC. (1997).
Robin, J. Favero, E.L.: Content aggregation in natural language hypertext summarization of OLAP and Data Mining discoveries. In Proc. of INLG.2000 Conference (Intern. Natural Language Generation), Mitzpe Ramon, Israel, June (2000).
Sarawagi, S., Agrawal, R., Megiddo, N.: Discovery-driven exploration of MDDB data cubes. In Proc. Int. Conf. of Extending Database Technology EDBT.98, March, (1998).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Favero, E.L., Robin, J. (2001). Using OLAP and Data Mining for Content Planning in Natural Language Generation. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2000. Lecture Notes in Computer Science, vol 1959. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45399-7_14
Download citation
DOI: https://doi.org/10.1007/3-540-45399-7_14
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41943-3
Online ISBN: 978-3-540-45399-4
eBook Packages: Springer Book Archive