Skip to main content

Using OLAP and Data Mining for Content Planning in Natural Language Generation

  • Conference paper
  • First Online:
Book cover Natural Language Processing and Information Systems (NLDB 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1959))

  • 4466 Accesses

Abstract

We present a new approach to content determination and discourse organization in Natural Language Generation (NLG). This approach relies on two decision-support oriented database technologies, OLAP and data mining, and it can be used for any NLG application involving the textual summarization of quantitative data. It improves on previous approaches to content planning for NLG in quantitative domains by providing: (1) application domain independence, (2) efficient, variable granularity insight search in high dimensionality data spaces, (3) automatic discovery of surprising, counter-intuitive data, and (4) tailoring of output text organization towards different, declaratively speci- fied, analytical perspectives on the input data.

HYpertext Summary System of On-line analytical Processing. On-Line Analytical Processing Multidimensional Analysis and Textual Reporting for Insight Knowledge Search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aït-Kaci, H. and Lincoln, P.: LIFE. A natural language for natural language. T.A. Informations, 30(1-2):37–67, Association pour le Traitement Automatique des Langues, Paris France (1989).

    Google Scholar 

  2. Bourbeau, L., Carcagno, D., Goldberg, E., Kittredge, R., Polguère, A.: Bilingual generation of weather forecast in an operational environment. COLING’90, Helsinki, (1990).

    Google Scholar 

  3. Carcagno, D., Iordanskaja, L.: Content determination and text structuring; two interrelated processes. In H. Horacek [ed.] New concepts in NLG: Planning, realization and systems. London: Pinter Publishers, pp 10–26, (1993).

    Google Scholar 

  4. Chen, Q.: Mining exceptions and quantitative association rules in OLAP data cube. M.Sc.Thesis. Department of CS, Simon Fraser University, B.C., Canada, July (1999).

    Google Scholar 

  5. Elhadad, M. and Robin, J.: An overview of SURGE: a re-usable comprehensive syntactic realization component. In Proceedings of the 8th International Workshop on Natural Language Generation (demonstration session). Brighton, UK (INLG’96). (1996)

    Google Scholar 

  6. Favero, E.L.: Generating hypertext summaries of data mining discoveries in multidimensional databases. Ph.D. thesis, CIn, UFPE, Recife, Brazil.(2000)

    Google Scholar 

  7. Favero, E.L. and Robin, J.: Um ambiente para desenvolvimento de gramáticas computacionais para o Português, Revista de Informática Teórica e Aplicada, Volume VI, Número 1, Julho, 1999 pp 49–76, Porto Alegre, (1999).

    Google Scholar 

  8. Han, J.: OLAP Mining: An integration of OLAP with Data Mining. In Proc. 1997 IFIP Conf. Data Semantics (DS-7), pp 1–11, Leysin, Switzerland, Oct. (1997).

    Google Scholar 

  9. Hovy, E.: Automated discourse generation using discourse structure relations. Artificial Intelligence, 63: 341–385, (1993).

    Article  Google Scholar 

  10. Iordanskaja, L., Kim, M., Kittredge, R., Lavoie, B., Polguere, A.: Generation extended bilingual statistical reports. In Proc. of COLING 94, pp.1019–1023 (1994).

    Google Scholar 

  11. Kay, M. Functional grammar. In proceedings of the 5th Annual Meeting of the Berkeley Linguist Society, (1979).

    Google Scholar 

  12. Knott, A. Mellish, C. Oberlander, J. and O’Donnell, M.: Sources of flexibility in dynamic hypertext generation. In Proc. of the 8th International Workshop in NLG, Sussex, UK, (1996).

    Google Scholar 

  13. Kukich, K.: Knowledge-based Report Generation: A knowledge-engineering approach to NL Report Generation; Department of Information Science, University of Pittsburgh, Ph.D. thesis, (1983).

    Google Scholar 

  14. Mann, W C Thompson, S A.: Rhetorical structure theory: Toward a functional theory of text organization. Text, 8(3):243–281, (1988).

    Google Scholar 

  15. McKeown, K.: Text generation. Cambridge University Press, Cambridge, (1985).

    Google Scholar 

  16. McKeown, K., Kukich, K., Shaw, J.: Practical issues in automatic document generation. In Proc. of ANLP.94, pages 7–14, Stuttgart, October (1994).

    Google Scholar 

  17. Milosavljevic, M. Tulloch, A. and Dale, R. Text generation in a dynamic hypertext environment. In Proceeding of the 199th Australian Computer Science Conference, 417–426, Melbourne, Austria, (1996).

    Google Scholar 

  18. Passonneau, B., Kukich, K., Robin, J., Hatzivassiloglou, V., Lefkowitz, L. Jin, H.: Generating Summaries of Work Flow Diagrams. In Proc. of the Intern. Conference on NLP and Industrial Applications. Moncton, New Brunswick, Canada (NLP+IA’96). 7p. (1996).

    Google Scholar 

  19. Reiter, E., Dale, R.: Building applied natural language generation system. ANLPC Washington DC. (1997).

    Google Scholar 

  20. Robin, J. Favero, E.L.: Content aggregation in natural language hypertext summarization of OLAP and Data Mining discoveries. In Proc. of INLG.2000 Conference (Intern. Natural Language Generation), Mitzpe Ramon, Israel, June (2000).

    Google Scholar 

  21. Sarawagi, S., Agrawal, R., Megiddo, N.: Discovery-driven exploration of MDDB data cubes. In Proc. Int. Conf. of Extending Database Technology EDBT.98, March, (1998).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Favero, E.L., Robin, J. (2001). Using OLAP and Data Mining for Content Planning in Natural Language Generation. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2000. Lecture Notes in Computer Science, vol 1959. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45399-7_14

Download citation

  • DOI: https://doi.org/10.1007/3-540-45399-7_14

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41943-3

  • Online ISBN: 978-3-540-45399-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics