Abstract
We present in this article the state of the art in lexical choice research in text generation and machine translation. The existing implementations are discussed with respect to four topics: (i) the place of lexical choice in the overall generation process, (ii) the information flow within the generation process and the consequences thereof for lexical choice, (iii) the internal organization of the lexical choice process, (iv) the phenomena covered by lexical choice. Possible future directions in lexical choice research are identified.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abeillé A.: 1995, The Flexibility of French Idioms: A Representation with Lexicalized Tree Adjoining Grammar, in M. Everaert et al. (eds) Idioms: Structural and Psychological Perspectives. Hillsdale: Lawrence Erlbaum Associates, 15–42.
Alonso M. and A. Tutin: 1996, A Classification and Description of Lexical Functions for the Analysis of their Combinations, in L. Wanner (ed.) Lexical Functions in Lexicography and Natural Language Processing, Amsterdam: Benjamins Academic Publishers, 147–167.
Alshawi, H., D. Carter, M. Rayner, and B. Gambäck: 1991, Translation by Quasi Logical Form Transfer, in Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics, Berkeley, CA, 161–168.
Appelt D.: 1985, Planning Natural Language Utterances. Cambridge: Cambridge University Press.
Appelt, D. and A. Kronfeld: 1987, A Computational Model of Referring, in Proceedings of the 10th International Joint Conference on Artificial Intelligence, Milan, 640–647.
Barnett, J., I. Mani, E. Rich, C. Aone, K. Knight, and J.C. Martinez: 1991, Capturing Language-Specific Semantic Distinctions in Interlingua-Based MT, in Proceedings of the 3rd MT Summit, Washington, DC. 25–32.
Bateman J.A. and E.H. Hovy: 1992, Computers and Text Generation: Principles and Uses, in C.S. Butler (ed.) Computers and Written Texts. Oxford: Basil Blackwell, 53–74.
Bateman, J.A. and C.L. Paris: 1989, Phrasing a Text in Terms the User Can Understand, in Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit, 1511–1517.
Becker, J.D.: 1975, The Phrasal Lexicon, in R.C. Schank and B.L. Webber (eds) Theoretical Issues in Natural Language Processing (TINLAP 1), Cambridge, 70–73.
Busemann S.: 1993, A Holistic View of Lexical Choice, in H. Horacek and M. Zock (eds) New Concepts in Natural Language Generation. London: Pinter, 302–308.
Chevalier, M., J. Dansereau, and G. Poulin: 1978, TAUM-MÉTÉO: description du système. Technical report. Group taum, Université de Montréal.
Coulthard M. and M. Montgomery (eds): 1981, Studies in discource analysis. London: Routledge and Kegan Paul.
Cumming S.: 1986, The Lexicon in Text Generation. Technical Report ISI/RR-86-168. USC/Information Sciences Institute, Marina del Rey, California.
Dale R.: 1992, Generating Referring Expressions: Building Descriptions in a Domain of Objects and Processes. Cambridge, MA: The MIT Press.
Dale R. and E. Reiter: 1995, Computational Interpretation of the Gricean Maxims in the Generation of Referring Expressions, Cognitive Science, 19: 233–263.
Danlos, L.: 1995, Relations causales directes: discours, structure événementielle et coréférence événementielle, in L. Bouchard and L. Emirkanian (eds) Actes du congrès traitement automatique du français écrit: développements théoriques et applications, Montreal.
Danlos L.: 1987, The Linguistic Basis of Text Generation. Cambridge: Cambridge University Press.
Danlos, L.: 1992, Support Verb Construction: Linguistic Properties, Representation, Translation. Journal of French Language Studies 2.
Danlos L.: 1994, Coder des informations monolingues sur les noms pour éviter des règles bilingues sensibles au contexte, Langages 116: 95–110.
De Smedt K.: 1990, IPF: An Incremental Parallel Formulator, in R. Dale et al. (eds) Current Research in Natural Language Processing. London: Academic Press, 167–192.
DiMarco, C., G. Hirst, and M. Stede: 1993, The Semantic and Stylistic Differentiation of Synonyms and Near-Synonyms, in Working Notes of the AAAI Spring Symposium on Building Lexicons for Machine Translation, Stanford.
Dorna, M. and M.C. Emele: 1996, Semantic-Based Transfer, in Proceedings of the 16th International Conference on Computational Linguistics, Copenhagen.
Dorr B.J.: 1993, Machine Translation: A View from the Lexicon. Cambridge, MA: The MIT Press.
Dorr B.J.: 1994, Machine Translation Divergences: A Formal Description and Proposed Solution, Computational Linguistics 20: 579–634.
Dorr, B.J., C.R. Voss, E. Peterson, and M. Kiker: 1994, Concept-Based Lexical Selection, in Proceedings of the AAAI Fall Symposium on Knowledge Representation for Natural Language Processing in Implemented Systems, New Orleans, LA.
Ducrot O.: 1983, Le sens commun: Le dire et le dire. Paris: Les Editions de Minuit.
Elhadad, M.: 1992, Using Argumentation to Control Lexical Choice: A Functional Unification Implementation. Doctoral dissertation. Columbia University.
Elhadad M. and J. Robin: 1992, Controlling Content Realization with Functional Unification Grammars, in R. Dale et al. (eds) Aspects of Automated Natural Language Generation. Berlin: Springer Verlag, 89–104.
Erbach G. and B. Krenn: 1993, Idioms and Support Verb Constructions in HPSG, in J. Nerbonne et al. (eds) German Grammar in HPSG. Stanford: CSLI Lecture Notes.
Fawcett R.P., G.H. Tucker, and L. Yuen: 1993, How a Systemic-Functional Grammar Works, in H. Horacek and M. Zock (eds) New Concepts in Natural Language: Planning, Realization and Systems, London: Pinter, 114–186.
Firth J.R.: 1957, Modes of Meaning. In Papers in Linguistics 1934–1951. Oxford: Oxford University Press, 190–215.
Goldman N.: 1975, Conceptual Generation, in R.C. Schank (ed) Conceptual Information Processing. Amsterdam: North-Holland Publishing Company, 289–372.
Gross, M.: 1984, Lexicon-Grammar and the Syntactic Analysis of French, in Proceedings of the 10th International Conference on Computational Linguistics, Stanford, CA, 275–282.
Grote, B., N. Lenke, and M. Stede: 1995, Ma(r)king Concessions in English and German, in Proceedings of the 5th European Workshop on Natural Language Generation, Leiden, 11–32.
Halliday M.A.K.: 1961, Categories of the Theory of Grammar, Word 17: 241–292.
Heid, U. and S. Raab: 1989, Collocations in Multilingual Generation, in Proceedings of the 4th Conference of the European Chapter of the Association for Computational Linguistics, Manchester, 130–136.
Heylen, D., K.G. Maxwell, and M. Verhagen: 1994, Lexical Functions in Machine Translation, in Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, 1240–1244.
Hoeppner W., H. Horacek, and J. Moore: 1994, Notes of the Working Groups at the Dagstuhl Seminar on Principles of Natural Language Generation. Dagstuhl Seminar Report 93. Dagstuhl: International Conference and Research Center for Computer Science.
Horacek H.: 1987, Choice of Words in the Generation Process of a Natural Language Interface, Applied Artificial Intelligence 1: 117–132.
Horacek H.: 1992, An Integrated View of Text Planning, in R. Dale et al. (eds) Aspects of Automated Text Generation, Berlin: Springer Verlag, 29–44.
Horacek, H.: 1995, More on Generating Referring Expressions, in Proceedings of the 5th European Workshop on Natural Language Generation, Leiden, 43–58.
Horacek, H. and C. Pyka: 1988, Towards Bridging Two levels of Representation—Linking the Syntactic Functional and the Object-Oriented Paradigms, in Proceedings of the International Computer Science Conference, Hong Kong.
Horacek H. and M. Zock (eds): 1993, New Concepts in Natural Language Generation. London: Pinter.
Hovy E.H.: 1988a, Generating Natural Language under Pragmatic Constraints. Hillsdale, NJ: Lawrence Erlbaum Associates.
Hovy E.H.: 1988b, Generating Language with a Phrasal Lexicon, in D.D. McDonald and L. Bolc (eds) Natural Language Generation Systems, New York: Springer Verlag, 353–384.
Iordanskaja, L.N., M. Kim, R. Kittredge, B. Lavoie, and A. Polguère: 1992, Generation of Extended Bilingual Statistical Reports, in Proceedings of the 14th International Conference on Computational Linguistics, Nantes, 1019–1022.
Iordanskaja L.N., M. Kim, and A. Polguère: 1996, Some Procedural Problems in the Implementation of Lexical Functions for Text Generation, in L. Wanner (ed.) Lexical Functions in Lexicography and Natural Language Processing, Amsterdam: Benjamins Academic Publishers, 279–297.
Iordanskaja L.N., R. Kittredge, and A. Polguère: 1991, Lexical Selection and Paraphrase in a Meaning-Text Generation Model, in C.L. Paris et al. (eds) Natural Language Generation in Artificial Intelligence and Computational Linguistics, Dordrecht: Kluwer Academic Publishers, 293–312.
Jacobs, P.S.: 1985a, PHRED: A Generator for Natural Language Interfaces. Technical report UCB/CSD 85/198. University of California at Berkeley.
Jacobs, P.S.: 1985b, A Knowledge-Based Approach to Language Production. Technical report UCB/CSD 86/254. University of California at Berkeley.
Jackendoff R.: 1990, Semantic Structures. Cambridge, MA: The MIT Press.
Jones M. and K. McCoy: 1992, Transparently-Motivated Metaphor Generation, in R. Dale et al. (eds) Aspects of Automated Natural Language Generation, Berlin: Springer Verlag, 231–246.
Kantrowitz M. and J. Bates: 1992, Integrated Natural Language Systems. In R. Dale et al. (eds) Aspects of Automated Natural Language Generation, Berlin: Springer Verlag, 13–28.
Kilger A. and W. Finkler: 1994, TAG-based Incremental Generation. Technical Report, German Center for Artificial Intelligence, Saarbrücken.
Kittredge, R.: 1995, Efficiency vs. Generality in Interlingual Design: Some Linguistic Considerations. In Working Notes of the Multilingual Text Generation Workshop at the 14th International Joint Conference on Artificial Intelligence, Montreal, 64–74.
Kukich, K.: 1983, Knowledge-Based Report Generation: A Knowledge-Engineering Approach to Natural Language Report Generation. Doctoral dissertation. University of Pittsburgh.
Kukich K.: 1987, Where Do Phrases Come From: Some Preliminary Experiments in Connectionist Phrase Generation, In G. Kempen (ed) Natural Language Generation: New Results in Artificial Intelligence, Psychology and Linguistics, Dordrecht: Martinus Nijhoff, 405–421.
Lakoff G. and M. Johnson: 1980, Metaphors We Live by. Chicago: Chicago University Press.
Lee W. and M. Evens: 1996, Generating Cohesive Text Using Lexical Functions, in L. Wanner (ed.) Lexical Functions in Lexicography and Natural Language Processing, Amsterdam: Benjamins Academic Publishers, 299–306.
Levelt W.J.M. and H. Schriefers: 1987, Stages of Lexical Access, in G. Kempen (ed.) Natural Language Generation: Recent Advances in Artificial Intelligence, Psychology and Linguistics, Dordrecht: Martinus Nijhoff, 395–404.
Levin B.: 1993, English Verb Classes and Alternations: A Preliminary Investigation. Chicago: Chicago University Press.
Levin B. and M. Rappaport Hovav: 1995, Unaccusativity: At the Syntax-Lexical Semantics Interface. Cambridge, MA: The MIT Press.
Marcus M.: 1987, Generation Systems Should Choose Their Words, in Y. Wilks (ed) Theoretical Issues in Natural Language Processing 3, Las Cruces, New Mexico State University, 211–214.
Matthiessen C.M.I.M.: 1991, Lexico (grammatical) Choice in Text Generation, in C.L. Paris et al. (eds) Natural Language Generation in Artificial Intelligence and Computational Linguistics, Dordrecht: Kluwer Academic Publishers, 249–292.
McDonald D.D.: 1991, On the Place of Words in the Generation Process, in C.L. Paris et al. (eds) Natural Language Generation in Artificial Intelligence and Computational Linguistics. Dordrecht: Kluwer Academic Publishers, 229–247.
McDonald, D.D.: 1995, Lexical Discontinuities in the Functional Meaning of Words, in Working Notes of the Multilingual Text Generation Workshop at the 14th International Joint Conference on Artificial Intelligence, Montreal, 110–118.
McDonald, D.D. and J. Pustejovsky: 1985, Description-Directed Natural Language Generation, in Proceedings of the 9th International Joint Conference on Artificial Intelligence, 799–805.
McKeown K.R., M. Elhadad, Y. Fukumoto, J. Lim, C. Lombardi, J. Robin, and F. Smadja: 1990, Natural Language Generation in COMET, in R. Dale et al. (eds) Current Research in Natural Language Generation, London: Academic Press, 103–139.
McKeown, K.R., J. Robin, and M. Tanenblatt: 1993, Tailoring Lexical Choice to the User's Vocabulary in Multimedia Explanation Generation. In Proceedings of the 31th Annual Meeting of the Accociation for Computational Linguistics, 226–234.
Mel'čuk I.A.: 1995, Phrasemes in Language and Phraseology in Linguistics. In M. Everaert et al. (eds) Idioms. Structural and Psychological Perspectives, Hillsdale, NJ: Lawrence Erlbaum Associates, 167–232.
Mel'čuk I.A.: 1981, Meaning-text Models: A Recent Trend in Soviet Linguistics, Annual Review of Anthropology 10: 27–62.
Mel'čuk I.A.: 1988, Paraphrase et lexique dans la théorie linguistique Sens-Texte, Cahiers de Lexicologie 52(1,2): 5–50, 5–53.
Mesli, N.: 1991, Funktionsverbgefüge in der maschinellen Analyse und Übersetzung: linguistische Beschreibung und Implementierung im cat2-Formalismus. Eurotra-d Working Paper No. 19. Institut für Angewandte Informationsforschung (IAI), Universität des Saarlandes.
Meteer M.: 1991, Bridging the Generation Gap Between Text Planning and Linguistic Realization, Computational Intelligence 7: 296–304.
Meteer M.: 1992, Expressibility and the Problem of Efficient Text Planning. London: Pinter.
Meya, M.: 1990, Tenets for an Interlingual Representation of Definite NPs, in Proceedings of the 13th International Conference on Computational Linguistics, Helsinki, 263–269.
Nirenburg S. and C. Defrise: 1993, Lexical and Conceptual Structures for Knowledge Based Translation, in J. Pustejovsky (ed) Semantics and the Lexicon, Kluwer Academic Publishers: Dordrecht, 291–323.
Nirenburg, S., V. Lesser, and E. Nyberg: 1989, Controlling a Language Generation Planner, in Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit, 1524–1530.
Nirenburg, S. and I. Nirenburg: 1988, A Framework for Lexical Selection in Natural Language Generation, in Proceedings of the 12th International Conference on Computational Linguistics, Budapest.
Nogier J.-F. and M. Zock: 1992, Lexical Choice as Pattern Matching, in T. Nagle et al. (eds) Conceptual Structures: Current Research and Practice, New York: Ellis Horwood.
Novak H.-J.: 1988, Generating Referring Phrases in a Dynamic Environment, in M. Zock and G. Sabah (eds) Advances in Natural Language Generation, London: Pinter, 76–85.
O'Donnell, M.J.: 1994, Sentence Analysis and Generation: A Systemic Perspective. Doctoral dissertation. University of Sydney.
Panaget, F.: 1994, Using a Textual Representational Level Component in the Context of Discourse or Dialogue Generation. In Proceedings of the 7th International Workshop on Natural Language Generation, Kennebunkport, NE, 127–136.
Penman: 1989. The Penman Documentation. Technical Report. USC/Information Sciences Institute, Marina del Rey, CA.
Polguère, A.: 1990, Structuration et mise en jeu procédurale d'un modèle linguistique déclaratif dans un cardre de génération de texte. Doctoral dissertation. Université de Montréal.
Prince E.: 1981, A Taxonomy of Given-New Information, in P. Cole (ed.) Radical Pragmatics. New York: Academic Press, 223–255.
Pustejovsky J.: 1995, The Generative Lexicon. Cambridge, MA: The MIT Press.
Pustejovsky, J. and Nirenburg, S.: 1987, Lexical Selection in the Process of Language Generation, in Proceedings of the 25th Annual Meeting of the Association for Computational Linguistics, 201–206.
Reiter, E.: 1990, Generating Appropriate Natural Language Object Descriptions. Doctoral dissertation. Harvard University.
Reiter E.: 1991, A New Model of Lexical Choice for Nouns, Computational Intelligence 7: 240–251.
Reithinger N.: 1992, Eine parallele Architektur zur inkrementellen Generierung multimodaler Dialogbeiträge, St. Augustin: Infix Verlag.
Robin, J.: 1990, Lexical Choice in Natural Language Generation. Technical Report CUCS-040-90. Columbia University.
Rosch E.: 1978, Principles of categorization, in E. Rosch and B. Lloyd (eds) Cognition and Categorization. Hillsdale, NJ: Lawrence Erlbaum Associates, 27–48.
Schank R.: 1975, Conceptual Information Processing. Amsterdam: North Holland.
Sharp, R. and O. Streiter: 1992, Simplifying the Complexity of Machine Translation, Meta 93(4).
Sigurd, B. and B. Gawrońska: 1994, Modals as a Problem for MT, in Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, 120–124.
Smadja F. and K. McKeown: 1991, Using Collocations for Language Generation, Computational Intelligence 7: 229–239.
Sobashima, Y., O. Furuse, S. Akamine, J. Kawai, and H. Iida: 1994, A Bidirectional, Transfer-Driven Machine Translation System for Spoken Dialogues, in Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, 64–68.
Sondheimer N., S. Cumming, and R. Albano: 1990, How to Realize a Concept: Lexical Selection and the Conceptual Network in Text Generation, Machine Translation 5: 57–78.
Sowa J.: 1984, Conceptual Structures: Information Processing in Mind and Machine. New York: Addison Wesley.
Stede, M.: 1996, A Generative Perspective on Verbs and their Readings, in Proceedings of the 8th International Workshop on Natural Language Generation, Herstmonceux, 141–150.
Stede M.: 1995, Lexicalization in Natural Language Generation: A Survey, Artificial Intelligence Review 8: 309–336.
Steinacker, I. and E. Buchberger: 1983, Relating Syntax and Semantics: The Syntactico-Semantic Lexicon of the System VIE-LANG, in Proceedings of the 1st Conference of the European Chapter of the Association for Computational Linguistics, Pisa.
Tutin, A.: 1992, Études des anaphores grammaticales et lexicales dans la perspective de la génération automatique dans des textes de procédures. Doctoral dissertation. Université de Montréal.
Tutin, A. and R. Kittredge: 1992, Lexical Choice in Context: Generating Procedural Texts, in Proceedings of the 14th International Conference on Computational Linguistics, Budapest, 763–769.
Vander Linden K. and J.H. Martin: 1995, Expressing Rhetorical Relations in Instructional Text: A Case Study of the Purpose Relation, Computational Linguistics 21: 29–57.
Van Noord G., J. Dorrepaal, P.Vander Eijk, M. Florenza, H. Ruessink, and L.des Tombes: 1991, An Overview of MiMo2, Machine Translation 6: 201–214.
Wanner, L.: 1994, Building Another Bridge over the Generation Gap, in Proceedings of the 7th International Workshop on Natural Language Generation, Kennebunkport, NE, 137–144.
Wanner, L. and J.A. Bateman: 1990, A Collocational Based Approach to Salience-Sensitive Lexical Selection, in Proceedings of the 5th International Workshop on Natural Language Generation, Leiden, 31–38.
Wanner, L. and E.H. Hovy: 1996, The HealthDoc Sentence Planner. In Proceedings of the 8th International Workshop on Natural Language Generation, Herstmonceux, 1–10.
Ward N.: 1994, A Connectionist Language Generator. Norwood: Ablex Publishing Corporation.
Wasow, T., I. Sag, and G. Nunberg: 1983, Idioms: An Interim Report, in S. Hattori and K. Inoue (eds) Proceedings of the XIIth International Congress of Linguistics, Tokyo.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wanner, L. Lexical choice in text generation and machine translation. Machine Translation 11, 3–35 (1996). https://doi.org/10.1007/BF00349352
Issue Date:
DOI: https://doi.org/10.1007/BF00349352