Using NooJ to Formalize French Cooking Expressions

Yang, Tong

doi:10.1007/978-3-030-92861-2_13

Tong Yang⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1520))

Included in the following conference series:

International Conference on Automatic Processing of Natural-Language Electronic Texts with NooJ

215 Accesses

Abstract

Our study fits the teaching method FOS [27] for Chinese cookers who come to work in French restaurants or who have chosen catering as a specialty. French gastronomy is internationally renowned, and the expressions of the cooking dishes often have informative and explanatory functions, and the large number of expressions of the cooking dishes that have the same structure (nominal group + à la/à l ’/ au/aux + name) attract our attention. Before teaching these expressions to our learners, the extraction of these expressions constitutes the problem of our study. Indeed, the extraction of multiple words sequences is also a problem in NLP (automatic language processing). In order to choose an appropriate software for our study, in this article, we first modeled the lexical data of these expressions of dishes having the same structure (nominal group + à la/à l ’/ au/aux + name) to choose NooJ as our appropriate software from among NLP extraction software. Then, we implemented the lexical data in NooJ by developing some grammars in the form of a transducer and will compile our data respecting the codes of the NooJ, in particular the operator + EXCLUDE (prefixed by the character “/”) and the colored nodes (prefixed by the character “:”). Finally, the result of our extraction with NooJ was satisfactory for teaching.

This article is supported by the Fundamental Research Funds for the Central Universities (JB2021059).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
These twenty short films are made within the framework of partnership agreements with the Marguerite Yourcenar hotel school in Beuvry (The town of Beuvry is located in the Pas-de-Calais department in the Nord-Pas-de-Calais-Picardie region). These videos were donated by J-M. Mangiante.
2.
Lexicoscope, an online tool for studying combinatorial profiles and extracting lexico-syntactic constructs: http://phraseotext.univ-grenoble-alpes.fr/lexicoscope/ (accessed 08/10/2020).

References

Antony, L.: AntConc: design and development of a freeware corpus analysis toolkit for the technical writing classroom. In: Professional Communication Conference Proceedings, pp. 729–737 (2005)
Google Scholar
Aussenac-Gilles, N., Condamines, A.: Entre textes et ontologies formelles: les Bases de Connaissances Terminologiques. In: Zacklad, M., Grundstein, M. (eds.) Ingénierie et capitalisation des connaissances, pp. 153–176. Hermès (2001)
Google Scholar
Binon, J., Verlinde, S.: L’enseignement/apprentissage du vocabulaire et la lexicographie pédagogique du français sur objectifs spécifique (FOS): le domaine du français des affaires. Études de linguistique appliquée 3(135), 271–283 (2004)
Google Scholar
Brunet, É.: Hyperbase, Logiciel documentaire et statistique pour la création et l’exploitation de bases hypertextuelles, Manuel de référence, Université de Nice (2006)
Google Scholar
Carras, C., Tolas, J., Kohler, P., Sjilagyi, E.: Le français sur objectifs spécifiques et la classe de langue. CLE International (2007)
Google Scholar
Challe, O.: Le français de spécialités. CLE International (2000)
Google Scholar
Chomsky, N., Schützenberger, M.P.: The algebraic theory of context-free languages. In: Braffort, P., Hirschberg, D. (eds.) Computer Programming and Formal Systems, pp. 118–161. North-Holland, Amsterdam (1963)
Chapter Google Scholar
Choueka, Y., Klein, S.T., Neuwitz, E.: Automatic retrieval of frequent idiomatic and collocational expressions in a large corpus. J. Assoc. Literary Linguist. Comput. 4, 34–38 (1983)
Google Scholar
Church, K.W., Hanks, P., Hindle, D.: Using statistics in lexical analysis. In: Zernik, U. (ed.) Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon, pp. 115–164. Lawrence Erlbaum, Hillsdale (1991)
Google Scholar
Curado, F.A.: Lexical behaviour in academic and technical corpora: implications for ESP development. Lang. Learn. Technol. 5(3), 106–129 (2001)
Google Scholar
Dubois, J.: La nouvelle grammaire du français, Larousse (1973)
Google Scholar
Dumas, A.: Le grand dictionnaire de Cuisine, Couverture reliée, Jaquette illustrée couleur, gravures et planches noir et blanc, préface: Daniel Zimmermann. Phébus (2000)
Google Scholar
Fiala, P., Habert, B., Lafon, P., Pineira, C.: Des mots aux syntagmes, figements et variations dans la Résolution générale du congrès de la CGT de 1978. Mots 14, 45–87 (1987)
Article Google Scholar
Fleury, S.: Le Métier Textométrique: Le Trameur, Manuel d’utilisation (2007). http://tal.univ-paris3.fr/trameur/
Gledhill, C.: Collocations in Science Writing. Gunter Narr Verlag, Tübingen (2000)
Google Scholar
Goldman, J.P., Nerima, L., Wehrli, E.: Collocation extraction using a syntactic parser. In: Proceedings of the ACL 2001 Workshop on Collocation, pp. 61–66 (2001)
Google Scholar
Grefenstette, G., Teufel, S.: Corpus-based method for automatic identification of support verbs for nominalizations. In: Proceedings of the Seventh Conference of the European Chapter of the Association for Computational Linguistics, pp. 98–103 (1995)
Google Scholar
Heiden, S., Magué, J.P., Pincemin, B.: TXM: une plateforme logicielle open-source pour la textométrie – conception et développement. In: JADT 2010, vol. 2, pp. 1021–1032. Edizioni Universitarie di Lettere Economia, Roma (2010)
Google Scholar
Kilgarriff, A., Tugwell, D.: WORD SKETCH: extraction, combination and display of significant collocations for lexicography. In: Proceedings of the Workshop on Collocations: Computational Extraction, Analysis and Exploitation, ACL-EACL 2001, pp. 32–38 (2001)
Google Scholar
Lafon, P., Salem, A.: L’inventaire des segments répétés d’un texte. Mots 6(1), 161–177 (1983). https://www.persee.fr/doc/mots_0243-6450_1983_num_6_1_1101
Lebart, L.: DtmVic: data and text mining - visualization, inference, classification. Exploratory statistical processing of complex data sets comprising both numerical and textual data (2015). http://www.dtmvic.com/
Lewis, M.: There is nothing as practical as a good theory. In: Lewis, M. (eds.) Teaching Collocation: Further Developments in the Lexical Approach, Language Teaching Publications, pp. 10–27 (2000)
Google Scholar
Lin, D.: Extracting collocations from text corpora. In: First Workshop on Computational Terminology, Montréal, pp. 57–63 (1998)
Google Scholar
Longrée, D., Luong, X., Mellet, S.: Les motifs: un outil pour la caractérisation topologique des textes. In: Heiden, S., Pincemin, B., Vosghanian, L., l’équipe Lexicometrica (eds.) Actes des 9ème Journées internationales d’analyse statistique des données textuelles – JADT 2008, pp. 733–744 (2008)
Google Scholar
Longrée, D., Mellet, S.: Le motif: une unité phraséologique englobante? Étendre le champ de la phraséologie de la langue au discours. Langages 189, 68–80 (2013)
Google Scholar
Macwhinney, B.: The CHILDES Project: Tools for Analyzing Talk. Mahwah, Lawrence Erlbaum Associates, New Jersey (2000)
Google Scholar
Mangiante, J.M., Parpette, C.: Le français sur objectif spécifique: de l’analyse de besoins à l’élaboration d’un cours. Hachette (2004)
Google Scholar
Mel’čuk, I., Arbatchewsky-Jumarie, N., Iordanskaja, L., Mantha, S., Polguère, A.: Dictionnaire explicatif et combinatoire du français contemporain: recherches lexico-sémantiques IV. Les Presses de l’Université de Montréal (1999)
Google Scholar
Née, É., Sitri, F., Veniard, M.: Les routines, une catégorie pour l’analyse de discours: le cas des rapports éducatifs. Lidil 53, 71–93 (2016)
Article Google Scholar
Quiniou, S., Cellier, P., Charnois, T., Legallois, D.: What about sequential data mining techniques to identify linguistic patterns for stylistics? In: Gelbukh, A. (ed.) CICLing 2012. LNCS, vol. 7181, pp. 166–177. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28604-9_14
Chapter Google Scholar
Salem, A.: Pratique des segments répétés: Essai de statistique textuelle. Klincksieck, Paris (1987)
Google Scholar
Silberztein, M.: NooJ: an oriented object approach. In: Royauté, J., Silberztein, M. (eds.) INTEX pour la Linguistique et le Traitement Automatique des Langues. Presses universitaires de Franche-Comté, Besançon (2002)
Google Scholar
Silberztein, M.: Formalizing Natural Languages: The NooJ Approach. Wiley (2016)
Google Scholar
Smadja, F.: Retrieving collocations form text: Xtract. Comput. Linguist. 19(1), 143–177 (1993)
Google Scholar
Tutin, A.: Pour une modélisation dynamique des collocations dans les textes. In: Actes d’Euralex, Lorient, pp. 207–221 (2004)
Google Scholar
Tutin, A., Kraif, O.: Routines sémantico-rhétoriques dans l’écrit scientifique de sciences humaines: l’apport des arbres lexico-syntaxiques récurrents. Lidil 53, 119–141 (2016)
Article Google Scholar
Tutin, A.: La phraséologie transdisciplinaire des écrits scientifiques: des collocations aux routines sémantico-rhétoriques. In: Tutin, A., Grossmann, F. (eds.) L’écrit scientifique: du lexique au discours. Autour de Scientext, pp. 27–44. Presses universitaires de Rennes (2013)
Google Scholar
Wilson, A.: Les Tables du Monde: Recettes et Ingrédients de A à Z. Könemann, Cologne (1977)
Google Scholar
Yang, T.: Constitution et exploitation d’une base de données pour l’enseignement/apprentissage des phrasèmes NAdj du domaine culinaire auprès d’apprenants non-natifs. Thèse de doctorat. Université Sorbonne Nouvelle – Paris 3 (2019)
Google Scholar
Yang, T.: Automatic extraction of verbal phrasemes in the culinary field with NooJ. In: Mauro Mirto, I., Monteleone, M., Silberztein, M. (eds.) NooJ 2018. CCIS, vol. 987, pp. 83–94. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-10868-7_8
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

North China Electric Power University, Beijing, China
Tong Yang

Authors

Tong Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tong Yang .

Editor information

Editors and Affiliations

Université de Franche-Comté, Besançon, France
Magali Bigey
Université de Franche-Comté, Besançon, France
Annabel Richeton
Université de Franche-Comté, Besançon, France
Max Silberztein
Université de Franche-Comté, Besançon, France
Izabella Thomas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, T. (2021). Using NooJ to Formalize French Cooking Expressions. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-92861-2_13
Published: 03 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92860-5
Online ISBN: 978-3-030-92861-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics