Abstract
Our study fits the teaching method FOS [27] for Chinese cookers who come to work in French restaurants or who have chosen catering as a specialty. French gastronomy is internationally renowned, and the expressions of the cooking dishes often have informative and explanatory functions, and the large number of expressions of the cooking dishes that have the same structure (nominal group + à la/à l ’/ au/aux + name) attract our attention. Before teaching these expressions to our learners, the extraction of these expressions constitutes the problem of our study. Indeed, the extraction of multiple words sequences is also a problem in NLP (automatic language processing). In order to choose an appropriate software for our study, in this article, we first modeled the lexical data of these expressions of dishes having the same structure (nominal group + à la/à l ’/ au/aux + name) to choose NooJ as our appropriate software from among NLP extraction software. Then, we implemented the lexical data in NooJ by developing some grammars in the form of a transducer and will compile our data respecting the codes of the NooJ, in particular the operator + EXCLUDE (prefixed by the character “/”) and the colored nodes (prefixed by the character “:”). Finally, the result of our extraction with NooJ was satisfactory for teaching.
This article is supported by the Fundamental Research Funds for the Central Universities (JB2021059).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
These twenty short films are made within the framework of partnership agreements with the Marguerite Yourcenar hotel school in Beuvry (The town of Beuvry is located in the Pas-de-Calais department in the Nord-Pas-de-Calais-Picardie region). These videos were donated by J-M. Mangiante.
- 2.
Lexicoscope, an online tool for studying combinatorial profiles and extracting lexico-syntactic constructs: http://phraseotext.univ-grenoble-alpes.fr/lexicoscope/ (accessed 08/10/2020).
References
Antony, L.: AntConc: design and development of a freeware corpus analysis toolkit for the technical writing classroom. In: Professional Communication Conference Proceedings, pp. 729–737 (2005)
Aussenac-Gilles, N., Condamines, A.: Entre textes et ontologies formelles: les Bases de Connaissances Terminologiques. In: Zacklad, M., Grundstein, M. (eds.) Ingénierie et capitalisation des connaissances, pp. 153–176. Hermès (2001)
Binon, J., Verlinde, S.: L’enseignement/apprentissage du vocabulaire et la lexicographie pédagogique du français sur objectifs spécifique (FOS): le domaine du français des affaires. Études de linguistique appliquée 3(135), 271–283 (2004)
Brunet, É.: Hyperbase, Logiciel documentaire et statistique pour la création et l’exploitation de bases hypertextuelles, Manuel de référence, Université de Nice (2006)
Carras, C., Tolas, J., Kohler, P., Sjilagyi, E.: Le français sur objectifs spécifiques et la classe de langue. CLE International (2007)
Challe, O.: Le français de spécialités. CLE International (2000)
Chomsky, N., Schützenberger, M.P.: The algebraic theory of context-free languages. In: Braffort, P., Hirschberg, D. (eds.) Computer Programming and Formal Systems, pp. 118–161. North-Holland, Amsterdam (1963)
Choueka, Y., Klein, S.T., Neuwitz, E.: Automatic retrieval of frequent idiomatic and collocational expressions in a large corpus. J. Assoc. Literary Linguist. Comput. 4, 34–38 (1983)
Church, K.W., Hanks, P., Hindle, D.: Using statistics in lexical analysis. In: Zernik, U. (ed.) Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon, pp. 115–164. Lawrence Erlbaum, Hillsdale (1991)
Curado, F.A.: Lexical behaviour in academic and technical corpora: implications for ESP development. Lang. Learn. Technol. 5(3), 106–129 (2001)
Dubois, J.: La nouvelle grammaire du français, Larousse (1973)
Dumas, A.: Le grand dictionnaire de Cuisine, Couverture reliée, Jaquette illustrée couleur, gravures et planches noir et blanc, préface: Daniel Zimmermann. Phébus (2000)
Fiala, P., Habert, B., Lafon, P., Pineira, C.: Des mots aux syntagmes, figements et variations dans la Résolution générale du congrès de la CGT de 1978. Mots 14, 45–87 (1987)
Fleury, S.: Le Métier Textométrique: Le Trameur, Manuel d’utilisation (2007). http://tal.univ-paris3.fr/trameur/
Gledhill, C.: Collocations in Science Writing. Gunter Narr Verlag, Tübingen (2000)
Goldman, J.P., Nerima, L., Wehrli, E.: Collocation extraction using a syntactic parser. In: Proceedings of the ACL 2001 Workshop on Collocation, pp. 61–66 (2001)
Grefenstette, G., Teufel, S.: Corpus-based method for automatic identification of support verbs for nominalizations. In: Proceedings of the Seventh Conference of the European Chapter of the Association for Computational Linguistics, pp. 98–103 (1995)
Heiden, S., Magué, J.P., Pincemin, B.: TXM: une plateforme logicielle open-source pour la textométrie – conception et développement. In: JADT 2010, vol. 2, pp. 1021–1032. Edizioni Universitarie di Lettere Economia, Roma (2010)
Kilgarriff, A., Tugwell, D.: WORD SKETCH: extraction, combination and display of significant collocations for lexicography. In: Proceedings of the Workshop on Collocations: Computational Extraction, Analysis and Exploitation, ACL-EACL 2001, pp. 32–38 (2001)
Lafon, P., Salem, A.: L’inventaire des segments répétés d’un texte. Mots 6(1), 161–177 (1983). https://www.persee.fr/doc/mots_0243-6450_1983_num_6_1_1101
Lebart, L.: DtmVic: data and text mining - visualization, inference, classification. Exploratory statistical processing of complex data sets comprising both numerical and textual data (2015). http://www.dtmvic.com/
Lewis, M.: There is nothing as practical as a good theory. In: Lewis, M. (eds.) Teaching Collocation: Further Developments in the Lexical Approach, Language Teaching Publications, pp. 10–27 (2000)
Lin, D.: Extracting collocations from text corpora. In: First Workshop on Computational Terminology, Montréal, pp. 57–63 (1998)
Longrée, D., Luong, X., Mellet, S.: Les motifs: un outil pour la caractérisation topologique des textes. In: Heiden, S., Pincemin, B., Vosghanian, L., l’équipe Lexicometrica (eds.) Actes des 9ème Journées internationales d’analyse statistique des données textuelles – JADT 2008, pp. 733–744 (2008)
Longrée, D., Mellet, S.: Le motif: une unité phraséologique englobante? Étendre le champ de la phraséologie de la langue au discours. Langages 189, 68–80 (2013)
Macwhinney, B.: The CHILDES Project: Tools for Analyzing Talk. Mahwah, Lawrence Erlbaum Associates, New Jersey (2000)
Mangiante, J.M., Parpette, C.: Le français sur objectif spécifique: de l’analyse de besoins à l’élaboration d’un cours. Hachette (2004)
Mel’čuk, I., Arbatchewsky-Jumarie, N., Iordanskaja, L., Mantha, S., Polguère, A.: Dictionnaire explicatif et combinatoire du français contemporain: recherches lexico-sémantiques IV. Les Presses de l’Université de Montréal (1999)
Née, É., Sitri, F., Veniard, M.: Les routines, une catégorie pour l’analyse de discours: le cas des rapports éducatifs. Lidil 53, 71–93 (2016)
Quiniou, S., Cellier, P., Charnois, T., Legallois, D.: What about sequential data mining techniques to identify linguistic patterns for stylistics? In: Gelbukh, A. (ed.) CICLing 2012. LNCS, vol. 7181, pp. 166–177. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28604-9_14
Salem, A.: Pratique des segments répétés: Essai de statistique textuelle. Klincksieck, Paris (1987)
Silberztein, M.: NooJ: an oriented object approach. In: Royauté, J., Silberztein, M. (eds.) INTEX pour la Linguistique et le Traitement Automatique des Langues. Presses universitaires de Franche-Comté, Besançon (2002)
Silberztein, M.: Formalizing Natural Languages: The NooJ Approach. Wiley (2016)
Smadja, F.: Retrieving collocations form text: Xtract. Comput. Linguist. 19(1), 143–177 (1993)
Tutin, A.: Pour une modélisation dynamique des collocations dans les textes. In: Actes d’Euralex, Lorient, pp. 207–221 (2004)
Tutin, A., Kraif, O.: Routines sémantico-rhétoriques dans l’écrit scientifique de sciences humaines: l’apport des arbres lexico-syntaxiques récurrents. Lidil 53, 119–141 (2016)
Tutin, A.: La phraséologie transdisciplinaire des écrits scientifiques: des collocations aux routines sémantico-rhétoriques. In: Tutin, A., Grossmann, F. (eds.) L’écrit scientifique: du lexique au discours. Autour de Scientext, pp. 27–44. Presses universitaires de Rennes (2013)
Wilson, A.: Les Tables du Monde: Recettes et Ingrédients de A à Z. Könemann, Cologne (1977)
Yang, T.: Constitution et exploitation d’une base de données pour l’enseignement/apprentissage des phrasèmes NAdj du domaine culinaire auprès d’apprenants non-natifs. Thèse de doctorat. Université Sorbonne Nouvelle – Paris 3 (2019)
Yang, T.: Automatic extraction of verbal phrasemes in the culinary field with NooJ. In: Mauro Mirto, I., Monteleone, M., Silberztein, M. (eds.) NooJ 2018. CCIS, vol. 987, pp. 83–94. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-10868-7_8
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Yang, T. (2021). Using NooJ to Formalize French Cooking Expressions. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-92861-2_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92860-5
Online ISBN: 978-3-030-92861-2
eBook Packages: Computer ScienceComputer Science (R0)