Part of the book series: Communications in Computer and Information Science (CCIS, volume 1520)

Abstract

Our study follows the Français sur Objectifs Spécifiques (FOS) teaching method [27] and is aimed at Chinese cooks who come to work in French restaurants or who have chosen catering as a specialty. French gastronomy is internationally renowned, and the names of dishes often serve informative and explanatory functions. A large number of them share the same structure (nominal group + à la/à l’/au/aux + noun), which drew our attention. Before these expressions can be taught to learners, they must first be extracted, and this extraction is the problem our study addresses. The extraction of multiword sequences is also a known problem in natural language processing (NLP). In this article, we first model the lexical data of the dish names that share this structure in order to choose among NLP extraction tools, and we select NooJ as the appropriate software. We then implement the lexical data in NooJ by developing grammars in the form of transducers and compiling our data in accordance with NooJ's conventions, in particular the +EXCLUDE operator (prefixed by the character “/”) and colored nodes (prefixed by the character “:”). Finally, the extraction results obtained with NooJ were satisfactory for teaching purposes.
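The structure named in the abstract can be illustrated with a small sketch. This is not the NooJ grammar described in the chapter: it is a plain regular expression in Python, offered only to make the surface pattern concrete. The names `WORD`, `PATTERN`, and `extract_dishes` are hypothetical, and the nominal group is reduced to a single head word as a simplifying assumption, since recognizing a full nominal group requires part-of-speech information of the kind a linguistic tool supplies.

```python
import re

# Assumption: a "word" is a run of letters (accented letters included),
# optionally joined by hyphens. This is a rough stand-in for a lexical unit.
WORD = r"[^\W\d_]+(?:-[^\W\d_]+)*"

# Simplified surface pattern: head word + à la / à l' / au / aux + noun.
# The head is limited to one word; no part-of-speech check is performed.
PATTERN = re.compile(
    rf"\b(?P<head>{WORD}) "
    rf"(?P<link>à la |à l[’']|au |aux )"
    rf"(?P<noun>{WORD})",
    re.IGNORECASE,
)


def extract_dishes(text):
    """Return (head, link, noun) triples for each candidate expression."""
    return [
        (m.group("head"), m.group("link").strip(), m.group("noun"))
        for m in PATTERN.finditer(text)
    ]


if __name__ == "__main__":
    menu = "Au menu : sole à la meunière, canard à l'orange et moules aux herbes."
    for triple in extract_dishes(menu):
        print(triple)
```

A purely string-based pattern like this one will also match sequences that are not dish names (for example, any word followed by "au" and another word), which is one reason a dictionary-backed grammar is better suited to the task.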

This article is supported by the Fundamental Research Funds for the Central Universities (JB2021059).

Notes

  1. These twenty short films were made under partnership agreements with the Marguerite Yourcenar hotel school in Beuvry, a town in the Pas-de-Calais department of the Nord-Pas-de-Calais-Picardie region. The videos were donated by J-M. Mangiante.

  2. Lexicoscope, an online tool for studying combinatorial profiles and extracting lexico-syntactic constructs: http://phraseotext.univ-grenoble-alpes.fr/lexicoscope/ (accessed 08/10/2020).

References

  1. Anthony, L.: AntConc: design and development of a freeware corpus analysis toolkit for the technical writing classroom. In: Professional Communication Conference Proceedings, pp. 729–737 (2005)

  2. Aussenac-Gilles, N., Condamines, A.: Entre textes et ontologies formelles: les Bases de Connaissances Terminologiques. In: Zacklad, M., Grundstein, M. (eds.) Ingénierie et capitalisation des connaissances, pp. 153–176. Hermès (2001)

  3. Binon, J., Verlinde, S.: L’enseignement/apprentissage du vocabulaire et la lexicographie pédagogique du français sur objectifs spécifiques (FOS): le domaine du français des affaires. Études de linguistique appliquée 3(135), 271–283 (2004)

  4. Brunet, É.: Hyperbase, Logiciel documentaire et statistique pour la création et l’exploitation de bases hypertextuelles, Manuel de référence, Université de Nice (2006)

  5. Carras, C., Tolas, J., Kohler, P., Szilagyi, E.: Le français sur objectifs spécifiques et la classe de langue. CLE International (2007)

  6. Challe, O.: Le français de spécialités. CLE International (2000)

  7. Chomsky, N., Schützenberger, M.P.: The algebraic theory of context-free languages. In: Braffort, P., Hirschberg, D. (eds.) Computer Programming and Formal Systems, pp. 118–161. North-Holland, Amsterdam (1963)

  8. Choueka, Y., Klein, S.T., Neuwitz, E.: Automatic retrieval of frequent idiomatic and collocational expressions in a large corpus. J. Assoc. Literary Linguist. Comput. 4, 34–38 (1983)

  9. Church, K.W., Hanks, P., Hindle, D.: Using statistics in lexical analysis. In: Zernik, U. (ed.) Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon, pp. 115–164. Lawrence Erlbaum, Hillsdale (1991)

  10. Curado, F.A.: Lexical behaviour in academic and technical corpora: implications for ESP development. Lang. Learn. Technol. 5(3), 106–129 (2001)

  11. Dubois, J.: La nouvelle grammaire du français, Larousse (1973)

  12. Dumas, A.: Le grand dictionnaire de cuisine. Préface: Daniel Zimmermann. Phébus (2000)

  13. Fiala, P., Habert, B., Lafon, P., Pineira, C.: Des mots aux syntagmes, figements et variations dans la Résolution générale du congrès de la CGT de 1978. Mots 14, 45–87 (1987)

  14. Fleury, S.: Le Métier Textométrique: Le Trameur, Manuel d’utilisation (2007). http://tal.univ-paris3.fr/trameur/

  15. Gledhill, C.: Collocations in Science Writing. Gunter Narr Verlag, Tübingen (2000)

  16. Goldman, J.P., Nerima, L., Wehrli, E.: Collocation extraction using a syntactic parser. In: Proceedings of the ACL 2001 Workshop on Collocation, pp. 61–66 (2001)

  17. Grefenstette, G., Teufel, S.: Corpus-based method for automatic identification of support verbs for nominalizations. In: Proceedings of the Seventh Conference of the European Chapter of the Association for Computational Linguistics, pp. 98–103 (1995)

  18. Heiden, S., Magué, J.P., Pincemin, B.: TXM: une plateforme logicielle open-source pour la textométrie – conception et développement. In: JADT 2010, vol. 2, pp. 1021–1032. Edizioni Universitarie di Lettere Economia, Roma (2010)

  19. Kilgarriff, A., Tugwell, D.: WORD SKETCH: extraction, combination and display of significant collocations for lexicography. In: Proceedings of the Workshop on Collocations: Computational Extraction, Analysis and Exploitation, ACL-EACL 2001, pp. 32–38 (2001)

  20. Lafon, P., Salem, A.: L’inventaire des segments répétés d’un texte. Mots 6(1), 161–177 (1983). https://www.persee.fr/doc/mots_0243-6450_1983_num_6_1_1101

  21. Lebart, L.: DtmVic: data and text mining - visualization, inference, classification. Exploratory statistical processing of complex data sets comprising both numerical and textual data (2015). http://www.dtmvic.com/

  22. Lewis, M.: There is nothing as practical as a good theory. In: Lewis, M. (ed.) Teaching Collocation: Further Developments in the Lexical Approach, pp. 10–27. Language Teaching Publications (2000)

  23. Lin, D.: Extracting collocations from text corpora. In: First Workshop on Computational Terminology, Montréal, pp. 57–63 (1998)

  24. Longrée, D., Luong, X., Mellet, S.: Les motifs: un outil pour la caractérisation topologique des textes. In: Heiden, S., Pincemin, B., Vosghanian, L., l’équipe Lexicometrica (eds.) Actes des 9ème Journées internationales d’analyse statistique des données textuelles – JADT 2008, pp. 733–744 (2008)

  25. Longrée, D., Mellet, S.: Le motif: une unité phraséologique englobante? Étendre le champ de la phraséologie de la langue au discours. Langages 189, 68–80 (2013)

  26. MacWhinney, B.: The CHILDES Project: Tools for Analyzing Talk. Lawrence Erlbaum Associates, Mahwah (2000)

  27. Mangiante, J.M., Parpette, C.: Le français sur objectif spécifique: de l’analyse de besoins à l’élaboration d’un cours. Hachette (2004)

  28. Mel’čuk, I., Arbatchewsky-Jumarie, N., Iordanskaja, L., Mantha, S., Polguère, A.: Dictionnaire explicatif et combinatoire du français contemporain: recherches lexico-sémantiques IV. Les Presses de l’Université de Montréal (1999)

  29. Née, É., Sitri, F., Veniard, M.: Les routines, une catégorie pour l’analyse de discours: le cas des rapports éducatifs. Lidil 53, 71–93 (2016)

  30. Quiniou, S., Cellier, P., Charnois, T., Legallois, D.: What about sequential data mining techniques to identify linguistic patterns for stylistics? In: Gelbukh, A. (ed.) CICLing 2012. LNCS, vol. 7181, pp. 166–177. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28604-9_14

  31. Salem, A.: Pratique des segments répétés: Essai de statistique textuelle. Klincksieck, Paris (1987)

  32. Silberztein, M.: NooJ: an oriented object approach. In: Royauté, J., Silberztein, M. (eds.) INTEX pour la Linguistique et le Traitement Automatique des Langues. Presses universitaires de Franche-Comté, Besançon (2002)

  33. Silberztein, M.: Formalizing Natural Languages: The NooJ Approach. Wiley (2016)

  34. Smadja, F.: Retrieving collocations from text: Xtract. Comput. Linguist. 19(1), 143–177 (1993)

  35. Tutin, A.: Pour une modélisation dynamique des collocations dans les textes. In: Actes d’Euralex, Lorient, pp. 207–221 (2004)

  36. Tutin, A., Kraif, O.: Routines sémantico-rhétoriques dans l’écrit scientifique de sciences humaines: l’apport des arbres lexico-syntaxiques récurrents. Lidil 53, 119–141 (2016)

  37. Tutin, A.: La phraséologie transdisciplinaire des écrits scientifiques: des collocations aux routines sémantico-rhétoriques. In: Tutin, A., Grossmann, F. (eds.) L’écrit scientifique: du lexique au discours. Autour de Scientext, pp. 27–44. Presses universitaires de Rennes (2013)

  38. Wilson, A.: Les Tables du Monde: Recettes et Ingrédients de A à Z. Könemann, Cologne (1977)

  39. Yang, T.: Constitution et exploitation d’une base de données pour l’enseignement/apprentissage des phrasèmes NAdj du domaine culinaire auprès d’apprenants non-natifs. Thèse de doctorat. Université Sorbonne Nouvelle – Paris 3 (2019)

  40. Yang, T.: Automatic extraction of verbal phrasemes in the culinary field with NooJ. In: Mauro Mirto, I., Monteleone, M., Silberztein, M. (eds.) NooJ 2018. CCIS, vol. 987, pp. 83–94. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-10868-7_8

Author information

Corresponding author

Correspondence to Tong Yang.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Yang, T. (2021). Using NooJ to Formalize French Cooking Expressions. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_13

  • DOI: https://doi.org/10.1007/978-3-030-92861-2_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-92860-5

  • Online ISBN: 978-3-030-92861-2

  • eBook Packages: Computer Science, Computer Science (R0)
