skip to main content
10.1145/3421515.3421517acmotherconferencesArticle/Chapter ViewAbstractPublication PagessspsConference Proceedingsconference-collections
research-article

Typicality of Lexical Bundles in Different Sections of Scientific Articles

Published: 17 December 2020 Publication History

Abstract

This paper proposes a method to quantify the typicality of lexical bundles in sections of academic articles, specifically in the field of Natural Language Processing papers. Typicality is defined as the product of individual KL-divergence scores and the probability of a bundle to appear in a type of section. An evaluation of our typicality measure against two other baselines shows slight improvements according to the Silhouette coefficient.

References

[1]
Douglas Biber, Susan Conrad, and Viviana Cortes. 2004. If you look at...: Lexical Bundles in University Teaching and Textbooks. Applied Linguistics 25, 3 (09 2004), 371–405. https://doi.org/10.1093/applin/25.3.371 arXiv:https://academic.oup.com/applij/article-pdf/25/3/371/431268/250371.pdf
[2]
Joanne Boisson, Ting-Hui Kao, Jian-Cheng Wu, Tzu-Hsi Yen, and Jason S. Chang. 2013. Linggle: a Web-scale Linguistic Search Engine for Words in Context. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Vol. System Demonstrations. Association for Computational Linguistics, Sofia, Bulgaria, 139–144. https://www.aclweb.org/anthology/P13-4024
[3]
Y. H. Chen and P. Baker. 2010. Lexical bundles in L1 and L2 academic writing. Language Learning and Technology 14, 2 (1 6 2010), 30–49. https://www.researchgate.net/publication/45681690_Lexical_Bundles_in_L1_and_L2_Academic_Writing
[4]
Susan Conrad and D. Biber. 2004. The Frequency and Use of Lexical Bundles in Conversation and Academic Prose. Lexicographica 20 (01 2004), 56–71.
[5]
Viviana Cortes. 2004. Lexical bundles in published and student disciplinary writing: Examples from history and biology. English for Specific Purposes 23, 4 (2004), 397 – 423. https://doi.org/10.1016/j.esp.2003.12.001
[6]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186. https://doi.org/10.18653/v1/N19-1423
[7]
Ken Hyland. 2008. As can be seen: Lexical bundles and disciplinary variation. English for Specific Purposes 27, 1 (2008), 4 – 21. https://doi.org/10.1016/j.esp.2007.06.001
[8]
Takumi Ito, Tatsuki Kuribayashi, Hayato Kobayashi, Ana Brassard, Masato Hagiwara, Jun Suzuki, and Kentaro Inui. 2019. Diamonds in the Rough: Generating Fluent Sentences from Early-Stage Drafts for Academic Writing Assistance. In Proceedings of the 12th International Conference on Natural Language Generation. Association for Computational Linguistics, Tokyo, Japan, 40–53. https://doi.org/10.18653/v1/W19-8606
[9]
Kenichi Iwatsuki and Akiko Aizawa. 2018. Using Formulaic Expressions in Writing Assistance Systems. In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018). Association for Computational Linguistics, Santa Fe, New Mexico, USA, 2678–2689. https://www.aclweb.org/anthology/C18-1227
[10]
S. Kullback and R. A. Leibler. 1951. On Information and Sufficiency. Ann. Math. Statist. 22, 1 (03 1951), 79–86. https://doi.org/10.1214/aoms/1177729694
[11]
Atsushi Mizumoto. 2017. Initial Evaluation of AWSuM: A Pilot Study. Vocabulary Learning and Instruction 6, 2 (dec 2017), 46–51. https://ci.nii.ac.jp/naid/120006408125/en/
[12]
Peter J. Rousseeuw. 1987. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20 (1987), 53 – 65. https://doi.org/10.1016/0377-0427(87)90125-7
[13]
Danica Salazar. 2014. Lexical bundles in native and non-native scientific writing. Studies in corpus linguistics, Vol. 65. John Benjamins Publishing Company.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSPS '20: Proceedings of the 2020 2nd Symposium on Signal Processing Systems
July 2020
125 pages
ISBN:9781450388627
DOI:10.1145/3421515
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. lexical bundles
  2. typicality
  3. writing aid

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • JSPS KAKENHI

Conference

SSPS 2020

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 63
    Total Downloads
  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media