skip to main content
10.1145/3575882.3575949acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3inaConference Proceedingsconference-collections
research-article

Auto Code Comment Assessment for Online Judge using Word Embedding and Word Mover's Distance

Published: 27 February 2023 Publication History

Abstract

Comments in source code are a form of inline documentation created by programmers to help others understand the function of the program. The students of the basic programming subject need how to learn to write better code comments which can be difficulties for the lecturer assessing. Therefore, the author proposes an automatic source code comment assessment method for the online judge system with a corpus-based text similarity approach. Word2vec, GloVe, and fastText models will be used to train word vectors with the Indonesian Wikipedia Dump. The Similarities will be measured using Word Mover's Distance (WMD). Experiments were carried out using epoch variations during the training process. Spearman's rho correlation coefficient, mean average error (MAE), and performance measurements of each model will be compared. The methods with the proposed word embedding approach still provide not good results.

References

[1]
P. J. De Pasquale, M. E. Locasto, L. Kaczmarczyk, M. Martinovic. 2012. "//TODO: Help students improve commenting practices. In 2012 Frontiers in Education Conference Proceedings, 1-6.
[2]
Bai Yang, Zhang Liping, Zhao Fengrong. 2019. A Survey on Research of Code Comment. In ICMSS 2019: Proceedings of the 2019 3rd International Conference on Management Engineering, Software Engineering and Service Sciences, 45–51.
[3]
D. Steidl, B. Hummel, dan E. Juergens. 2013. Quality analysis of source code comments. In IEEE International Conference on Program Comprehension, 83–92.
[4]
Yuan Huang, Nan Jia, Qiang Zhou, Xiangping Chen, Yingfei Xiong, Xiaonan Luo. 2018. Guiding developers to make informative commenting decisions in source code. In Proceedings of the 40th International Conference on Software Engineering, 260 -261.
[5]
Peter J. De Pasquale, Michael E. Locasto, Lisa C. Kaczmarczyk. 2012. Identifying effective pedagogical practices for commenting computer source code. In Proceedings of the 43rd ACM technical symposium on Computer Science Education, 678.
[6]
W. H. Gomaa dan A. A. Fahmy. 2013. A Survey of Text Similarity Approaches. In International Journal Computing Application, Vol. 68.
[7]
W. H. Gomaa dan A. A. Fahmy. 2020. Ans2vec: A Scoring System for Short Answers. In The Internasional Conference on Advanced Machine Learning Technologies and Applications, 586–595.
[8]
C. Jin, B. He, dan J. Xu. 2017. A study of distributed semantic representations for automated essay scoring. In KSEM 2017: Knowledge Science, Engineering and Management, Vol. 10412.
[9]
Tsegaye Misikir Tashu, Tomas Horvath. Pair-Wise: Automatic Essay Evaluation using Word Mover's Distance. 2018. In Proceedings of the 10th International Conference on Computer Supported Education, 59-66. ISBN: 978-989-758-291-2.
[10]
Rosa Ariani Sukamto, Rani Megasari, Erna Piantari, M Nabillah Fihira Rischa. 2020. Code Comment Assessment Development for Basic Programming Subject using Online Judge. In Proceedings of the 7th Mathematics, Science, and Computer Science Education International Seminar.
[11]
E. B. Setiawan, D. H. Widyantoro, dan K. Surendro. 2016. Feature expansion using word embedding for tweet topic classification. In Proceeding 2016 10th International Conference on Telecommunication System, Services and Application.
[12]
S. Arora, Y. Liang, dan T. Ma. 2016. Simple but Tough-to-Beat Baseline for Sentence Embeddings. In International Conference on Learning Representations, 416–424.
[13]
T. Mikolov, K. Chen, G. Corrado, dan J. Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In Proceedings of Workshop at ICLR. http://arxiv.org/abs/1301.3781.
[14]
J. Pennington, R. Socher, dan C. Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 1532–1543.
[15]
P. Bojanowski, E. Grave, A. Joulin, dan T. Mikolov. 2017. Enriching Word Vectors with Subword Information. In Transactions of the Association for Computational Linguistics, 135–146. http://arxiv.org/abs/1607.04606.
[16]
M. J. Kusner, Y. Sun, N. I. Kolkin, dan K. Q. Weinberger. 2015. From word embeddings to document distances. In 32nd International Conference on Machine Learning, 957–966.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
IC3INA '22: Proceedings of the 2022 International Conference on Computer, Control, Informatics and Its Applications
November 2022
415 pages
ISBN:9781450397902
DOI:10.1145/3575882
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 February 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic scoring
  2. code comment
  3. text similarity
  4. word embedding
  5. word mover's distance

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

IC3INA 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 31
    Total Downloads
  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)3
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media