Abstract
To reduce the cognitive overhead of understanding and organizing online learning materials using topic models, especially for new learners not familiar with related domains, this paper proposes an efficient and effective approach for generating high-quality labels as better interpretation of topics discovered and typically visualized as a list of top terms. Compared with previous methods dependent on complicated post-processing processes or external resources, our phrase-based topic inference method can generate and narrow down label candidates more naturally and efficiently. The proposed approach is demonstrated and examined with real data in our corporate learning platform.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
One of search APIs used by [4] is not available any more, so it is skipped in the experiments.
References
Bhatia, S., Lau, J.H., Baldwin, T.: Automatic labelling of topics with neural embeddings. In: Proceedings of COLING 2016, The 26th International Conference on Computational Linguistics: Technical Papers, pp. 953–963. The COLING 2016 Organizing Committee (2016). http://aclweb.org/anthology/C16-1091
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. CoRR abs/1810.04805 (2018)
El-Kishky, A., Song, Y., Wang, C., Voss, C.R., Han, J.: Scalable topical phrase mining from text corpora. Proc. VLDB Endow. 8(3), 305–316 (2014)
Lau, J.H., Grieser, K., Newman, D., Baldwin, T.: Automatic labelling of topic models. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 1536–1545. Association for Computational Linguistics, Stroudsburg, PA, USA (2011)
Mei, Q., Shen, X., Zhai, C.: Automatic labeling of multinomial topic models. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2007, pp. 490–499. ACM, New York (2007). https://doi.org/10.1145/1281192.1281246
Peters, M., et al.: Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2227–2237. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/N18-1202. http://aclweb.org/anthology/N18-1202
Shang, J., Liu, J., Jiang, M., Ren, X., Voss, C.R., Han, J.: Automated phrase mining from massive text corpora. IEEE Trans. Knowl. Data Eng. 30(10), 1825–1837 (2018)
Vilnis, L., McCallum, A.: Word representations via Gaussian embedding. ICLR abs/1412.6623 (2015)
Wang, J., Xiang, J., Uchino, K.: Topic-specific recommendation for open education resources. In: Li, F.W.B., Klamma, R., Laanpere, M., Zhang, J., Manjón, B.F., Lau, R.W.H. (eds.) ICWL 2015. LNCS, vol. 9412, pp. 71–81. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25515-6_7
Wang, J., Zhao, C., Uchino, K., Xiang, J.: Interactive topic model with enhanced interpretability. In: Proceedings of the 2nd Workshop on Explainable Smart Systems, EXSS 2019 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, J., Uchino, K. (2019). Automatic Topic Labeling for Facilitating Interpretability of Online Learning Materials. In: Herzog, M., Kubincová, Z., Han, P., Temperini, M. (eds) Advances in Web-Based Learning – ICWL 2019. ICWL 2019. Lecture Notes in Computer Science(), vol 11841. Springer, Cham. https://doi.org/10.1007/978-3-030-35758-0_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-35758-0_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35757-3
Online ISBN: 978-3-030-35758-0
eBook Packages: Computer ScienceComputer Science (R0)