Abstract
A large number of online learning platforms and the explosive growth of the types and quantity of resources on the platform greatly increase the difficulty of learning content selection. Traditional language search uses resource ranking to build index, which cannot meet the needs of professional and accurate search. An intelligent search method is proposed in this paper using Bert language model for pre-training to improve the learning and reasoning ability of the machine. Based on Sentence-Bert model and the concise and effective twin network (Siamese), the sentence vector features are generated to complete the downstream search task. Finally, experiments on the online learning resources of MOOC and the State Grid were analyzed on Google Colab platform, and results show that achieve rapid keyword matching of relevant courseware names.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang, Z.: Research and implementation of text categorization based on machine learning. Nanjing University of Posts and Telecommunications (2018). (Chinese)
Yang, T.: Application of artificial intelligence technology in information retrieval based on big data. Value Eng. 38(10), 173–175 (2019). (Chinese)
Khurana, D., Koli, A., Khatter, K., et al.: Natural language processing: state of the art, current trends and challenges (2017)
Huikun, S.: A method for computing semantic similarity in information retrieval. J. Chizhou Univ. 30(03), 26–29 (2016)
Kui, L.: Research on semantic similarity computation of short tex. Harbin Engineering University (2016). (Chinese)
Devlin, J., Chang, M. W., Lee, K., et al.: BERT: pre-training of deep bidirectional transformers for language understanding (2018)
Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks (2019)
Xiaoli, L., Jimin, L., Zhongzhi, S.: Conceptual inference network and its application in text categorization. Comput. Res. Dev. 37(9), 1032–1038 (2009). (Chinese)
Yan, F., Enhong, C., Qingyi, W., et al.: Performance Study of hypertext concordance classifier. Comput. Res. Dev. 37(9), 1026–1031 (2009). (Chinese)
Xuanjing, H., Lide, W., Shizaki, Y., et al.: Text classification method independent of languages. Chin. J. Inf. 14(6), 1–7 (2000). (Chinese)
Qian, D., Yongcheng, W., Huihui, Z., et al.: Word weight and classification algorithm in automatic text classification. Chin. J. Inf. 14(3), 25–29 (2000). (Chinese)
Chuntao, H., Jinkang, Q., Jingmei, C., et al.: Application research of public opinion classification based on BERT model. Netw. Secur. Technol. Appl. 11, 41–44 (2019)
Liang, Y., Zhe, J., Chengsheng, M., et al.: Traditional Chinese medicine clinical records classification with BERT and domain specific corpora. J. Am. Med. Inf. Assoc.: JAMIA 26(12), 12–23 (2019)
Xiaohui, Z., Yaoyun, Z., Qin, Z., et al.: Extracting comprehensive clinical information for breast cancer using deep learning methods. Int. J. Med. Inf. 132, 103985 (2019)
Jwa, H., Oh, D., Park, K., et al.: exBAKE: automatic fake news detection model based on Bidirectional Encoder Representations from Transformers (BERT). Appl. Sci.-Basel 9(19), 32–46 (2019)
Yingjie, W., Bin, X., Ningbo, L.: A pre-trained technological language representation for Chinese technological text analysis. Comput. Eng. 12-08 (2019)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
He, J., Lu, Q., Tong, Y., Chen, Y. (2021). Course Classification of Online Learning Platform Based on Sentence-Bert Model. In: Tian, Y., Ma, T., Khan, M.K. (eds) Big Data and Security. ICBDS 2020. Communications in Computer and Information Science, vol 1415. Springer, Singapore. https://doi.org/10.1007/978-981-16-3150-4_50
Download citation
DOI: https://doi.org/10.1007/978-981-16-3150-4_50
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3149-8
Online ISBN: 978-981-16-3150-4
eBook Packages: Computer ScienceComputer Science (R0)