Abstract
In the Semi-Supervised Text Classification (SSTC) task, the performance of SSTC-based models relies heavily on the accuracy of the pseudo-labels for unlabeled data, which is hard to guarantee in real-world scenarios. Prompt-learning has recently proved effective in alleviating the low-accuracy problem caused by limited labeled data in SSTC. In this paper, we present a Pattern Exploiting Training with Unsupervised Data Augmentation (PETUDA) method to address SSTC under the limited-label setting. We first exploit the potential of pre-trained language models (PLMs) using prompt learning, converting the text classification task into a cloze-style task and using the masked-prediction ability of the PLMs to predict the categories. Then, we use a variety of data augmentation methods to enhance model performance with unlabeled data, and introduce a consistency loss into the training process to make full use of the unlabeled data. Finally, we conduct extensive experiments on three text classification benchmark datasets. Empirical results show that PETUDA consistently outperforms the baselines in all cases.
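As a minimal sketch of the cloze-style reformulation described in the abstract, the snippet below scores each class by the masked-LM logit of a label word at the mask position. The template, the verbalizer words, and the use of bert-base-uncased are illustrative assumptions for exposition; they are not the actual patterns or label words used in PETUDA.

```python
# Sketch of cloze-style text classification with a masked LM.
# Assumes Hugging Face transformers; pattern and verbalizer are hypothetical.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

def build_prompt(text: str) -> str:
    # Hypothetical cloze pattern; PETUDA's actual patterns may differ.
    return f"{text} This topic is about {tokenizer.mask_token}."

# Hypothetical verbalizer: one label word per class.
LABEL_WORDS = ["sports", "business", "science"]

def classify(text: str) -> int:
    inputs = tokenizer(build_prompt(text), return_tensors="pt", truncation=True)
    # Locate the [MASK] position in the tokenized prompt.
    mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
    with torch.no_grad():
        logits = model(**inputs).logits[0, mask_pos[0]]  # (vocab_size,)
    # Score each class by the logit of its label word at the mask position.
    label_ids = tokenizer.convert_tokens_to_ids(LABEL_WORDS)
    return int(logits[label_ids].argmax())

print(LABEL_WORDS[classify("The team won the championship final.")])
```

This zero-shot scoring can then be fine-tuned on the limited labeled set, which is the usual pattern-exploiting-training setup.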
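The consistency term on unlabeled data can likewise be sketched as a KL divergence between the model's (sharpened, gradient-detached) prediction on the original text and its prediction on an augmented version, in the spirit of UDA-style consistency training. The temperature, the loss weight, and the combination with the supervised loss are assumptions for illustration, not the paper's exact settings.

```python
# Sketch of a UDA-style consistency loss on unlabeled data.
# Hyperparameters below are illustrative assumptions.
import torch
import torch.nn.functional as F

def consistency_loss(logits_orig: torch.Tensor,
                     logits_aug: torch.Tensor,
                     temperature: float = 0.4) -> torch.Tensor:
    # Sharpened target from the unaugmented view; no gradient flows
    # through the target, as is standard in consistency training.
    with torch.no_grad():
        target = F.softmax(logits_orig / temperature, dim=-1)
    log_pred = F.log_softmax(logits_aug, dim=-1)
    # KL(target || prediction), averaged over the unlabeled batch.
    return F.kl_div(log_pred, target, reduction="batchmean")

# Usage sketch: combine with the supervised loss on labeled data, where
# logits_u / logits_u_aug come from an unlabeled text and its augmented
# version (e.g., back-translation), and lambda_u is an assumed weight.
# loss = ce_loss + lambda_u * consistency_loss(logits_u, logits_u_aug)
```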
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Grant U1811263, the Science and Technology Program of Guangzhou under Grant 2023A04J1728, and the Talent Research Start-Up Foundation of Guangdong Polytechnic Normal University under Grant 2021SDKYA098.
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Yuan, C., Zhou, Z., Tang, F., Lin, R., Mao, C., Teng, L. (2023). Prompt-Learning for Semi-supervised Text Classification. In: Zhang, F., Wang, H., Barhamgi, M., Chen, L., Zhou, R. (eds) Web Information Systems Engineering – WISE 2023. WISE 2023. Lecture Notes in Computer Science, vol 14306. Springer, Singapore. https://doi.org/10.1007/978-981-99-7254-8_3
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7253-1
Online ISBN: 978-981-99-7254-8
eBook Packages: Computer Science, Computer Science (R0)