Summary
In summary, GPT-4 outperforms the other two LLMs throughout the entire pipeline, a gap we attribute primarily to the smaller models' limited scale. Qwen-72B performs considerably better than Qwen-14B and achieves results comparable to, though slightly below, those of GPT-4. Our experiments are summarized in Table 2.
Ethics declarations
Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.
Cite this article
Zeng, H., Sun, JM., Li, CS. et al. Foundation models for topic modeling: a case study. Front. Comput. Sci. 19, 192325 (2025). https://doi.org/10.1007/s11704-024-40069-7