Conferences >2024 7th International Confer...

An Automatic Video Tag Extraction Method based on Large Language Model Text Content Parsing

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In the information age, short videos have become an integral part of daily life, leading to an exponential increase in the volume of video data on short video platforms. ...Show More

Metadata

Abstract:

In the information age, short videos have become an integral part of daily life, leading to an exponential increase in the volume of video data on short video platforms. Without effective organization and classification of these video resources, management becomes a significant challenge. Therefore, the development of efficient video categorization techniques is crucial. This study proposes an automatic video tag extraction method based on Large Language Model text content parsing (VTE-LLM). The method integrates OCR (Optical Character Recognition) and CRNN (Convolutional Recurrent Neural Network) models to extract subtitle information from videos, and utilizes Large Language Models for text parsing to automatically generate video classifications and tags. This approach effectively addresses the cost and efficiency issues associated with traditional methods. Experimental results demonstrate that the proposed method achieves high accuracy across multiple video types.

Published in: 2024 7th International Conference on Data Science and Information Technology (DSIT)

Date of Conference: 20-22 December 2024

Date Added to IEEE Xplore: 18 February 2025

ISBN Information:

DOI: 10.1109/DSIT61374.2024.10881284

Conference Location: Nanjing, China