Loading [MathJax]/extensions/MathMenu.js
An Automatic Video Tag Extraction Method based on Large Language Model Text Content Parsing | IEEE Conference Publication | IEEE Xplore

An Automatic Video Tag Extraction Method based on Large Language Model Text Content Parsing


Abstract:

In the information age, short videos have become an integral part of daily life, leading to an exponential increase in the volume of video data on short video platforms. ...Show More

Abstract:

In the information age, short videos have become an integral part of daily life, leading to an exponential increase in the volume of video data on short video platforms. Without effective organization and classification of these video resources, management becomes a significant challenge. Therefore, the development of efficient video categorization techniques is crucial. This study proposes an automatic video tag extraction method based on Large Language Model text content parsing (VTE-LLM). The method integrates OCR (Optical Character Recognition) and CRNN (Convolutional Recurrent Neural Network) models to extract subtitle information from videos, and utilizes Large Language Models for text parsing to automatically generate video classifications and tags. This approach effectively addresses the cost and efficiency issues associated with traditional methods. Experimental results demonstrate that the proposed method achieves high accuracy across multiple video types.
Date of Conference: 20-22 December 2024
Date Added to IEEE Xplore: 18 February 2025
ISBN Information:
Conference Location: Nanjing, China

Contact IEEE to Subscribe

References

References is not available for this document.