Abstract:
Document layout analysis plays a vital role in computer vision research. Most current methods treat it as a pixel-level classification problem. However, pixel-based classification does not reliably preserve the continuity of the classified regions. In this paper, we propose a document layout analysis method based on positional encoding and bounding box specification. We maintain the continuity of the analyzed regions by building the framework around bounding boxes. We also integrate a positional encoding module into the framework to retain detailed information during layout analysis and modeling. Experimental results show that the proposed method achieves state-of-the-art results.
Date of Conference: 16-19 October 2022
Date Added to IEEE Xplore: 18 October 2022
ISBN Information: