skip to main content
10.1145/3452940.3453072acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiciteeConference Proceedingsconference-collections
short-paper

A Table Recognition and Extraction Algorithm in Dongba Character Documents Based on Hough Transform

Published: 17 May 2021 Publication History

Abstract

Dongba hieroglyphs is a text form that transitions from pictographs to pictographs and phonetic characters. In order to distinguish them from modern characters, many Dongba hieroglyphs use tables to distinguish between Dongba hieroglyphs and modern characters. Through the division function of the table, the structure of the document is clearly, and readers can also understand the reading order of Dongba characters more quickly. It can be seen that analyzing the structure of the document, identifying and separating the tables in the document, classifying and storing Dongba characters and annotations are very important tasks. Therefore, we used the Hough transform algorithm to realize the recognition and extraction of the table by analyzing the structural features of the document image. It lays the foundation for realizing the extraction, classification and storage of Dongba hieroglyphs and the establishment of Dongba hieroglyphs database.

References

[1]
F.Z. Zheng, 2005 Word research of Naxi Dongba hieroglyphic, Nationalities Publishing House, Beijing, China.
[2]
L.M. He, 2003 Naxi Pictographs Copybook, Yunnan Nationalities Publishing House, Yunnan, Kunming, China.
[3]
Hough V, Paul C (1962). Method and means for recognizing complex patterns: US, 3069654, 12--18
[4]
H.Q. Wang, R.W. Dai (1997). Projection Based Recursive Algorithm for Document Understanding. Pattern Recognition and Artificial Intelligence, 02, 118--126.
[5]
Z. Zhang, H.X. Ni, 2013 Proficient in Matlab digital image processing and recognition. Posts & Telecom Press, Beijing, China.

Index Terms

  1. A Table Recognition and Extraction Algorithm in Dongba Character Documents Based on Hough Transform
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ICITEE '20: Proceedings of the 3rd International Conference on Information Technologies and Electrical Engineering
        December 2020
        687 pages
        ISBN:9781450388665
        DOI:10.1145/3452940
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 17 May 2021

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Document segmentation
        2. Dongba hieroglyphs document
        3. Hough transform
        4. Table extraction

        Qualifiers

        • Short-paper
        • Research
        • Refereed limited

        Funding Sources

        • the Talent Introduction Research Fund of Suzhou Vocational University
        • the Scientific Research Fund of Yunnan Education Department

        Conference

        ICITEE2020

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 52
          Total Downloads
        • Downloads (Last 12 months)7
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 28 Feb 2025

        Other Metrics

        Citations

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media