Abstract
Electronic presentations are used in numerous scenarios, such as lectures and meetings. In recent years, the widespread use of electronic presentations means that presentation slide data is increasing as one of industry’s most important information resources. Therefore, it is necessary to develop a practical usage method for the reutilisation of the data on slides. An approach to achieve this is to focus on visual structure information within a slide, because visual structure information is one of the most valuable, easy to understand methods for humans. However, since visual structure information is not explicitly defined in the slide data itself, computers have difficulty comprehending structure information directly. In this paper, we propose a method of extracting structure information from slide information. The proposed method is composed of two steps: organising objects within the slide as units, such as title, body text, figure and table, and structuring the units as a hierarchy tree based on a top-down approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Anjewierden, A.: AIDAS: Incremental Logical Structure Discovery in PDF Documents. In: Procs. the 6th International Conference on Document Analysis and Recognition, pp. 374–378 (2001)
Hayama, T., Nanba, H., Kunifuji, S.: Alignment between a Technical Paper and Presentation Sheets Using a Hidden Markov Model. In: Proc. Active Media Technology 2005, pp. 102–106 (2005)
Ishihara, T., Takagi, H., Itoh, T., Asakawa, C.: Analyzing Visual Layout for a Non-Visual Presentation-Document Interface. In: Proc. the 8th International ACM SIGACCESS Conference on Computers and Accessibility, pp. 165–172 (2006)
Nanba, H., Abekawa, T., Okumura, M., Saito, S.: Bilingual presri: Integration of multiple research paper databases. In: Proc. the 7th RIAO Conference: Coupling approaches, coupling media and coupling languages for information retrieval, pp. 195–211 (2004)
Nanno, T., Saito, S., Okumura, M.: Structuring Web Pages Based on Repetition of. Elements. In: Proc. the 2nd International Workshop on Web Document Analysis, pp. 58–60 (2003)
Rosenfeld, B., Feldman, R., Aumann, Y.: Structural extraction from visual layout of documents. In: Procs. the 11th International Conference on Information and Knowledge Management, pp. 203–210 (2002)
Watanabe, T., Luo, Q., Sugie, N.: Layout Recognition of Multi-Kinds of Table-Form. Documents, IEEE Transactions on Pattern Analysis and Machine Intelligence 17(4), 432–445 (1995)
Yang, Y., Zhang, H.: HTML Page Analysis Based on Visual Cues. In: Procs. the 6th International Conference on Document Analysis and Recognition, pp. 859–864, 10–13 (2001)
Zhai, Y., Liu, B.: Structured Data Extraction from the Web Based on Partial Tree Alignment. IEEE Transactions on Knowledge and Data Engineering 18(12), 1614–1628 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hayama, T., Nanba, H., Kunifuji, S. (2008). Structure Extraction from Presentation Slide Information. In: Ho, TB., Zhou, ZH. (eds) PRICAI 2008: Trends in Artificial Intelligence. PRICAI 2008. Lecture Notes in Computer Science(), vol 5351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89197-0_62
Download citation
DOI: https://doi.org/10.1007/978-3-540-89197-0_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89196-3
Online ISBN: 978-3-540-89197-0
eBook Packages: Computer ScienceComputer Science (R0)