Automated Generating Thai Stupa Image Descriptions with Grid Pattern and Decision Tree

Prasomphan, Sathit; nomrubporn, Panuwut; Pathanarat, Pirat

doi:10.1007/978-981-10-2777-2_11

Part of the book series: Communications in Computer and Information Science (CCIS, volume 652)

Included in the following conference series:

International Conference on Soft Computing in Data Science

Abstract

This research presents a novel algorithm for generating descriptions of stupa images, such as the stupa’s era, its architecture, and other details. The algorithm divides each image into a grid pattern, for example 10 × 10, and submits the information inside the grid to a decision tree model. The decision tree learns stupa descriptions from this information and serves as the classifier that generates a description for each stupa image. Analysing the image through its grid information constitutes a new approach to feature extraction. The algorithm was tested on a dataset of stupa images from Phra Nakhon Si Ayutthaya province, Sukhothai province, and Bangkok. The experimental results show that the proposed framework can efficiently give correct descriptions of stupa images compared with the traditional method.
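The grid-and-decision-tree idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which information is taken from each grid cell or which decision tree variant is used, so the mean grayscale intensity per cell, the Gini-based tree, the toy 4 × 4 images, the 2 × 2 grid, and the labels "era A" and "era B" are all assumptions made for illustration.

```python
from collections import Counter

def grid_features(image, rows, cols):
    """Split a 2-D grayscale image (list of lists) into rows x cols cells and
    return the mean intensity of each cell in row-major order.
    Assumption: the image has at least rows x cols pixels."""
    h, w = len(image), len(image[0])
    feats = []
    for r in range(rows):
        y0, y1 = r * h // rows, (r + 1) * h // rows
        for c in range(cols):
            x0, x1 = c * w // cols, (c + 1) * w // cols
            cell = [image[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            feats.append(sum(cell) / len(cell))
    return feats

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((k / n) ** 2 for k in Counter(labels).values())

def build_tree(X, y, depth=0, max_depth=3):
    """Return a leaf label (str) or a node (feature, threshold, left, right)."""
    if depth == max_depth or len(set(y)) == 1:
        return Counter(y).most_common(1)[0][0]
    best = None
    for f in range(len(X[0])):
        values = sorted(set(row[f] for row in X))
        # Candidate thresholds are midpoints between consecutive feature values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:  # no feature separates the remaining samples
        return Counter(y).most_common(1)[0][0]
    _, f, t = best
    left = [(row, lab) for row, lab in zip(X, y) if row[f] <= t]
    right = [(row, lab) for row, lab in zip(X, y) if row[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [l for _, l in left], depth + 1, max_depth),
            build_tree([r for r, _ in right], [l for _, l in right], depth + 1, max_depth))

def predict(tree, x):
    """Walk from the root to a leaf and return the leaf label."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree

def toy_image(top, bottom):
    """Hypothetical 4 x 4 image: two rows of `top`, then two rows of `bottom`."""
    return [[top] * 4] * 2 + [[bottom] * 4] * 2

# Hypothetical training set: bright-topped images are "era A", the rest "era B".
train_images = [toy_image(200, 50), toy_image(220, 40),
                toy_image(30, 210), toy_image(60, 180)]
train_labels = ["era A", "era A", "era B", "era B"]

X = [grid_features(img, 2, 2) for img in train_images]
tree = build_tree(X, train_labels)
prediction = predict(tree, grid_features(toy_image(190, 70), 2, 2))
print(prediction)
```

A real pipeline would apply the same steps to photographs with a finer grid such as the 10 × 10 pattern mentioned above, and the predicted class would then be mapped to descriptive text about the stupa.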

Acknowledgment

This research was funded by King Mongkut’s University of Technology North Bangkok under contract no. KMUTNB-59-GEN-048.

Author information

Authors and Affiliations

Department of Computer and Information Science, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, 1518 Pracharat 1 Road, Wongsawang, Bangsue, Bangkok, 10800, Thailand
Sathit Prasomphan, Panuwut nomrubporn & Pirat Pathanarat

Corresponding author

Correspondence to Sathit Prasomphan.

Editor information

Editors and Affiliations

University of Tennessee, Knoxville, Tennessee, USA
Michael W. Berry
Universiti Teknologi MARA, Shah Alam, Malaysia
Azlinah Hj. Mohamed
Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam, Malaysia
Bee Wah Yap

About this paper

Cite this paper

Prasomphan, S., nomrubporn, P., Pathanarat, P. (2016). Automated Generating Thai Stupa Image Descriptions with Grid Pattern and Decision Tree. In: Berry, M., Hj. Mohamed, A., Yap, B. (eds) Soft Computing in Data Science. SCDS 2016. Communications in Computer and Information Science, vol 652. Springer, Singapore. https://doi.org/10.1007/978-981-10-2777-2_11

DOI: https://doi.org/10.1007/978-981-10-2777-2_11
Published: 18 September 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2776-5
Online ISBN: 978-981-10-2777-2
eBook Packages: Computer Science, Computer Science (R0)
