Domain N-Gram Construction and Its Application to Text Editor

Hwang, Myunggwon; Choi, Dongjin; Lee, Hyogap; Kim, Pankoo

doi:10.1007/978-3-642-20039-7_27

Domain N-Gram Construction and Its Application to Text Editor

Myunggwon Hwang²²,
Dongjin Choi²²,
Hyogap Lee²² &
…
Pankoo Kim²²

Conference paper

1059 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6591))

Abstract

Google has published n-gram data which was constructed from huge document set gathered until 2005. However, it is hard to use the data in real world applications due to its huge volume. In this paper, we propose a method to construct domain n-gram data in which a specific domain group is interested and apply the data to text editor for practical efficiency in evaluation. It contains diverse test results according to typing speed level of people and comparison results with other works. The result of this research is conducted through applying to typing only however it has big importance in a point of being capable of expecting its effectiveness because the n-gram data is widely applicable to many fields.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Budanitsky, A., Hirst, G.: Evaluating WordNet-based Measures of lexical semantic relatedness. Computational Linguistics 32(1), 13–47 (2006)
Article MATH Google Scholar
Zukerman, I., Albrecht, D.W.: Predictive Statistical Models for User Modeling. User Modeling and User-Adapted Interaction 11, 5–18 (2004)
Article MATH Google Scholar
Cavnar, W.B., Trenkle, J.M.: N-Gram-Based Text Categorization. In: Proceedings of SDAIR 1994, 3rd Annual Symposium on Document Analysis and Information Retrieval, pp. 161–175 (1994)
Google Scholar
Baeza-Yates, R., Hurtado, C., Mendoza, M.: Query recommendation using query logs in search engines. In: Lindner, W., Fischer, F., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 395–397. Springer, Heidelberg (2004)
Chapter Google Scholar
Khudanpur, S., Wu, J.: A Mazimum Entropy Language Model Integrating N-Grams and Topic Dependencies for Conversational Speech Recognition. In: Proceedings of ICASSP 1999, pp. 553–556 (1999)
Google Scholar
Soon-Beak, K., Soo-Heum, L.: The Pupil Motion Tracking Based on Active Shape Model Using Feature Weight Vector. In: Proceedings of the Korea Institute of Signal Processing and Systems Conference, pp. 205–208 (November 2005)
Google Scholar
Morimoto, C.H., Koons, D., Amir, A., Flickner, M.: Frame-Rate Pupil Detector and Gaze Tracker. In: ICCV 1999 FRAME-RATE workshop (September 1999)
Google Scholar
Hwang, M.G., Choi, D.J., Lee, H.G., Kim, P.K.: Text Editor based on Google Trigram and its Usability. In: Proceedings of the UKSim 4th European Modeling Symposium on Computer Modeling and Simulation, pp. 12–15 (2010)
Google Scholar
Brants, T., Franz, A.: Web 1T 5-gram Corpus Version 1.1 (LDC2006T13) (April 2006)
Google Scholar
Choi, D.J., Hwang, M.G., Kim, P.K.: Semantic Context Extraction of Wikipedia. In: The Proceedings of the 2010 International Conference on Semantic Web and Web Services (SWWS 2010), pp. 38–41 (2010)
Google Scholar
Velardi, P., Navigli, R., D’Amadio, P.: Mining the Web to Create Specialized Glossaries. IEEE Intelligent Systems 23(5) (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

814 IT Building, 375 Seoseok-dong, Dong-gu, 501-759, Gwangju, South Korea
Myunggwon Hwang, Dongjin Choi, Hyogap Lee & Pankoo Kim

Authors

Myunggwon Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Dongjin Choi
View author publications
You can also search for this author in PubMed Google Scholar
Hyogap Lee
View author publications
You can also search for this author in PubMed Google Scholar
Pankoo Kim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Wroclaw University of Technology, 50-370, Wroclaw, Poland
Ngoc Thanh Nguyen
Department of Computer Engineering, Yeungnam University, 712-749, Dae-Dong, Gyeungsan, Korea
Chong-Gun Kim
Institute of Informatics, Automation and Robotics, Wroclaw University of Technology, 50-370, Wrocław, Poland
Adam Janiak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hwang, M., Choi, D., Lee, H., Kim, P. (2011). Domain N-Gram Construction and Its Application to Text Editor. In: Nguyen, N.T., Kim, CG., Janiak, A. (eds) Intelligent Information and Database Systems. ACIIDS 2011. Lecture Notes in Computer Science(), vol 6591. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20039-7_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-20039-7_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20038-0
Online ISBN: 978-3-642-20039-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics