Abstract
Fuzzy grammar has been introduced as an approach to represent and learn text fragments where the set of learned patterns are represented by combining similar segments to represent regularities and marking interchangeable segments. This paper is dedicated to present a procedural scheme towards learning text fragment in text categorization task facilitated by fuzzy grammars. A few issues are involved in developing fuzzy grammars which are (i) determination of the number of text classes to develop (ii) the selection of text fragments, F relevant to each text class (iii) determining frequent and important terms or keywords, V to develop the set of terminal, T and compound grammars, N. (iv) Conversion of text fragments into grammars, (v) Combination of grammars into a compact form. Comparison between fuzzy grammar and other location entity identifier such as LbjTagger, LingPipe, Newswire and OpenCalais is observed where results have shown that this method outperforms other standard machine learning and statistical-based approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Todorovic, B.T., Rancic, S.R., Markovic, I.M., Mulalic, E.H., Ilic, V.M.: Named entity recognition and classification using context Hidden Markov Model. In: 2008 9th Symposium on Neural Network Applications in Electrical Engineering, pp. 43–46 (2008)
Xu, R., Supekar, K., Huang, Y., Das, A., Garber, A.: Combining Text Classification and Hidden Markov Model Techniques for Structuring Randomized Clinical Trial Abstracts. In: Annual Symposium of AMIA, pp. 824–828 (2006)
Chieu, H.L.: Named Entity Recognition: A Maximum Entropy Approach Using Global Information. Language
McCallum, A., Pereira, F., Lafferty, J.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: 18th International Conference on Machine Learning (ICML 2001), pp. 282–289 (2001)
Nadeau, D., Turney, P.D., Matwin, S.: Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity, pp. 266–277 (2006)
Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. Technology (1998)
Watt, A.: Beginning Regular Expressions. Wiley (2005)
Brazma, A.: Learning of Regular Expressions by Pattern Matching. In: Computational Learning Theory, pp. 392–403 (1995)
Martin, T., Shen, Y., Azvine, B.: Incremental Evolution of Fuzzy Grammar Fragments to Enhance Instance Matching and Text Mining. IEEE Transactions on Fuzzy Systems 16, 1425–1438 (2008)
Martin, T., Shen, Y., Azvine, B.: Automated semantic tagging using fuzzy grammar fragments. In: 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence), pp. 2224–2229 (June 2008)
Sharef, N.M., Martin, T., Shen, Y.: Order Independent Incremental Evolving Fuzzy Grammar Fragment Learner. In: Ninth International Conference on Intelligent Systems Design and Applications, pp. 1221–1226 (2009)
Sharef, N.M., Shen, Y.: Text Fragment Extraction using Incremental Evolving Fuzzy Grammar Fragments Learner. In: World Congress on Computational Intelligence, pp. 18–23 (2010)
Sharef, N.M., Martin, T., Shen, Y.: Minimal Combination for Incremental Grammar Fragment Learning (2009)
Sharef, N.M.: Text Fragment Identification with Evolving Fuzzy Grammars (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mohd Sharef, N. (2012). Location Recognition with Fuzzy Grammar. In: Lukose, D., Ahmad, A.R., Suliman, A. (eds) Knowledge Technology. KTW 2011. Communications in Computer and Information Science, vol 295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32826-8_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-32826-8_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32825-1
Online ISBN: 978-3-642-32826-8
eBook Packages: Computer ScienceComputer Science (R0)