Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition

Ekbal, Asif; Saha, Sriparna

doi:10.1007/s13042-014-0268-7

Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition

Original Article
Published: 06 July 2014

Volume 7, pages 597–611, (2016)
Cite this article

International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Asif Ekbal¹ &
Sriparna Saha¹

462 Accesses
15 Citations
Explore all metrics

Abstract

In this paper, we propose an efficient algorithm based on the concept of multiobjective optimization (MOO) for performing feature selection and parameter optimization of any machine learning technique. Feature and parameter combinations have significant effect to the accuracy of the classifier. We perform feature selection and parameter optimization for four different classifiers, namely conditional random field, support vector machine, memory based learner and maximum entropy. The proposed algorithms are evaluated for solving the problems of named entity recognition, an important component in many text processing applications. Currently we experiment with four different languages, namely Bengali, Hindi, Telugu and English. At first the proposed MOO based technique is used to determine the appropriate features and parameters. For each of the classifiers, the algorithm produces a set of solutions on the final Pareto optimal front. Each solution represents a classifier with a particular feature and parameter combination. All these solutions are thereafter combined using a MOO based classifier ensemble technique. Evaluation results show that the proposed approach attains the F-measure (harmonic mean of recall and precision) values of 90.48, 90.44, 78.71 and 88.68 % for Bengali, Hindi, Telugu and English, respectively. We also show that for all the experimental settings the proposed feature and parameter optimization technique performs reasonably better than the baseline systems, developed with random feature subsets. Comparisons with the existing works also show the efficacy of our proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MODE: multiobjective differential evolution for feature selection and classifier ensemble

Article 10 January 2015

Utpal Kumar Sikdar, Asif Ekbal & Sriparna Saha

Feature selection for entity extraction from multiple biomedical corpora: A PSO-based approach

Article 17 August 2017

Shweta Yadav, Asif Ekbal & Sriparna Saha

Evolutionary Approach for Classifier Ensemble: An Application to Bio-molecular Event Extraction

Notes

http://ltrc.iiit.ac.in/ner-ssea-08.
http://crfpp.sourceforge.net.
http://chasen-org/taku/software/yamcha/.
http://maxent.sourceforge.net/.
http://ltrc.iiit.ac.in/ner-ssea-08.
http://www.eci.gov.in/DevForum/Fullname.asp.

References

Yao L, Sun C, Wu Y, Wang X, Wang X (2011) Biomedical named entity recognition using generalized expectation criteria. Int J Mach Learn Cybern 2(4):235–243
Article Google Scholar
Cunningham H (2002) GATE, a general architecture for text engineering. Comput Humanit 36:223–254
Article Google Scholar
Babych B, Hartley A (2003) Improving machine translation quality with automatic named entity recognition. In: Proceedings of EAMT/EACL 2003 workshop on MT and other language technology tools, pp 1–8
Moldovan D, Harabagiu S, Girju R, Morarescu P, Lacatusu F, Novischi A, Badulescu A, Bolohan O (2002) LCC tools for question answering. In: Text retrieval conference (TREC)
Nobata C, Sekine S, Isahara H, Grishman R (2002) Summarization system integrated with named entity tagging and IE pattern discovery. In: Proceedings of third international conference on language resources and evaluation (LREC 2002), Spain
Miller S, Crystal M, Fox H, Ramshaw L, Schawartz R, Stone R, Weischedel R, the Annotation Group (1998) BBN: description of the SIFT system as used for MUC-7. In: MUC-7, Fairfax, Virginia
Bikel DM, Schwartz RL, Weischedel RM (1999) An algorithm that learns what’s in a name. Mach Learn 34(1–3):211–231
Article MATH Google Scholar
Borthwick A (1999) Maximum entropy approach to named entity recognition. Ph.D. thesis, New York University
Borthwick A, Sterling J, Agichtein E, Grishman R (1998) NYU: description of the MENE named entity system as used in MUC-7. In: MUC-7, Fairfax
Wang XZ, Dong CR (2009) Improving generalization of fuzzy if-then rules by maximizing fuzzy entropy. IEEE Trans Fuzzy Syst 17(3):556–567
Article Google Scholar
Wang XZ, Dong LC, Yan JH (2012) Maximum ambiguity-based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505
Article Google Scholar
Sekine S (1998) Description of the Japanese NE system used for MET-2. In: MUC-7, Fairfax, Virginia
Bennet SW, Aone C, Lovell C (1997) Learning to tag multilingual texts through observation. In: Proceedings of empirical methods of natural language processing, Providence, Rhode Island, pp 109–116
McCallum, A, Li W (2003) Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of CoNLL, Canada, pp 188–191
Lafferty, JD, McCallum A, Pereira FCN (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML, pp 282–289
Chen WJ, Shao YH, Hong N (2013) Laplacian smooth twin support vector machine for semi-supervised classification. Int J Mach Learn Cybern 5(3):459–468
Sun L, Mu WS, Qi B, Zhou ZJ (2014) A new privacy-preserving proximal support vector machine for classification of vertically partitioned data. Int J Mach Learn Cybern. doi:10.1007/s13042-014-0245-1
Collins M, Singer Y (1999) Unsupervised models for named entity classification. In: Proceedings of the joint SIGDAT conference on empirical methods in natural language processing and very large corpora
Riloff E, Jones R (1999) Learning dictionaries for information extraction by multi-level bootstrapping. In: Proceedings AAAI ’99/IAAI ’99: Proceedings of the sixteenth national conference on artificial intelligence and the eleventh conference on innovative applications of artificial intelligence, pp 474–479
Yangarber R, Lin W, Grishman R (2002) Unsupervised learning of generalized names. In: Proceedings of the 19th international conference on computational linguistics (COLING-2002), pp 1–7
Alfonseca E, Manandhar S (1999) An unsupervised method for general named entity recognition and automated concept discovery. In: Proceedings AAAI ’99/IAAI ’99: Proceedings of the sixteenth national conference on artificial intelligence and the eleventh conference on innovative applications of artificial intelligence, pp 474–479
Shinyama Y, Sekine S (2004) Named entity discovery using comparable news articles. In: Proceedings of the international conference on computational linguistics (COLING), Switzerland, pp 848–855
Etzioni O, Cafarrella M, Downey D, Popescu AM, Shaked T, Soderland S, Weld DS, Yates A (2005) Unsupervised named entity extraction from the web: an experimental study. Artif Intell 165:91–134
Article Google Scholar
Mikheev A, Grover C, Moens M (1998) Description of the LTG system used for MUC-7. In: MUC-7, Fairfax, Virginia
Srihari R, Niu C, Li W (2002) A hybrid approach for named entity and sub-type tagging. In: Proceedings of sixth conference on applied natural language processing (ANLP), pp 247–254
Yu X (2007) Chinese named entity recognition with cascaded hybrid model. In: Proceedings of NAACL HLT 2007, Prague, pp 197–200
Ekbal A, Bandyopadhyay S (2009) A conditional random field approach for named entity recognition in Bengali and Hindi. Linguist Issues Lang Technol (LiLT) 2(1):1–44
Google Scholar
Ekbal A, Naskar S, Bandyopadhyay S (2007) Named entity recognition and transliteration in Bengali. Named Entities: Recognit Classif Use Spec Issue Lingvist Investig J 30(1):95–114
Google Scholar
Patel A, Ramakrishnan G, Bhattacharya P (2009) Relational learning assisted construction of rule base for Indian language NER. In: Proceedings of ICON 2009: 7th international conference on natural language processing, India
Li W, McCallum A (2004) Rapid development of hindi named entity recognition using conditional random fields and feature induction. ACM Trans Asian Lang Inf Process 2(3):290–294
Article Google Scholar
Saha S, Sarkar S, Mitra P (2008) A hybrid feature set based maximum entropy Hindi named entity recognition. In: Proceedings of the 3rd international joint conference in natural langauge processing (IJCNLP 2008), pp 343–350
Shishtla PM, Pingali P, Varma V (2008) A character n-gram based approach for improved recall in Indian language NER. In: Proceedings of the IJCNLP-08 workshop on NER for South and South East Asian Languages, pp 101–108
Liu H, Yu L (2005) Toward integrating feature selection algorithms for classification and clustering. IEEE Trans Knowl Data Eng 17(4):491–502
Article Google Scholar
Liu H, Motoda H (1998) Feature selection for knowledge discovery and data mining. Kluwer Academic Publishers, Norwell
Book MATH Google Scholar
Oh IS, Lee JS, Moon BR (2004) Hybrid genetic algorithms for feature selection. IEEE Trans Pattern Anal Mach Intell 26(11):1424–1437
Article Google Scholar
Ekbal A, Saha S (2012) Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition. IJDAR 15(2):143–166
Article Google Scholar
Ekbal A, Saha S (2013) Full length article: Simulated annealing based classifier ensemble techniques: application to part of speech tagging. Inf Fusion 14(3):288–300
Article Google Scholar
Deb K (2001) Multi-objective optimization using evolutionary algorithms. Wiley, England
MATH Google Scholar
Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evolut Comput 6(2):181–197
Article Google Scholar
Daelemans W, den Bosch AV (2005) Memory-based language processing. Cambridge University Press, Cambridge
Book Google Scholar
Aha DW, Kibler D, Albert M (1991) Instance-based learning algorithms. Mach Learn 6:37–66
Google Scholar
Daelemans W, Zavrel J, van den Bosch A, van der Sloot K (2010) Mbt:memory-based tagger. In: Version 3.2, reference guide. ILK technical report 10–04. http://ilk.uvt.nl/downloads/pub/papers/ilk.1004.pdf
Darroch J, Ratcliff D (1972) Generalized iterative scaling for log-linear models. Ann Math Stat 43:1470–1480
Article MathSciNet MATH Google Scholar
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Book MATH Google Scholar
Holland JH (1975) Adaptation in natural and artificial systems. The University of Michigan Press, Ann Arbor
Google Scholar
Tjong Kim Sang EF, De Meulder F (2003) Introduction to the Conll-2003 shared task: language independent named entity recognition. In: Proceedings of the seventh conference on natural language learning at HLT-NAACL 2003, pp 142–147
Florian R, Ittycheriah A, Jing H, Zhang T (2003) Named entity recognition through classifier combination. In: Proceedings of the Seventh conference on natural language learning at HLT-NAACL 2003
Lin D, Wu X (2009) Phrase Clustering for discriminative learning. In: Proceedings of 47th annual meeting of the ACL and the 4th IJCNLP of the AFNLP, pp 1030–1038
Suzuki J, Isozaki H (2008) Semi-supervised sequential labeling and segmentation using Gigaword Scale unlabeled data. In: Proceedings of ACL/HLT-08, pp 665–673
Chieu HL, Ng HT (2003) Named entity recognition with a maximum entropy approach. In: Proceedings of CoNLL-2003, HLT-NAACL 2003, pp 160–163
Wu D, Ngai G, Carput M (2003) A stacked, voted, stacked model for named entity recognition. In: Proceedings of the CoNLL-2003, HLT-NAACL
Klein D, Smarr J, Nguyen H, Manning CD (2003) Named entity recognition with character-level models. In: Proceedings of CoNLL-2003, HLT-NAACL 2003, pp 188–191
Ekbal A, Bandyopadhyay S (2008) A web-based Bengali news corpus for named entity recognition. Lang Resour Eval J 42(2):173–182
Article Google Scholar
Singh AK (2008) Named entity recognition for South and South East Asian languages: taking stock. In: Proceedings of the IJCNLP-08 workshop on NER for South and South East Asian Languages, IJCNLP-08, India

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology, Patna, India
Asif Ekbal & Sriparna Saha

Authors

Asif Ekbal
View author publications
You can also search for this author in PubMed Google Scholar
Sriparna Saha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Asif Ekbal.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ekbal, A., Saha, S. Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition. Int. J. Mach. Learn. & Cyber. 7, 597–611 (2016). https://doi.org/10.1007/s13042-014-0268-7

Download citation

Received: 15 October 2013
Accepted: 13 May 2014
Published: 06 July 2014
Issue Date: August 2016
DOI: https://doi.org/10.1007/s13042-014-0268-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition

Abstract

Access this article

Similar content being viewed by others

MODE: multiobjective differential evolution for feature selection and classifier ensemble

Feature selection for entity extraction from multiple biomedical corpora: A PSO-based approach

Evolutionary Approach for Classifier Ensemble: An Application to Bio-molecular Event Extraction

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Simultaneous feature and parameter selection using multiobjective optimization: application to named entity recognition

Abstract

Access this article

Similar content being viewed by others

MODE: multiobjective differential evolution for feature selection and classifier ensemble

Feature selection for entity extraction from multiple biomedical corpora: A PSO-based approach

Evolutionary Approach for Classifier Ensemble: An Application to Bio-molecular Event Extraction

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation