A Study on Corpus-based Stopword Lists in Indian Language IR
Abstract
References
Index Terms
- A Study on Corpus-based Stopword Lists in Indian Language IR
Recommendations
Effect of Stopwords and Stemming Techniques in Urdu IR
AbstractThis paper explores and evaluates the effect of different stopword removal and stemming techniques in Urdu IR. The issues are examined from four viewpoints. Is there any performance difference between non-corpus-based and corpus-based stopword ...
A Fast Corpus-Based Stemmer
Stemming is a mechanism of word form normalization that transforms the variant word forms to their common root. In an Information Retrieval system, it is used to increase the system’s performance, specifically the recall and desirably the precision. ...
Lemmatization and stopword elimination in Greek web searching
EATIS '07: Proceedings of the 2007 Euro American conference on Telematics and information systemsThis paper explores the effect of noun lemmatization and stopword removal in Greek Web searching. A light lemmatizer is presented and applied in a retrieval experiment. Stopwords are removed from user queries. In both experiments an increase in ...
Comments
Information & Contributors
Information
Published In
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Funding Sources
- IIT (B.H.U), Varanasi
- National Supercomputing Mission, Government of India at the IIT (B.H.U)
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 144Total Downloads
- Downloads (Last 12 months)59
- Downloads (Last 6 weeks)5
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in