Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 131))

Abstract

Today's computing devices generate abundant information, which has to be classified and stored so that it is easier to navigate. We explore semi-supervised learning, which lies between supervised and unsupervised learning, and incorporate fuzziness into the text classification process. We apply semi-supervised classification in a setting with far less training data than supervised training requires: in addition to unlabeled data, the algorithm is given some supervision information, but not for every example. Moreover, the traditional KNN algorithm gives every feature the same weight in every class, which is not reasonable. Using the concept of variance, we explore assigning different weights to a feature in different classes, which enhances the traditional KNN algorithm.
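The idea of class-dependent feature weights can be illustrated with a minimal sketch. The abstract does not give the weighting formula, so everything below is an assumption made for illustration rather than the authors' method: each feature's weight within a class is taken to be inversely proportional to that feature's variance inside the class, and the distance from a query to a labeled example uses the weights of that example's class. The function names `class_feature_weights` and `weighted_knn_predict` and the smoothing term `eps` are hypothetical. The fuzzy clustering and semi-supervised steps are not covered.

```python
from collections import Counter, defaultdict
from statistics import pvariance


def class_feature_weights(X, y, eps=1e-9):
    """Return {label: [weight per feature]}.

    Assumption (not from the paper): a feature that varies little within
    a class is more characteristic of it, so its weight is 1 / variance,
    normalised so that the weights of each class sum to 1.
    """
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)

    weights = {}
    for label, rows in by_class.items():
        # zip(*rows) iterates over feature columns of this class.
        raw = [1.0 / (pvariance(col) + eps) for col in zip(*rows)]
        total = sum(raw)
        weights[label] = [w / total for w in raw]
    return weights


def weighted_knn_predict(X, y, query, k=3, eps=1e-9):
    """Majority vote among the k nearest labeled examples, where the
    distance to each example uses the weights of that example's class."""
    weights = class_feature_weights(X, y, eps)
    scored = []
    for row, label in zip(X, y):
        w = weights[label]
        dist = sum(wi * (a - b) ** 2 for wi, a, b in zip(w, row, query)) ** 0.5
        scored.append((dist, label))
    scored.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Toy data: feature 0 is stable within each class, feature 1 is noisy.
X = [[1.0, 0.0], [1.0, 5.0], [1.0, 10.0],
     [5.0, 4.0], [5.0, 5.0], [5.0, 6.0]]
y = ["a", "a", "a", "b", "b", "b"]

print(weighted_knn_predict(X, y, [1.2, 5.0], k=3))
```

In this toy data the stable feature dominates the distance for both classes, so a query is assigned mainly by its first coordinate. With real term-frequency vectors, a larger `eps` would likely be needed to keep zero-variance features from swamping all others.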

Author information

Corresponding author

Correspondence to M. A. Wajeed.

Copyright information

© 2012 Springer India Pvt. Ltd.

About this paper

Cite this paper

Wajeed, M.A., Adilakshmi, T. (2012). Incorporating Fuzzy Clusters in Semi-supervised Text Categorization Using Enhanced KNN Algorithm. In: Deep, K., Nagar, A., Pant, M., Bansal, J. (eds) Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011) December 20-22, 2011. Advances in Intelligent and Soft Computing, vol 131. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0491-6_39

  • DOI: https://doi.org/10.1007/978-81-322-0491-6_39

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-0490-9

  • Online ISBN: 978-81-322-0491-6

  • eBook Packages: Engineering, Engineering (R0)
