Tweets mining using WIKIPEDIA and impurity cluster measurement | IEEE Conference Publication | IEEE Xplore

Tweets mining using WIKIPEDIA and impurity cluster measurement


Abstract:

Twitter is one of the fastest growing online social networking services. Tweets can be categorized into trends and are associated with tags and with follower/following social relationships. This categorization is neither accurate nor effective because tweet messages are short and the data corpus is noisy. In this paper, we attempt to overcome these challenges with an extended feature vector and a semi-supervised clustering technique. To this end, the training set is expanded with Wikipedia topic search results, and the feature set is extended accordingly. An impurity measurement is introduced into our classifier platform, both when building the clustering model and when performing classification. Our experimental results show that the proposed techniques outperform other classifiers, with reasonable precision and recall.
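
The abstract does not specify which impurity measure is used, so the following is only a minimal sketch of what a per-cluster impurity measurement could look like in a semi-supervised setting. It assumes Gini impurity computed over the labeled members of each cluster; the function names `gini_impurity` and `cluster_impurities` and the sample topic labels are hypothetical and do not come from the paper.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a list of class labels (assumed measure, not the paper's).

    Returns 0.0 for a pure (or empty) cluster; larger values mean more mixing.
    """
    if not labels:
        return 0.0
    total = len(labels)
    counts = Counter(labels)
    return 1.0 - sum((count / total) ** 2 for count in counts.values())


def cluster_impurities(assignments, labels):
    """Map each cluster id to the impurity of its labeled members.

    assignments: cluster id for each tweet.
    labels: topic label for each tweet, or None when the tweet is unlabeled.
    Unlabeled tweets are ignored when computing impurity.
    """
    by_cluster = {}
    for cluster_id, label in zip(assignments, labels):
        members = by_cluster.setdefault(cluster_id, [])
        if label is not None:
            members.append(label)
    return {cid: gini_impurity(members) for cid, members in by_cluster.items()}


# Hypothetical example: five tweets in two clusters, one tweet unlabeled.
assignments = [0, 0, 1, 1, 1]
labels = ["sports", "sports", "tech", None, "politics"]
impurities = cluster_impurities(assignments, labels)
```

Under this assumption, a cluster whose labeled tweets all share one topic scores 0.0, while a cluster split evenly between two topics scores 0.5; a clustering procedure could use such a score to decide whether a cluster needs further splitting.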
Date of Conference: 23-26 May 2010
Date Added to IEEE Xplore: 14 June 2010
Conference Location: Vancouver, BC
