Abstract
We recently proposed a novel sentential association based approach SAT-MOD for text classification, which views a sentence rather than a document as an association transaction, and uses a novel heuristic called MODFIT to select the most significant itemsets for constructing a category classifier. Based on SAT-MOD, we have developed a prototype system called SAT-Class. In this demo, we demonstrate the effectiveness of our text classification system, and also the readability and refinability of acquired classification rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proc. of the 20th Very Large Data Bases, pp. 487–499 (1994)
Sebastiani, F.: Machine Learning in Automated Text Categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: SIGKDD, pp. 80–86 (1998)
Antonie, M., Zaiane, O.R.: Text Document Categorization by Term Association. In: Proc. of IEEE Intl. Conf. on Data Mining, pp. 19–26 (2002)
Meretakis, D., Fragoudis, D., Lu, H., Likothanassis, S.: Scalable Association-based Text Classification. In: Proc. of ACM CIKM (2000)
Feng, J., Liu, H., Zou, J.: SAT-MOD: Moderate Itemset Fittest for Text Classification. In: www 2005 (2005) (Submitted)
The Reuters-21578 Dataset
http://www.daviddlewis.com/resources/testcollections/reuters21578/
Klimt, B., Yang, Y.: The Enron Corpus: A New Dataset for Email Classification Research. In: Proc. European Conf. on Machine Learning, Pisa, Italy (2004)
Quinlan, J.R., Cameron-Jones, R.M.: FOIL: A Midterm Report. In: Proc. European Conf. MachineLearning, pp. 3–20 (1993)
Borgelt, C.: Efficient Implementation of Apriori and Eclat. In: Proc. of the first Workshop on Frequent Itemset Mining Implementations (2003)
The RDBMS DM4., http://www.dameng.cn
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Feng, J., Liu, H., Feng, Y. (2005). Sentential Association Based Text Classification Systems. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds) Web Technologies Research and Development - APWeb 2005. APWeb 2005. Lecture Notes in Computer Science, vol 3399. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31849-1_101
Download citation
DOI: https://doi.org/10.1007/978-3-540-31849-1_101
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25207-8
Online ISBN: 978-3-540-31849-1
eBook Packages: Computer ScienceComputer Science (R0)