Abstract
As of today, opinion mining has been widely used to identify the strength and weakness of products (e.g., cameras) or services (e.g., services in medical clinics or hospitals) based upon people’s feedback such as user reviews. Feature extraction is a crucial step for opinion mining which has been used to collect useful information from user reviews. Most existing approaches only find individual features of a product without the structural relationships between the features which usually exists. In this paper, we propose an approach to extract features and feature relationship, represented as tree structure called a feature hierarchy, based on frequent patterns and associations between patterns derived from user reviews. The generated feature hierarchy profiles the product at multiple levels and provides more detailed information about the product. Our experiment results based on some popularly used review datasets show that the proposed feature extraction approach can identify more correct features than the baseline model. Even though the datasets used in the experiment are about cameras, our work can be applied to generate features about a service such as the services in hospitals or clinics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Abbasi, A., Chen, H., Salem, A.: Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Transactions on Information Systems 26(3) (2008)
Ding, X., Liu, B., Yu, P.S.: A Holistic Lexicon-Based Approach to Opinion Mining. In: International Conference on Web Search and Web Data Mining, pp. 231–240 (2008)
Hu, M., Liu, B.: Mining Opinion Features in Customer Reviews. In: 19th National Conference on Artifical Intelligence, pp. 755–760 (2004)
Hu, M., Liu, B.: Mining and Summarizing Customer Reviews. In: 10th ACM SIGKDD International Conference on Knowledge discovery and Data Mining, pp. 168–177. ACM, New York (2004)
Hu, W., Gong, Z., Guo, J.: Mining Product Features from Online Reviews. In: IEEE 7th International Conference on E-Business Engineering, pp. 24–29 (2010)
Lau, R.Y.K., Lai, C.C.L., Ma, J., Li, Y.: Automatic Domain Ontology Extraction for Context-Sensitive Opinion Mining. In: Thirtieth International Conference on Information Systems (2009)
Pasquier, N., Bastide, Y., Taouil, R., Lakhal, L.: Efficient mining of association rules using closed itemset lattices. Information Systems 24(1), 25–46 (1999)
Popescu, A.-M., Etzioni, O.: Extracting Product Features and Opinions from Reviews. In: Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 339–346. Association for Computational Linguistics Stroudsburg, PA (2005)
Scaffidi, C., Bierhoff, K., Chang, E., Felker, M., Ng, H., Jin, C.: Red Opal: Product-Feature Scoring from Reviews. In: 8th ACM Conference on Electronic Commerce, pp. 182–191. ACM, New York (2007)
Subrahmanian, V., Reforgiato, D.: AVA: Adjective-Verb-Adverb Combinations for Sentiment Analysis. In: IEEE Intelligent Systems, pp. 43–50 (2008)
Wright, A.: Our sentiments, exactly. Communications of the ACM 52(4), 14–15 (2009)
Xu, Y., Li, Y., Shaw, G.: Reliable Representations for Association Rules. Data and Knowledge Engineering 70(6), 237–256 (2011)
Opinion Mining, Sentiment Analysis, and Opinion Spam Detection, http://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tian, N., Xu, Y., Li, Y., Pasi, G. (2013). Structured Feature Extraction Using Association Rules. In: Li, J., et al. Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science(), vol 7867. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40319-4_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-40319-4_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40318-7
Online ISBN: 978-3-642-40319-4
eBook Packages: Computer ScienceComputer Science (R0)