Abstract
User-generated content (UGC) implies user behaviors. Mining such data helps in understanding the relationship between social media and the real world. However, UGC is usually of low quality, which makes semantic entity extraction difficult. In this paper, we propose a method for high-quality semantic entity refinement on forums that employs external resources. Experiments on real-life Chinese online forums show the effectiveness of our method.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Du, J., Zhang, W., Cai, P., Ma, L., Qian, W., Zhou, A. (2011). Towards High-Quality Semantic Entity Detection over Online Forums. In: Datta, A., Shulman, S., Zheng, B., Lin, SD., Sun, A., Lim, EP. (eds) Social Informatics. SocInfo 2011. Lecture Notes in Computer Science, vol 6984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24704-0_31
DOI: https://doi.org/10.1007/978-3-642-24704-0_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24703-3
Online ISBN: 978-3-642-24704-0
eBook Packages: Computer Science, Computer Science (R0)