Abstract
Personal Family Network is an important component of social networks, therefore, it is of great importance of how to extract personal family relationships. We propose a novel method to construct personal families based on bootstrapping and coreference resolution on top of a search engine. It begins with seeds of personal relations to discover relational patterns in a bootstrapping fashion, then personal relations are further extracted via these learned patterns, finally family networks are fused using cross-document coreference resolution. The experimental results on a large-scale corpus of Gigaword show that, our method can build accurate family networks, thereby laying the foundation for social network analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kautz, H., Selman, B., Shah, M.: Referral Web: combining social networks and collaborative filtering. Communications of the ACM 40(3), 63–65 (1997)
Flink, M.P.: Semantic web technology for the extraction and analysis of social networks. Web Semantics: Science, Services and Agents on the World Wide Web 3(2), 211–223 (2005)
Tang, J., Zhang, J., Yao, L., et al.: ArnetMiner: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998. ACM (2008)
Elson, D.K., Dames, N., McKeown, K.R.: Extracting social networks from literary fiction. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 138–147. Association for Computational Linguistics (2010)
Agarwal, A., Corvalan, A., Jensen, J., et al.: Social Network Analysis of Alice in Wonderland. NAACL-HLT 2012, 88 (2012)
van de Camp, M., van den Bosch, A.: A link to the past: constructing historical social networks. In: Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, pp. 61–69. Association for Computational Linguistics (2011)
Zhou, G.D., Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge. Information Processing & Management 43(4), 969–982 (2007)
Oh, J.H., Uchimoto, K., Torisawa, K.: Bilingual co-training for monolingual hyponymy-relation acquisition. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and The 4th International Joint Conference on Natural Language Processing of the AFNLP, vol. 1, pp. 432–440. Association for Computational Linguistics (2009)
Zhang, M., Su, J., Wang, D., Zhou, G., Tan, C.-L.: Discovering relations between named entities from a large raw corpus using tree similarity-based clustering. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 378–389. Springer, Heidelberg (2005)
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th Conference on Computational linguistics, vol. 2, pp. 539–545. Association for Computational Linguistics (1992)
Pantel, P., Pennacchiotti, M.: Espresso: Leveraging generic patterns for automatically harvesting semantic relations. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pp. 113–120. Association for Computational Linguistics (2006)
Conglei, Y., Nan, D.: An Extraction Method on Web. Pattern Recognition and Artificial Intelligence 2(6) (2007) (in Chinese)
Tian, G., Qian, M., Huaping, Z.: A Research on Social Network Extraction Based on Web Search Engine. Chinese Product Reviews Filtering (2009) (in Chinese)
Peng, C., Gu, J., Qian, L.: Research on Tree Kernel-Based Personal Relation Extraction. In: Zhou, M., Zhou, G., Zhao, D., Liu, Q., Zou, L. (eds.) NLPCC 2012. CCIS, vol. 333, pp. 225–236. Springer, Heidelberg (2012)
Zhu, J., Nie, Z., Liu, X., et al.: StatSnowball: a statistical approach to extracting entity relationships. In: Proceedings of the 18th International Conference on World Wide Web, pp. 101–110. ACM (2009)
Cover, T., Thomas, J., Proakis, J.G., et al.: Elements of information theory. telecommunications. Wiley series (1991)
Pantel, P., Ravichandran, D.: Automatically labeling semantic classes. In: Proceedings of HLT/NAACL, vol. 4, pp. 321–328 (2004)
Gooi, C.H., Allan, J.: Cross-document coreference on a large scale corpus. In: HLT-NAACL, pp. 9–16 (2004)
Mayfield, J., Alexander, D., Dorr, B., et al.: Cross-document coreference resolution: A key technology for learning by reading. In: AAAI Spring Symposium on Learning by Reading and Learning to Read (2009)
Malin, B.: Unsupervised name disambiguation via social network similarity. In: Workshop on Link Analysis, Counterterrorism, and Security, vol. 1401, pp. 93–102 (2005)
Bagga, A.: Evaluation of coreferences and coreference resolution systems. In: Proceedings of the First Language Resource and Evaluation Conference, pp. 563–566 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gu, J., Hu, Y., Qian, L., Zhu, Q. (2013). Research on Building Family Networks Based on Bootstrapping and Coreference Resolution. In: Zhou, G., Li, J., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2013. Communications in Computer and Information Science, vol 400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41644-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-41644-6_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41643-9
Online ISBN: 978-3-642-41644-6
eBook Packages: Computer ScienceComputer Science (R0)