Abstract
This paper proposes a method based on Markov Logic Network (MLN) to determine the time order of entity attribute values. We use the characteristics of web sources’ currency, web sources inter-dependency and attribute data currency in a certain web source as predicates in MLN. We define five rules (new rules can be added) to infer the currency of different values provided by different sources. On one hand, this method considers currency problem based on entity attribute instead of the entire entity, which is critical to improve the quality of data provided by Web Integration Systems; on the other hand, this method summarizes characteristics of web sources and web data based on carefully analysis. It is noteworthy that it is not complicate for the MLN model to incorporate new rules, which shows that the proposed method is extensible.
Keywords
This work is supported by the Shandong Province Natural Science Fund (No. ZR2015PF011).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fan, W., Geerts, F., Wijsen, J.: Determining the currency of data. ACM Trans. Database Syst. (TODS) 37(4), 25 (2012)
Li, X., Dong, X.L., Lyons, K., et al.: Truth finding on the deep web: is the problem solved? Proc. VLDB Endow. 6(2), 97–108 (2012)
Dong, X.L., Naumann, F.: Data fusion–resolving data conflicts for integration. PVLDB 2(2), 1654–1655 (2009)
Galland, A., Abiteboul, S., Marian, A., Senellart, P.: Corroborating information from disagreeing views. In: WSDM, pp. 131–140 (2010)
Pasternack, J., Roth, D.: Knowing what to believe (when you already know something). In: COLING, pp. 877–885 (2010)
Pasternack, J., Roth, D.: Making better informed trust decisions with generalized fact-finding. In: IJCAI, pp. 2324–2329 (2011)
Eckerson, W.W.: Data quality and the bottom line: achieving business success through a commitment to high quality data. Data Warehousing Institute (2002)
Chiang, Y.H., Doan, A.H., Naughton, J.F.: Modeling entity evolution for temporal record matching. In: Proceedings of the ACM Conference on Management of Data (SIGMOD), pp 1175–1186 (2014)
Pal, A., et al.: Information integration over time in unreliable and uncertain environments. In: Proceedings of the 19th International World Wide Web Conference, pp. 789–798 (2012)
Christen, P., Gayler, R.W.: Adaptive temporal entity resolution on dynamic databases. In: Proceedings of the 17th Pacific-Asia Conference in Knowledge Discovery and Data Mining, pp. 558–569 (2013)
Li, P., Dong, X., Maurino, A., Srivastava, D.: Linking temporal records. Proc. VLDB Endow. 4(11), 956–967 (2011)
Chiang, Y.H., Doan, A., Naughton, J.F.: Tracking entities in the dynamic world: A fast algorithm for matching temporal records. Proc. VLDB Endow. 7(6), 469–480 (2014)
Richardson, M., Domingos, P.: Markov logic networks. Mach. Learn. (ML) 62(1–2), 107–136 (2006)
Dong, X.L., Berti-Equille, L., Srivastava, D.: Truth discovery and copying detection in a dynamic world. PVLDB 2(1), 562–573 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Zhang, Y., Zhang, R. (2016). Determining Web Data Currency Based on Markov Logic Network. In: Che, W., et al. Social Computing. ICYCSEE 2016. Communications in Computer and Information Science, vol 623. Springer, Singapore. https://doi.org/10.1007/978-981-10-2053-7_30
Download citation
DOI: https://doi.org/10.1007/978-981-10-2053-7_30
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2052-0
Online ISBN: 978-981-10-2053-7
eBook Packages: Computer ScienceComputer Science (R0)