Abstract
E-commerce is a continuously growing and competitive market. There are several motivations for e-shoppers, sellers and manufacturers to require an automated approach for matching product offers from various online sources referring to the same or a similar real-world product. Currently, there are several approaches for the assignment of identical and similar product offers. These existing approaches are not sufficient for performing a precise comparison as they only return a similarity value for two compared products but do not give any information for further calculations and analyses. The contribution of this paper is a novel approach and an algorithm for matching identical and very similar product offers based on the pairwise comparison of the product names. For this purpose the approach uses different similarity values which are based on an existing string similarity measure. The approach is independent from a specific product domain or data source.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Bing Shopping had been discontinued in 2013.
References
Nagelvoort, B., van Welie, R., van den Brink, P., Weening, A., Abraham, J.: Europe B2C E-commerce LIGHT Report 2014 (2014). https://www.ecommerce-europe.eu/website/facts-figures/light-version/download
PostNord: E-Commerce in Europe 2014 (2014). http://www.postnord.com/en/media/publications/e-commerce-archive/
Civic Consulting: Consumer market study on the functioning of e-commerce and Internet marketing and selling techniques in the retail of goods (2011). http://www.civic-consulting.de/reports/study_ecommerce_goods_en.pdf
Horch, A., Kett, H., Weisbecker, A.: Mining e-commerce data from e-shop websites. In: Proceedings of the 2015 IEEE Trustcom/BigDataSE/ISPA, pp. 153–160 (2015)
Horch, A., Kett, H., Weisbecker, A.: A lightweight approach for extracting product records from the web. In: Proceedings of the 11th International Conference on Web Information Systems and Technologies, pp. 420–430 (2015)
Horch, A., Kett, H., Weisbecker, A.: Extracting product unit attributes from product offers by using an ontology. In: Proceedings of the Second International Conference on Computer Science, Computer Engineering, and Social Media, pp. 67–71 (2015)
Yang, L., Sarker, B.K., Bhavsar, V.C., Boley, H.: A weighted-tree simplicity algorithm for similarity matching of partial product descriptions. In: IASSE, pp. 55–60 (2005)
Balog, K.: On the investigation of similarity measures for product resolution. In: Proceedings of the Workshop on Discovering Meaning on the Go in Large Heterogeneous Data 2011, pp. 49–54 (2011)
Gopalakrishnan, V., Iyengar, S.P., Madaan, A., Rastogi, R., Sengamedu, S.: Matching product titles using web-based enrichment. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 605–614 (2012)
Getoor, L., Machanavajjhala, A.: Entity resolution: theory, practice & open challenges. In: International Conference on Very Large Data Bases (2012)
Wang, J., Kraska, T., Franklin, M.J., Feng, J.: CrowdER: crowdsourcing entity resolution. Proc. VLDB Endow. 5, 1483–1494 (2012)
Kolb, L., Rahm, E.: Parallel entity resolution with Dedoop. Datenbank-Spektrum 1, 23–32 (2013)
Passos, A., Kumar, V., McCallum, A.: Lexicon infused phrase embeddings for named entity resolution. In: Proceedings of the Eighteenth Conference on Computational Natural Language Learning, pp. 78–86 (2014)
Thor, A.: Toward an adaptive string similarity measure for matching product offers. Informatik 2010: Service Science - Neue Perspektiven für die Informatik, Beiträge der 40. Jahrestagung der Gesellschaft für Informatik e.V. (GI) 1, 702–710 (2010)
Kannan, A., Givoni, I.E., Agrawal, R., Fuxman, A.: Matching unstructured product offers to structured product specifications. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 404–412 (2011)
Köpcke, H., Thor, A., Thomas, S., Rahm, E.: Tailoring entity resolution for matching product offers. In: Proceedings of the 15th International Conference on Extending Database Technology, pp. 545–550 (2012)
Londhe, N., Gopalakrishnan, V., Zhang, A., Ngo, H.Q., Srihari, R.: Matching titles with cross title web-search enrichment and community detection. Proc. VLDB Endow. 7, 1167–1178 (2014)
Yetgin, Z., Gözükara, F.: New metrics for clustering of identical products over imperfect data. Turk. J. Electr. Eng. Comput. Sci. 23, 1–14 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Horch, A., Kett, H., Weisbecker, A. (2016). Matching Product Offers of E-Shops. In: Cao, H., Li, J., Wang, R. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science(), vol 9794. Springer, Cham. https://doi.org/10.1007/978-3-319-42996-0_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-42996-0_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42995-3
Online ISBN: 978-3-319-42996-0
eBook Packages: Computer ScienceComputer Science (R0)