Abstract
E-commerce is a constantly growing and competitive market. Online prices are updated daily or even more frequently, and it is very important for e-shoppers to find the lowest price online. Therefore, e-shop owners need to know the prices of their competitors and must be able to adjust their own prices in order to remain competitive. The manual monitoring of all prices of all products and competitors is too time-consuming; hence, the e-shop owners need software support for that task. For the development of such software tools the developers need a profound comprehension of the structure and design of e-shop websites. Existing software tools for Web data extraction are based on the findings of different website analyzes. The existing tools show: The more specific and detailed the analysis and the analyzed websites, the better the data extraction results. This paper presents the results and the derived findings of a deep analysis of 50 different e-shop websites in order to provide new insights for the development and improvement of software tools for product data extraction from e-shop websites.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Compound tag path from the root node of the website’s HTML tree to a particular element of the Web page.
References
Ecommerce Foundation: European B2C E-commerce Report 2015 (Light Version). Report, Ecommerce Foundation (2015)
Civic Consulting: Consumer market study on the functioning of e-commerce and Internet marketing and selling techniques in the retail of goods. Report, Civic Consulting (2011)
PostNord: E-commerce in Europpe 2015. Report, PostNord (2015)
Liu, B., Grossman, R., Zhai, Y.: Mining data records in web pages. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2003), pp. 601–606. ACM, New York (2003)
Grigalis, T.: Towards automatic structured web data extraction system. In: Local Proceedings and Materials of Doctoral Consortium of the Tenth International Baltic Conference on Databases and Information Systems, pp. 197–201, Vilnius (2012). CEUR-WS.org
Grigalis, T., Čenys, A.: Unsupervised structured data extraction from template-generated web pages. J. Univ. Comput. Sci. (J.UCS) 20(2), 169–192 (2011)
Popescu, A.-M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 339–346. Association for Computational Linguistics, Stroudsburg, PA, USA (2005)
Horch, A., Kett, H., Weisbecker, A.: A lightweight approach for extracting product records from the web. In: Proceedings of the 11th International Conference on Web Information Systems and Technologies, pp. 420–430. SciTePress (2015)
Horch, A., Kett, H., Weisbecker, A.: Mining E-commerce data from E-shop websites. In: 2015 IEEE TrustCom/BigDataSE/ISPA, vol. 2, pp. 153–160. IEEE (2015)
Horch, A., Kett, H., Weisbecker, A.: Extracting product unit attributes from product offers by using an ontology. In: Second International Conference on Computer Science, Computer Engineering, and Social Media, CSCESM 2015, pp. 67–71. IEEE (2015)
Horch, A., Kett, H., Weisbecker, A.: Extracting product offers from e-shop websites. In: Monfort, V., Krempels, K.-H., Majchrzak, T.A., Turk, Ž. (eds.) WEBIST 2015. LNBIP, vol. 246, pp. 232–251. Springer, Heidelberg (2016). doi:10.1007/978-3-319-30996-5_12
Horch, A., Kett, H., Weisbecker, A.: Matching product offers of e-shops. In: Cao, H., Li, J., Wang, R. (eds.) PAKDD 2016. LNCS (LNAI), vol. 9794, pp. 248–259. Springer, Heidelberg (2016). doi:10.1007/978-3-319-42996-0_21
Horch, A., Kett, H., Weisbecker, A.: A web application for the collection and comparison of E-shop product data. In: 2016 6th International Workshop on Computer Science and Engineering, WCSE 2016, pp. 167–179. International Workshop on Computer Science and Engineering (WCSE) (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Horch, A., Wohlfrom, A., Weisbecker, A. (2017). An E-Shop Analysis with a Focus on Product Data Extraction. In: Bridge, D., Stuckenschmidt, H. (eds) E-Commerce and Web Technologies. EC-Web 2016. Lecture Notes in Business Information Processing, vol 278. Springer, Cham. https://doi.org/10.1007/978-3-319-53676-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-53676-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53675-0
Online ISBN: 978-3-319-53676-7
eBook Packages: Computer ScienceComputer Science (R0)