Skip to main content

An E-Shop Analysis with a Focus on Product Data Extraction

  • Conference paper
  • First Online:
E-Commerce and Web Technologies (EC-Web 2016)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 278))

Included in the following conference series:

  • 1317 Accesses

Abstract

E-commerce is a constantly growing and competitive market. Online prices are updated daily or even more frequently, and it is very important for e-shoppers to find the lowest price online. Therefore, e-shop owners need to know the prices of their competitors and must be able to adjust their own prices in order to remain competitive. The manual monitoring of all prices of all products and competitors is too time-consuming; hence, the e-shop owners need software support for that task. For the development of such software tools the developers need a profound comprehension of the structure and design of e-shop websites. Existing software tools for Web data extraction are based on the findings of different website analyzes. The existing tools show: The more specific and detailed the analysis and the analyzed websites, the better the data extraction results. This paper presents the results and the derived findings of a deep analysis of 50 different e-shop websites in order to provide new insights for the development and improvement of software tools for product data extraction from e-shop websites.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Compound tag path from the root node of the website’s HTML tree to a particular element of the Web page.

References

  1. Ecommerce Foundation: European B2C E-commerce Report 2015 (Light Version). Report, Ecommerce Foundation (2015)

    Google Scholar 

  2. Civic Consulting: Consumer market study on the functioning of e-commerce and Internet marketing and selling techniques in the retail of goods. Report, Civic Consulting (2011)

    Google Scholar 

  3. PostNord: E-commerce in Europpe 2015. Report, PostNord (2015)

    Google Scholar 

  4. Liu, B., Grossman, R., Zhai, Y.: Mining data records in web pages. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2003), pp. 601–606. ACM, New York (2003)

    Google Scholar 

  5. Grigalis, T.: Towards automatic structured web data extraction system. In: Local Proceedings and Materials of Doctoral Consortium of the Tenth International Baltic Conference on Databases and Information Systems, pp. 197–201, Vilnius (2012). CEUR-WS.org

  6. Grigalis, T., Čenys, A.: Unsupervised structured data extraction from template-generated web pages. J. Univ. Comput. Sci. (J.UCS) 20(2), 169–192 (2011)

    Google Scholar 

  7. Popescu, A.-M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 339–346. Association for Computational Linguistics, Stroudsburg, PA, USA (2005)

    Google Scholar 

  8. Horch, A., Kett, H., Weisbecker, A.: A lightweight approach for extracting product records from the web. In: Proceedings of the 11th International Conference on Web Information Systems and Technologies, pp. 420–430. SciTePress (2015)

    Google Scholar 

  9. Horch, A., Kett, H., Weisbecker, A.: Mining E-commerce data from E-shop websites. In: 2015 IEEE TrustCom/BigDataSE/ISPA, vol. 2, pp. 153–160. IEEE (2015)

    Google Scholar 

  10. Horch, A., Kett, H., Weisbecker, A.: Extracting product unit attributes from product offers by using an ontology. In: Second International Conference on Computer Science, Computer Engineering, and Social Media, CSCESM 2015, pp. 67–71. IEEE (2015)

    Google Scholar 

  11. Horch, A., Kett, H., Weisbecker, A.: Extracting product offers from e-shop websites. In: Monfort, V., Krempels, K.-H., Majchrzak, T.A., Turk, Ž. (eds.) WEBIST 2015. LNBIP, vol. 246, pp. 232–251. Springer, Heidelberg (2016). doi:10.1007/978-3-319-30996-5_12

    Chapter  Google Scholar 

  12. Horch, A., Kett, H., Weisbecker, A.: Matching product offers of e-shops. In: Cao, H., Li, J., Wang, R. (eds.) PAKDD 2016. LNCS (LNAI), vol. 9794, pp. 248–259. Springer, Heidelberg (2016). doi:10.1007/978-3-319-42996-0_21

    Chapter  Google Scholar 

  13. Horch, A., Kett, H., Weisbecker, A.: A web application for the collection and comparison of E-shop product data. In: 2016 6th International Workshop on Computer Science and Engineering, WCSE 2016, pp. 167–179. International Workshop on Computer Science and Engineering (WCSE) (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrea Horch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Horch, A., Wohlfrom, A., Weisbecker, A. (2017). An E-Shop Analysis with a Focus on Product Data Extraction. In: Bridge, D., Stuckenschmidt, H. (eds) E-Commerce and Web Technologies. EC-Web 2016. Lecture Notes in Business Information Processing, vol 278. Springer, Cham. https://doi.org/10.1007/978-3-319-53676-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-53676-7_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53675-0

  • Online ISBN: 978-3-319-53676-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics