Skip to main content

Customer Unification in E-Commerce

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8206))

Abstract

Data mining applied to social media is gaining popularity. It is worth noticing that most e-commerce services also cause the formation of small communities not only services oriented toward socializing people. The results of their analysis are easier to implement. Besides, we can expect a better perception of the business by its own users, therefore the analysis of their behavior is justified. In the paper we introduce an algorithm which identifies particular customers among not logged or not registered users of a given e-commerce service. The identification of a customer is based on data that was given so as to accomplish selling procedure. Customers rarely use exactly the same identification data each time. In consequence, it is possible to check if customers create a group of unrelated individuals or if there are symptoms of social behavior.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asur, S., Huberman, B.A., Szabó, G., Wang, C.: Trends in social media: Persistence and decay. CoRR, abs/1102.1402 (2011)

    Google Scholar 

  2. Awadallah, R., Ramanath, M., Weikum, G.: Opinionetit: understanding the opinions-people network for politically controversial topics. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM 2011, pp. 2481–2484. ACM, New York (2011)

    Google Scholar 

  3. Beeferman, D., Berger, A.: Agglomerative clustering of a search engine query log. In: Proceedings of the Sixth ACM SIGKDD on Knowledge Discovery and Data Mining, KDD 2000, pp. 407–416. ACM, New York (2000)

    Chapter  Google Scholar 

  4. Berkhin, P.: Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA (2002)

    Google Scholar 

  5. Gorawski, M.: Architecture of parallel spatial data warehouse: Balancing algorithm and resumption of data extraction. In: Software Engineering: Evolution and Emerging Technologies, pp. 49–59 (2005)

    Google Scholar 

  6. Gorawski, M.: Extended cascaded star schema and ECOLAP operations for spatial data warehouse. In: Corchado, E., Yin, H. (eds.) IDEAL 2009. LNCS, vol. 5788, pp. 251–259. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  7. Gorawski, M.: Multiversion spatio-temporal telemetric data warehouse. In: Grundspenkis, J., Kirikova, M., Manolopoulos, Y., Novickis, L. (eds.) ADBIS 2009. LNCS, vol. 5968, pp. 63–70. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  8. Gorawski, M.: Time complexity of page filling algorithms in materialized aggregate list (mal) and mal/trigg materialization cost. Control and Cybernetics 38(1), 153–172 (2009)

    MATH  Google Scholar 

  9. Gorawski, M., Bańkowski, S., Gorawski, M.: Selection of structures with grid optimization, in multiagent data warehouse. In: Fyfe, C., Tino, P., Charles, D., Garcia-Osorio, C., Yin, H. (eds.) IDEAL 2010. LNCS, vol. 6283, pp. 292–299. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  10. Gorawski, M., Bularz, J.: Protecting private information by data separation in distributed spatial data warehouse. In: ARES, pp. 837–844 (2007)

    Google Scholar 

  11. Gorawski, M., Chechelski, R.: Online balancing of aR-tree indexed distributed spatial data warehouse. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds.) PPAM 2005. LNCS, vol. 3911, pp. 470–477. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  12. Gorawski, M., Chrószcz, A., Gorawska, A.: User Identity Unification in e-Commerce. In: Swiątek, J., Grzech, A., Swiątek, P., Tomczak, J.M. (eds.) Advances in Systems Science. AISC, vol. 240, pp. 163–172. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  13. Gorawski, M., Chrószcz, A.: Query processing using negative and temporal tuples in stream query engines. In: Szmuc, T., Szpyrka, M., Zendulka, J. (eds.) CEE-SET 2009. LNCS, vol. 7054, pp. 70–83. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  14. Gorawski, M., Chrószcz, A.: Streamapas: Query language and data model. In: CISIS, pp. 75–82 (2009)

    Google Scholar 

  15. Gorawski, M., Chrószcz, A.: Optimization of operator partitions in stream data warehouse. In: Proceedings of the ACM 14th International Workshop on Data Warehousing and OLAP, DOLAP 2011, pp. 61–66. ACM, New York (2011)

    Chapter  Google Scholar 

  16. Gorawski, M., Dowlaszewicz, K.: Algorithms for efficient top-k spatial preference query execution in a heterogeneous distributed environment. In: ICEIS (1), pp. 43–48 (2009)

    Google Scholar 

  17. Gorawski, M., Dyga, A.: Indexing of spatio-temporal telemetric data based on distributed mobile bucket index. In: Parallel and Distributed Computing and Networks, pp. 292–297 (2006)

    Google Scholar 

  18. Gorawski, M., Dyga, A.: Multi-dimensional dynamic bucket index based on mobile agent system architecture. In: Bressan, S., Küng, J., Wagner, R. (eds.) DEXA 2006. LNCS, vol. 4080, pp. 924–934. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  19. Gorawski, M., Dyga, A.: Indexing of spatio-temporal telemetric data based on adaptive multi-dimensional bucket index. Fundam. Inform. 90(1-2), 73–86 (2009)

    MATH  Google Scholar 

  20. Gorawski, M., Faruga, M.: Stah-tree: Hybrid index for spatio temporal aggregation. In: ICEIS (1), pp. 113–118 (2007)

    Google Scholar 

  21. Gorawski, M., Gebczyk, W.: Distributed approach of continuous queries with knn join processing in spatial data warehouse. In: ICEIS (1), pp. 131–136 (2007)

    Google Scholar 

  22. Gorawski, M., Gorawski, J.I.M.: Algorytm adaptacyjnego balansowania obcienia zapyta w rozproszonych przestrzenno-temporalnych hurtowniach danych. Studia Informatica 32(2A), 75–88 (2011)

    Google Scholar 

  23. Gorawski, M., Gorawski, M.: Balanced spatio-temporal data warehouse with r-mvb, stcat and bitmap indexes. In: PARELEC, pp. 43–48 (2006)

    Google Scholar 

  24. Gorawski, M., Gorawski, M.: Modified R-MVB tree and BTV algorithm used in a distributed spatio-temporal data warehouse. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 199–208. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  25. Gorawski, M., Jureczek, P.: Continuous pattern mining using the fCPGrowth algorithm in trajectory data warehouses. In: Graña Romay, M., Corchado, E., Garcia Sebastian, M.T. (eds.) HAIS 2010, Part I. LNCS, vol. 6076, pp. 187–195. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  26. Gorawski, M., Jureczek, P.: Regions of interest in trajectory data warehouse. In: Nguyen, N.T., Le, M.T., Świątek, J. (eds.) ACIIDS 2010. LNCS, vol. 5990, pp. 74–81. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  27. Gorawski, M., Jureczek, P.: Extensions for continuous pattern mining. In: Yin, H., Wang, W., Rayward-Smith, V. (eds.) IDEAL 2011. LNCS, vol. 6936, pp. 194–203. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  28. Gorawski, M., Kaminski, M.: On-line balancing of horizontally-range-partitioned data in distributed spatial telemetric data warehouse. In: DEXA Workshops, pp. 273–280 (2006)

    Google Scholar 

  29. Gorawski, M., Lis, D.: Architektura cuda w bezopnieniowych hurtowniach danych. Studia Informatica 32(2A), 157–167 (2011)

    Google Scholar 

  30. Gorawski, M., Lorek, M., Gorawska, A.: Cuda powered user-defined types and aggregates. In: AINA Workshops, pp. 1423–1428 (2013)

    Google Scholar 

  31. Gorawski, M., Lorek, M., Gorawski, M.: Encrypted adaptive storage model – analysis and performance tests. In: Fischer-Hübner, S., Katsikas, S., Quirchmayr, G. (eds.) TrustBus 2012. LNCS, vol. 7449, pp. 118–128. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  32. Gorawski, M., Malczok, R.: Multi-thread processing of long aggregates lists. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds.) PPAM 2005. LNCS, vol. 3911, pp. 59–66. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  33. Gorawski, M., Malczok, R.: AEC algorithm: A heuristic approach to calculating density-based clustering eps parameter. In: Yakhno, T., Neuhold, E.J. (eds.) ADVIS 2006. LNCS, vol. 4243, pp. 90–99. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  34. Gorawski, M., Malczok, R.: Calculation of density-based clustering parameters supported with distributed processing. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2006. LNCS, vol. 4081, pp. 417–426. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  35. Gorawski, M., Malczok, R.: Materialized ar-tree in distributed spatial data warehouse. Intell. Data Anal. 10(4), 361–377 (2006)

    Google Scholar 

  36. Gorawski, M., Malczok, R.: Towards automatic eps calculation in density-based clustering. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds.) ADBIS 2006. LNCS, vol. 4152, pp. 313–328. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  37. Gorawski, M., Malczok, R.: Towards stream data parallel processing in spatial aggregating index. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 209–218. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  38. Gorawski, M., Malczok, R.: Answering range-aggregate queries over objects generating data streams. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 5982, pp. 436–439. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  39. Gorawski, M., Malczok, R.: Cam2s: An integrated indexing structure for spatial objects generating data streams. In: CISIS (2010)

    Google Scholar 

  40. Gorawski, M., Malczok, R.: Indexing spatial objects in stream data warehouse. In: Nguyen, N.T., Katarzyniak, R., Chen, S.-M. (eds.) Advances in Intelligent Information and Database Systems. SCI, vol. 283, pp. 53–65. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  41. Gorawski, M., Marks, P.: Resumption of data extraction process in parallel data warehouses. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds.) PPAM 2005. LNCS, vol. 3911, pp. 478–485. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  42. Gorawski, M., Marks, P.: Checkpoint-based resumption in data warehouses. In: Sacha, K. (ed.) Software Engineering Techniques: Design for Quality. IFIP, vol. 227, pp. 313–323. Springer, Boston (2006)

    Chapter  Google Scholar 

  43. Gorawski, M., Marks, P.: Fault-tolerant distributed stream processing system. In: DEXA Workshops, pp. 395–399 (2006)

    Google Scholar 

  44. Gorawski, M., Marks, P.: Distributed stream processing analysis in high availability context. In: ARES, pp. 61–68 (2007)

    Google Scholar 

  45. Gorawski, M., Marks, P.: Towards reliability and fault-tolerance of distributed stream processing system. In: DepCoS-RELCOMEX, pp. 246–253 (2007)

    Google Scholar 

  46. Gorawski, M., Marks, P.: Towards automated analysis of connections network in distributed stream processing system. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds.) DASFAA 2008. LNCS, vol. 4947, pp. 670–677. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  47. Gorawski, M., Marks, P., Gorawski, M.: Collecting data streams from a distributed radio-based measurement system. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds.) DASFAA 2008. LNCS, vol. 4947, pp. 702–705. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  48. Gorawski, M., Morzy, T., Wrembel, R., Zgrzywa, A.: Advanced data proceedings and analysis techniques. Control and Cybernetics 40, 581–583 (2012)

    Google Scholar 

  49. Gorawski, M., Panfil, S.: A system of privacy preserving distributed spatial data warehouse using relation decomposition. In: ARES, pp. 522–527 (2009)

    Google Scholar 

  50. Gorawski, M., Pasterak, K.: Agkpstream a operatory strumieniowe. Studia Informatica 33(2A), 181–195 (2012)

    Google Scholar 

  51. Gorawski, M., Pasterak, K.: Schedulery strumieniowe w agkpstream. Studia Informatica 33(2A), 197–210 (2012)

    Google Scholar 

  52. Gorawski, M., Pluciennik, E.: Distributed data mining by means of SQL enhancement. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM-WS 2008. LNCS, vol. 5333, pp. 34–35. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  53. Gorawski, M., Płuciennik-Psota, E.: Distributed data mining methodology with classification model example. In: Nguyen, N.T., Kowalczyk, R., Chen, S.-M. (eds.) ICCCI 2009. LNCS, vol. 5796, pp. 107–117. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  54. Gorawski, M., Siedlecki, Z.: Implementation, optimization and performance tests of privacy preserving mechanisms in homogeneous collaborative association rules mining. In: Meersman, R., et al. (eds.) OTM 2011, Part I. LNCS, vol. 7044, pp. 347–366. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  55. Gorawski, M., Siedlecki, Z.: Optimization of privacy preserving mechanisms in homogeneous collaborative association rules mining. In: ARES, pp. 347–352 (2011)

    Google Scholar 

  56. Gorawski, M., Stachurski, K.: On efficiency and data privacy level of association rules mining algorithms within parallel spatial data warehouse. In: ARES, pp. 936–943 (2006)

    Google Scholar 

  57. Guha, S., Rastogi, R., Shim, K.: Cure: an efficient clustering algorithm for large databases. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, SIGMOD 1998, pp. 73–84. ACM, New York (1998)

    Chapter  Google Scholar 

  58. Han, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers Inc., San Francisco (2005)

    Google Scholar 

  59. Heller, K.A., Ghahramani, Z.: Bayesian hierarchical clustering

    Google Scholar 

  60. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)

    Article  Google Scholar 

  61. Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream

    Google Scholar 

  62. McCallum, A., Nigam, K., Ungar, L.H.: Efficient clustering of high-dimensional data sets with application to reference matching. In: Proceedings of the Sixth ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2000, pp. 169–178. ACM, New York (2000)

    Chapter  Google Scholar 

  63. Olson, C.F.: Parallel algorithms for hierarchical clustering. Parallel Computing 21, 1313–1325 (1993)

    Article  MathSciNet  Google Scholar 

  64. Walter, B., Bala, K., Kulkarni, M., Pingali, K.: Fast agglomerative clustering for rendering. In: IEEE Symposium on Interactive Ray Tracing (RT), pp. 81–86 (August 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gorawski, M., Chrószcz, A., Gorawska, A. (2013). Customer Unification in E-Commerce. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2013. IDEAL 2013. Lecture Notes in Computer Science, vol 8206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41278-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41278-3_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41277-6

  • Online ISBN: 978-3-642-41278-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics