Abstract
In this work, we focus on understanding what happens under the hood of HTTP traffic. We proposed a systematic approach to reconstruct users click-through patterns in web starting from raw network data. We validated the proposed method using a synthesized network traffic data set and a real-world 4G network traffic data set. The precision calculated by the proposed model is 93.7 % and the recall is 91.6 %. The experiment results on the real-world 4G network traffic data show that the click-through stream of 4G network is more deeper than that of 3G network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Erman, J., Gerber, A., Sen, S.: HTTP in the home: it Is not just about PCs. In: ACM SIGCOMM (2011)
Ihm, S., Pai, V.: Towards understanding modern web traffic. In: ACM IMC (2011)
Butkiewicz, M., Madhyastha, H., Sekar, V.: Understanding website complexity: measurements, metrics, and implications. In: ACM IMC (2011)
Maier, G., Feldmann, A., Paxson, V., Allman, M.: On dominant characteristics of residential broadband internet traffic. In: Proceedings of the Internet Measurement Conference, Chicago, Illinois (2009)
Broder, A., et al.: Graph structure in the web. Comput. Netw. 33(1), 309–320 (2000)
Gill, P., Arlitt, M., Carlsson, N., Mahanti, A., Williamson, C.: Characterizing organizational use of web-based services: methodology, challenges, observations, and insights. ACM TWEB 5(4), 1–23 (2011)
Schneider, F., Feldmann, A., Krishnamurthy, B., Willinger, W.: Understanding online social network usage from a network perspective. In: Proceedings of the IMC (2009)
Nazir, A., Raza, S., Gupta, D., Chuah, C.-N., Krishnamurthy, B.: Network level footprints of facebook applications. In: Proceedings of the IMC (2009)
Schneider, F., Agarwal, S., Alpcan, T., Feldmann, A.: The new web: characterizing AJAX traffic. In: Claypool, M., Uhlig, S. (eds.) PAM 2008. LNCS, vol. 4979, pp. 31–40. Springer, Heidelberg (2008)
Nah, F.: A study on tolerable waiting yime: how long are web users willing to wait? Behav. Inf. Technol. 23(3), 153–163 (2004)
Labovitz, C., Lekel-Johnson, S., Oberheide, J., Jahanian, F.: Internet inter-domain traffic. In: ACM SIGCOMM (2010)
Belson, D.: Akamai state of the internet report. SIGOPS Oper. Syst. Rev. 44(3), 27–C37 (2010)
Crescenzi, V, Mecca, G., Merialdo, P.: RoadRunner: towards automatic data extraction from large web sites. In: VLDB, pp. 109–118 (2001)
Fetterly, D., Manasse, M., Najork, M., et al.: A large-scale study of the evolution of web pages. Softw. Pract. Experience Spec. Issue Web Technol. 34(2), 213–237 (2004)
Acknowledgment
This work was supported in part by China Postdoctoral Science Foundation (Grant No. 2014M562223), Shenzhen Basic Research Project (Grant No. JCYJ20140610151856729), and Natural Science Foundation of Guangdong Province (Grant No. 2014A030310154).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, S., Shen, Y., Hu, J., Xuan, Z., Lu, Z. (2015). Hadoop-Based Analysis for Large-Scale Click-Through Patterns in 4G Network. In: Xu, K., Zhu, H. (eds) Wireless Algorithms, Systems, and Applications. WASA 2015. Lecture Notes in Computer Science(), vol 9204. Springer, Cham. https://doi.org/10.1007/978-3-319-21837-3_81
Download citation
DOI: https://doi.org/10.1007/978-3-319-21837-3_81
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21836-6
Online ISBN: 978-3-319-21837-3
eBook Packages: Computer ScienceComputer Science (R0)