Abstract
Internet technologies are constantly evolving as well as the way people use them. Search engines help users to find higher and better relevant results to their searches. Cloud Computing is an evolution of the Internet services and provides a step further ecosystem that can be used to improve the search of more relevant results. Each search engine is based on different modules in order to retrieve the results expected by users using specific keywords. Social networks appear as a reliable Web technology that can directly support a content search. Several studies have been performed showing the growth of social networks in people lives. Using the cloud computing paradigm it is possible to propose a more scalable and efficient way to explore public information available on online social networks. This paper includes the analyses of several social networks services, available contents, cloud-crawlers, and information extraction. In order to collect relevant data from social networks, a social crawler on cloud is proposed. The new approach provides a cloud-based crawler for low-cost, effective, and personalized search models. Moreover, a new algorithm to rank Web documents is proposed and demonstrated. The proposed system is evaluated in comparison with the top Internet search engine, Google, its behavior is very promising, and it is ready for use.
Similar content being viewed by others
References
Batsakis S, Petrakis EGM, Milios E (2009) Improving the performance of focused web crawlers. Data Knowl Eng 68(10):1001–1013
Brin S, Page L (1998) The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the seventh international conference on World Wide Web 7. Elsevier Science Publishers B. V., Brisbane, pp 107–117
Brin S, Page L (1998) The anatomy of a large-scale hypertextual Web search engine. Comput Netw ISDN Syst 30(1-7):107–117
Chen M, Maom S, Liu Y (171–209) Big data: a survey. Mob Netw Appl 19 (2). doi:10.1007/s11036-013-0489-0
Dixit A, Sharma AK (2010) A mathematical model for crawler revisit frequency. In: IEEE 2nd international advance computing conference (IACC), pp 316-319, vol 19
Gori M, Numerico T (2003) Social networks and web minorities. Cogn Syst Res 4(4):355–364
Gupta A, Jindal R (2008) An overview of ranking algorithms for search engines. In: Proceedings of the 2nd national conference; INDIACom-2008, New Delhi
Hashizume K, Fernandez EB, Larrondo-Petrie MM (2012) A pattern for software-as-a-service in clouds. ASE/IEEE Int Conf BioMed Comput (BioMedCom) 0:140–144
Heymann P, Ramage D, Garcia-Molina H (2008) Social tag prediction. In: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval,ACM, Singapore, Singapore, pp 531–538
Israeli A, Feitelson DG (2010) The Linux kernel as a case study in software evolution. J Syst Soft 83(3):485–501
Jason MP (2008) Tagging and searching: search retrieval effectiveness of folksonomies on the World Wide Web. Inf Process Manag 44(4):1562–1579
Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632
Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on world wide web, ACM, Raleigh, North Carolina, USA, pp 591–600
Miao TA -A, Das O’Brien T, Zhen WGZ (2007) A link-based ranking scheme for focused search. In: Proceedings of the 16th international conference on world wide web, Canada
Mishra AA, Kamat C (2011) Article: migration of search engine process into the cloud. Int J Comput Appl 19:19–23. Published by Foundation of Computer Science
Moghe U, Lakkadwala P, Mishra D (2012) Cloud computing: survey of different utilization techniques. In: Software engineering (CONSEG), 2012 CSI sixth international conference on, pp 1-4m, doi:10.1109/CONSEG.2012.6349524
Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: bringing order to the web tech. rep.
Papazoglou M, van den Heuvel W (2011) Blueprinting the cloud. Internet Comput IEEE 15 (6): 74–79. doi:10.1109/MIC.2011.147
Mell P, Grance T (2009) The NIST definition of cloud computing
Selvan MP, Sekar Ac, Dharshini AP (2012) Survey on web page ranking algorithms. Int J Comput Appl 41:1–7
Silverstein C, Marais H, Henzinger M, Moricz M (1999) Analysis of a very large web search engine query log. SIGIR Forum 33(1):6–12
Suakanto S, Supangkat S, Suhardi R, Saragih, Nugraha I (2012) Building crawler engine on cloud computing infrastructure. In: Cloud computing and social networking (ICCCSN), 2012 international conference on, pp 1-5, doi:10.1109/ICCCSN.2012.6215751
Weng J, Lim E-P, Jiang J, He Q (2010) TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the third ACM international conference on Web search and data mining, ACM, New York, New York, USA, pp. 261–270
Wolf JL, Squillante MS, Yu PS, Sethuraman J, Ozsen L (2002) Optimal crawling strategies for web search engines. In: Proceedings of the 11th international conference on World Wide Web, ACM, Honolulu, Hawaii, USA, pp. 136–147
Xin W, Jamaliding Q, Okamoto T (2009) Discovering social network to improve recommender system for group learning support. In: Computational intelligence and software engineering, 2009. CiSE 2009. International conference on, pp 1-4, vol 11
Yan L, Gui Z, Du W, Guo Q (2011) An improved PageRank method based on genetic algorithm for web search. Procedia Eng 15(0):2983–2987
Yu SJ (2012) The dynamic competitive recommendation algorithm in social network . Inf Sci 187:1–14
Acknowledgements
This work has been partially supported by the Instituto de Telecomunicações, Next Generation Networks and Applications Group (NetGNA), Portugal, and by National Funding from the FCT - Fundação para a Ciência e a Tecnologia through the PEst-OE/EEI/LA0008/2013 Project.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
F. Costa, J.E., Rodrigues, J.J.P.C., Simões, T.M.C. et al. Exploring Social Networks and Improving Hypertext Results for Cloud Solutions. Mobile Netw Appl 21, 215–221 (2016). https://doi.org/10.1007/s11036-014-0513-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-014-0513-z