Abstract
The study of the similarity between web applications has extended alongside with the informational explosion resulted from the fast communication means through Internet. The copyright of web applications is difficult to be appreciated in this domain and this is the reason for the development of novel web technologies and mechanisms of measuring the similarity between two webpages. In this paper, we will present a modality of measurement of the similarity degree between two webpages regarding the HTML tag-based webpages. The degree of similarity will be determined approximately, being dependent of the webpages used from the both websites and the tags set used in the comparison of the webpages. The selection of webpages in order to determine the degree of similarity between two webpages will be made using genetic algorithms. In the final part of the paper there are presented the results obtained with the implementation of the algorithm presented in the paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Okundaye, B., Ewert, S., Sanders, I.: Determining image similarity from pattern matching of abstract syntax trees of tree picture grammars. PRASA Johannesburg, pp. 83–90 (2013)
Caldas, L.G., Norford, L.K.: A design optimization tool based on a genetic algorithm. In: Automation in Construction, ACADIA 1999, vol. 11, no. 2, pp. 173–184 (2002)
Darby, S., Mortimer-Jones, T.V., Johnston, R.L., Roberts, C.: Theoretical study of Cu–Au nanoalloy clusters using a genetic algorithm. J. Chem. Phys. 116(4), 1536 (2002)
Kim, H.-S., Cho, S.-B.: Application of interactive genetic algorithm to fashion design. Eng. Appl. Artif. Intell. 13(6), 635–644 (2000)
Remani, N.V.J.M., Rachakonda, S.R., Kurra, R.S.R.: Similarity of inference face matching on angle oriented face recognition. J. Inf. Eng. Appl. 1(1) (2011)
Popescu, D.A., Maria, D.C.: Similarity measurement of web sites using sink web pages. In: 34th International Conference on Telecommunications and Signal Processing, TSP 2011, 18–20 August, Budapest, Hungary, pp. 24–26. IEEE Xplore (2011)
Popescu, D.A., Nicolae, D.: Determining the similarity of two web applications using the edit distance. In: 6th International Workshop on Soft Computing Applications, 24–26 July 2014, Timisoara, Romania, 6th IEEE SOFA. LNCS, pp. 681–690 (2014)
Popescu, D.A., Dan, R.: Approximately similarity measurement of web sites. In: ICONIP, Neural Information Processing. Proceedings LNCS. Springer, 9–12 November 2015
Rahmani, S., Mousavi, S.M., Kamali, M.J.: Modeling of road-traffic noise with the use of genetic algorithm. Appl. Soft Comput. 11(1), 1008–1013 (2011)
Dietterich, T.G.: An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Mach. Learn. 40(2), 139–157 (2000)
Cormen, T.H., Leiserson, C.E., Rivest, R.R.: Introduction to Algorithms. MIT Press, Cambridge (1990)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Popescu, D.A., Domșa, O., Bold, N. (2018). About the Applications of the Similarity of Websites Regarding HTML-Based Webpages. In: Balas, V., Jain, L., Balas, M. (eds) Soft Computing Applications. SOFA 2016. Advances in Intelligent Systems and Computing, vol 633. Springer, Cham. https://doi.org/10.1007/978-3-319-62521-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-62521-8_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62520-1
Online ISBN: 978-3-319-62521-8
eBook Packages: EngineeringEngineering (R0)