Establishing guidelines on how to improve the Web site content based on the identification of representative pages | IEEE Conference Publication | IEEE Xplore