Data extraction from Web forums based on similarity of page layout | IEEE Conference Publication | IEEE Xplore