Extracting news content with visual unit of web pages | IEEE Conference Publication | IEEE Xplore