ClusTex: Information Extraction from HTML Pages | IEEE Conference Publication | IEEE Xplore