Obtaining language models of web collections using query-based sampling techniques | IEEE Conference Publication | IEEE Xplore