Extending Spark Analytics through Tika-Based Information Extraction and Retrieval | IEEE Conference Publication | IEEE Xplore