Abstract
In big data, the information retrieval (IR) embraces the discovery of documents from a compilation of dataset which are related to the user query. Usually, the data retrieval systems are used to scan the data. The existent approaches that employ customary IR are wearisome for big document collections. Recently, IR approaches were developed, albeit these are faster comparing to the traditional method but the quality of the document retrieved is less. To overcome such difficulties, here, effectual big data retrieval utilizing Deep Learning Modified Neural Networks (DLMNN) is proposed. Initially, the general pre-processing along with feature extraction steps are taken place. In preprocessing stage, unwanted data are removed and also used for changing the unstructured data in to structured data then in FE is done using frequency and entropy calculation of the given input data. Secondly, find the closed recurrent item dataset, after that find the weight of provided data using entropy measure and frequent item measure. In the 3rd step, the documents are clustered utilizing the k-means algorithm and then classified using DLMNN. The K-Means algorithm is utilized to partition the collection of documents into several clusters then the DLMNN is used for classifying the documents into positive and negative classes. The proposed DLNN weight parameters are optimized utilizing the cuckoo search (CS) optimization algorithm. The last stage on the training process section is generating a training data-base. In the other part, the retrieval process is performed, in this section it pre-processes the user query and discovers the frequency item set then gets retrieval data. Finally, check the similarity assessment, if the information is found then it is visualized, otherwise the document is returned to the initial position. Experimental results contrasted with the previous MRT and IRI-RAS techniques concerning precision, recall, F-measure along with computation time. The proposed document IR is better when comparing with existent methods.
Similar content being viewed by others
References
Irfan S, Babu BV (2016) Information retrieval in big data using evolutionary computation: A survey. In: Computing, Communication and Automation (ICCCA), International Conference on, pp. 208-213, IEEE
Zhao F, Zhu Y, Jin H, Yang LT (2016) A personalized hashtag recommendation approach using LDA-based topic model in microblog environment. Futur Gener Comput Syst 65:196–206. https://www.sciencedirect.com/science/article/pii/S0167739X15003258
DineshMavaluru RS, Sugumaran V (2014) Big data analytics in information retrieval: promise and potential. In: Proceedings of 08th IRF International Conference. Bengaluru, pp. 41-46
Kodabagi MM, Sarashetti D, Naik V (2016) A Text Information Retrieval Technique for Big Data Using Map Reduce. Bonfring International Journal of Software Engineering and Soft Computing 6:22–26
Cuzzocrea A, Lee W, Leung CK (2015) High-recall information retrieval from linked big data. In: Computer Software and Applications Conference (COMPSAC), IEEE 39th Annual, Vol. 2, pp. 712-717, IEEE
Chiranjeevi HS, Shenoy M, Prabhu S, Sundhar S (2016) DSSM with text hashing technique for text document retrieval in next-generation search engine for big data and data analytics. In: Engineering and Technology (ICETECH), IEEE International Conference on, pp. 395-399, IEEE
Portilla Herrera NA, López Gomez F, Bucheli VA, SolartePabón O (2017) Semantic annotation and retrieval of scientific documents in a big data environment. IET digital library 7th Latin American Conference on Networked and Electronic Media, pp. 33-38
Ketu S, Agarwal S (2015) Performance enhancement of distributed K-Means clustering for big Data analytics through in-memory computation. In: Contemporary Computing (IC3), Eighth International Conference on, pp. 318-324, IEEE
Chen C, Zhu X, Shen P, Hu J (2014) A hierarchical clustering method for big data oriented ciphertext search. In: Computer Communications Workshops (INFOCOM WKSHPS), IEEE Conference on, pp. 559-564, IEEE
Wang Y, Liu J, Huang Y, Feng X (2016) Using hashtag graph-based topic model to connect semantically-related words without co-occurrence in microblogs. IEEE Trans Knowl Data Eng 28(7):1919–1933
Caballero I, Serrano M, Piattini M (2014) A data quality in use model for big data. In: International Conference on Conceptual Modeling, pp. 65-74. Springer, Cham
Sakr NA, ELdesouky AI, Arafat H (2016) An efficient fast-response content-based image retrieval framework for big data. Comput Electr Eng 54:522–538
Li Y, Zhang Y, Huang X, Zhu H, Ma J (2018) Large-scale remote sensing image retrieval by deep hashing neural networks. IEEE Trans Geosci Remote Sens 56(2):950–965
Liu L, Yu M, Shao L (2017) Learning short binary codes for large-scale image retrieval. IEEE Trans Image Process 26(3):1289–1299
Prasanth T, Gunasekaran M (2017) A mutual refinement technique for big data retrieval using hash tag graph. Cluster Computing, pp. 1-11
BegümDemir, and Lorenzo Bruzzone (2016) Hashing-based scalable remote sensing image search and retrieval in large archives. IEEE Trans Geosci Remote Sens 54(2):892–904
Kehua G, Liang Z, Tang Y, Chi T (2018) SOR: An optimized semantic ontology retrieval algorithm for heterogeneous multimedia big data. J Comput Sci 28:455–465
Zou F, Tang X, Li K, Wang Y, Song J, Yang S, Ling H (2018) Hidden semantic hashing for fast retrieval over large scale document collection. Multimedia Tools and Applications 77(3):3677–3697
Joshi AS, Kulkarni O, Kakandikar GM, Nandedkar VM (2017) Cuckoo Search Optimization-A Review. Materials Today: Proceedings 4(8):7262–7269
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Prasanth, T., Gunasekaran, M. Effective Big Data Retrieval Using Deep Learning Modified Neural Networks. Mobile Netw Appl 24, 282–294 (2019). https://doi.org/10.1007/s11036-018-1204-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-018-1204-y