Skip to main content
Log in

Effective Big Data Retrieval Using Deep Learning Modified Neural Networks

  • Published:
Mobile Networks and Applications Aims and scope Submit manuscript

Abstract

In big data, the information retrieval (IR) embraces the discovery of documents from a compilation of dataset which are related to the user query. Usually, the data retrieval systems are used to scan the data. The existent approaches that employ customary IR are wearisome for big document collections. Recently, IR approaches were developed, albeit these are faster comparing to the traditional method but the quality of the document retrieved is less. To overcome such difficulties, here, effectual big data retrieval utilizing Deep Learning Modified Neural Networks (DLMNN) is proposed. Initially, the general pre-processing along with feature extraction steps are taken place. In preprocessing stage, unwanted data are removed and also used for changing the unstructured data in to structured data then in FE is done using frequency and entropy calculation of the given input data. Secondly, find the closed recurrent item dataset, after that find the weight of provided data using entropy measure and frequent item measure. In the 3rd step, the documents are clustered utilizing the k-means algorithm and then classified using DLMNN. The K-Means algorithm is utilized to partition the collection of documents into several clusters then the DLMNN is used for classifying the documents into positive and negative classes. The proposed DLNN weight parameters are optimized utilizing the cuckoo search (CS) optimization algorithm. The last stage on the training process section is generating a training data-base. In the other part, the retrieval process is performed, in this section it pre-processes the user query and discovers the frequency item set then gets retrieval data. Finally, check the similarity assessment, if the information is found then it is visualized, otherwise the document is returned to the initial position. Experimental results contrasted with the previous MRT and IRI-RAS techniques concerning precision, recall, F-measure along with computation time. The proposed document IR is better when comparing with existent methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Irfan S, Babu BV (2016) Information retrieval in big data using evolutionary computation: A survey. In: Computing, Communication and Automation (ICCCA), International Conference on, pp. 208-213, IEEE

  2. Zhao F, Zhu Y, Jin H, Yang LT (2016) A personalized hashtag recommendation approach using LDA-based topic model in microblog environment. Futur Gener Comput Syst 65:196–206. https://www.sciencedirect.com/science/article/pii/S0167739X15003258

  3. DineshMavaluru RS, Sugumaran V (2014) Big data analytics in information retrieval: promise and potential. In: Proceedings of 08th IRF International Conference. Bengaluru, pp. 41-46

  4. Kodabagi MM, Sarashetti D, Naik V (2016) A Text Information Retrieval Technique for Big Data Using Map Reduce. Bonfring International Journal of Software Engineering and Soft Computing 6:22–26

    Article  Google Scholar 

  5. Cuzzocrea A, Lee W, Leung CK (2015) High-recall information retrieval from linked big data. In: Computer Software and Applications Conference (COMPSAC), IEEE 39th Annual, Vol. 2, pp. 712-717, IEEE

  6. Chiranjeevi HS, Shenoy M, Prabhu S, Sundhar S (2016) DSSM with text hashing technique for text document retrieval in next-generation search engine for big data and data analytics. In: Engineering and Technology (ICETECH), IEEE International Conference on, pp. 395-399, IEEE

  7. Portilla Herrera NA, López Gomez F, Bucheli VA, SolartePabón O (2017) Semantic annotation and retrieval of scientific documents in a big data environment. IET digital library 7th Latin American Conference on Networked and Electronic Media, pp. 33-38

  8. Ketu S, Agarwal S (2015) Performance enhancement of distributed K-Means clustering for big Data analytics through in-memory computation. In: Contemporary Computing (IC3), Eighth International Conference on, pp. 318-324, IEEE

  9. Chen C, Zhu X, Shen P, Hu J (2014) A hierarchical clustering method for big data oriented ciphertext search. In: Computer Communications Workshops (INFOCOM WKSHPS), IEEE Conference on, pp. 559-564, IEEE

  10. Wang Y, Liu J, Huang Y, Feng X (2016) Using hashtag graph-based topic model to connect semantically-related words without co-occurrence in microblogs. IEEE Trans Knowl Data Eng 28(7):1919–1933

    Article  Google Scholar 

  11. Caballero I, Serrano M, Piattini M (2014) A data quality in use model for big data. In: International Conference on Conceptual Modeling, pp. 65-74. Springer, Cham

  12. Sakr NA, ELdesouky AI, Arafat H (2016) An efficient fast-response content-based image retrieval framework for big data. Comput Electr Eng 54:522–538

    Article  Google Scholar 

  13. Li Y, Zhang Y, Huang X, Zhu H, Ma J (2018) Large-scale remote sensing image retrieval by deep hashing neural networks. IEEE Trans Geosci Remote Sens 56(2):950–965

    Article  Google Scholar 

  14. Liu L, Yu M, Shao L (2017) Learning short binary codes for large-scale image retrieval. IEEE Trans Image Process 26(3):1289–1299

    Article  MathSciNet  Google Scholar 

  15. Prasanth T, Gunasekaran M (2017) A mutual refinement technique for big data retrieval using hash tag graph. Cluster Computing, pp. 1-11

  16. BegümDemir, and Lorenzo Bruzzone (2016) Hashing-based scalable remote sensing image search and retrieval in large archives. IEEE Trans Geosci Remote Sens 54(2):892–904

    Article  Google Scholar 

  17. Kehua G, Liang Z, Tang Y, Chi T (2018) SOR: An optimized semantic ontology retrieval algorithm for heterogeneous multimedia big data. J Comput Sci 28:455–465

    Article  MathSciNet  Google Scholar 

  18. Zou F, Tang X, Li K, Wang Y, Song J, Yang S, Ling H (2018) Hidden semantic hashing for fast retrieval over large scale document collection. Multimedia Tools and Applications 77(3):3677–3697

    Article  Google Scholar 

  19. Joshi AS, Kulkarni O, Kakandikar GM, Nandedkar VM (2017) Cuckoo Search Optimization-A Review. Materials Today: Proceedings 4(8):7262–7269

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to T. Prasanth.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Prasanth, T., Gunasekaran, M. Effective Big Data Retrieval Using Deep Learning Modified Neural Networks. Mobile Netw Appl 24, 282–294 (2019). https://doi.org/10.1007/s11036-018-1204-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11036-018-1204-y

Keywords

Navigation