Abstract
Processing and analyzing very large volumes of images requires efficient representation and structuring techniques. Since current computing machines provide large memory capacity, it is preferable to trade a reasonable amount of memory for an efficient and parallelizable representation of images. In this paper, we propose a new structure for representing and storing images based on the in-memory graph concept. Our method of computation provides a faster execution time than the traditional array-based representation. Each pixel of an image is represented as one node in the graph, and each node has pointers to its neighboring nodes (pixels). The structure represents an image as a grid of connected linked lists, and each grid is connected to other grids. Using our method, an image can be stored in one of three representations: octal linked list, quadratic linked list, or dual linked list. We provide experiments and evaluations using the dual linked list representation, as it requires less memory space. We apply our methodology to medical images as a proof of concept, finding a region of interest in an image. We collected real medical images and used them to build and process the graph, which we call MedGraph. Our experimental results show that the proposed MedGraph technique reduces the search time for finding a region of interest compared to the traditional representation. MedGraph is a generic representation strategy that can be applied to any type of image, although this paper uses medical images as a proof of concept.
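To make the pixel-as-node idea concrete, the following is a minimal sketch in Python. It is not the authors' implementation. The abstract does not say which two neighbors the dual linked list representation links, so the sketch assumes each node points to the pixel on its right and the pixel below it. The names `PixelNode`, `build_dual_linked_grid`, and `find_first_at_least` are hypothetical, and the threshold search is only a simple stand-in for a region-of-interest query.

```python
from typing import List, Optional, Tuple


class PixelNode:
    """One pixel as a graph node (hypothetical name, not from the paper)."""
    __slots__ = ("value", "right", "down")

    def __init__(self, value: int) -> None:
        self.value = value
        # Assumption: the two links of the "dual" variant are right and down.
        self.right: Optional["PixelNode"] = None
        self.down: Optional["PixelNode"] = None


def build_dual_linked_grid(pixels: List[List[int]]) -> Optional[PixelNode]:
    """Link a rectangular 2-D list of intensities; return the top-left node."""
    if not pixels or not pixels[0]:
        return None
    nodes = [[PixelNode(v) for v in row] for row in pixels]
    for r, row in enumerate(nodes):
        for c, node in enumerate(row):
            if c + 1 < len(row):
                node.right = row[c + 1]      # next pixel in the same row
            if r + 1 < len(nodes):
                node.down = nodes[r + 1][c]  # pixel directly below
    return nodes[0][0]


def find_first_at_least(head: Optional[PixelNode],
                        threshold: int) -> Optional[Tuple[int, int]]:
    """Follow the links row by row; return (row, col) of the first pixel
    whose value is >= threshold, or None if there is no such pixel."""
    r, row_head = 0, head
    while row_head is not None:
        c, node = 0, row_head
        while node is not None:
            if node.value >= threshold:
                return (r, c)
            node, c = node.right, c + 1
        row_head, r = row_head.down, r + 1
    return None
```

Under the same assumptions, a quadratic or octal variant would add `left`/`up` or diagonal pointers to each node, which costs more memory per pixel in exchange for traversal in more directions.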
Acknowledgments
The authors would like to thank the Deanship of Research at the Jordan University of Science and Technology for funding this work, grant number 20150050.
Cite this article
Jarrah, M., Al-Quraan, M., Jararweh, Y. et al. MedGraph: a graph-based representation and computation to handle large sets of images. Multimed Tools Appl 76, 2769–2785 (2017). https://doi.org/10.1007/s11042-016-3262-0
DOI: https://doi.org/10.1007/s11042-016-3262-0