MedGraph: a graph-based representation and computation to handle large sets of images

In order to process and analyze very large volumes of images, efficient representation and structuring techniques are required. Since, current computing machines can provide large memory size, trading off reasonable amount of memory in order to achieve efficient and parallelizable representation of images is preferable. In this paper, we propose a new structure to represent and store images based on in-memory graph concept. Our method of computation provides a faster execution time than the traditional array-based representation. Each pixel of an image is represented as a one node in the graph. In addition, nodes have pointers to other neighboring nodes (pixels). The structure represents an image as a grid of connected linked lists and each grid is connected to other grids. Using our method, an image can be represented in one of three different representations which are: octal linked list, quadratic linked list, and dual linked list representations. We provide experiments and evaluations using the dual linked list representation as it requires less memory space. We apply our methodology for medical images as a proof of the concept to find a region of interest in an image. We have collected and used real medical images to build and process the graph which we call MedGraph. Our experimental results show that the proposed MedGraph technique improves the searching time for finding a region of interest when compared to the traditional representation. It is worth mentioning here that MedGraph is a generic representation strategy that can be applied to any type of images, although this paper uses medical images as a proof of the concept.

[1]  Michael Stonebraker,et al.  Intel "big data" science and technology center vision and execution plan , 2013, SGMD.

[2]  Reynold Xin,et al.  GraphX: a resilient distributed graph system on Spark , 2013, GRADES.

[3]  Jianzhong Li,et al.  Efficient Algorithms for Summarizing Graph Patterns , 2011, IEEE Transactions on Knowledge and Data Engineering.

[4]  Alan M. Frieze,et al.  Algorithmic techniques for modeling and mining large graphs (AMAzING) , 2013, KDD.

[5]  Craig C. Douglas,et al.  An Open Framework for Dynamic Big-data-driven Application Systems (DBDDAS) Development , 2014, ICCS.

[6]  Mona M At T I A N,et al.  Applying big-data technologies to network architecture , 2012 .

[7]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[8]  Ghassan Hamarneh,et al.  Multi-Region Probabilistic Dice Similarity Coefficient using the Aitchison Distance and Bipartite Graph Matching , 2015, ArXiv.

[9]  Joseph Gonzalez,et al.  PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.

[10]  智一 吉田,et al.  Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[11]  Carlo Curino,et al.  Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.

[12]  Qiang Zhou,et al.  Content-Based Image Retrieval Based on ROI Detection and Relevance Feedback , 2005, Multimedia Tools and Applications.

[13]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[14]  Christian S. Jensen,et al.  Effective caching of shortest paths for location-based services , 2012, SIGMOD Conference.

[15]  Jianzhong Li,et al.  Summarizing Graph Patterns , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[16]  Robin Beckwith Australia's Quiet Success Story: The Riches of Bass Strait , 2011 .

[17]  Robin Beckwith Managing Big Data: Cloud Computing and Co-Location Centers , 2011 .

[18]  Chang Zhou,et al.  MOCgraph: Scalable Distributed Graph Processing Using Message Online Computing , 2014, Proc. VLDB Endow..

[19]  Ren Wang,et al.  Collaborative Memories in Clusters: Opportunities and Challenges , 2014, Trans. Comput. Sci..

[20]  E. Livne,et al.  Anisotropies in the Neutrino Fluxes and Heating Profiles in Two-dimensional, Time-dependent, Multigroup Radiation Hydrodynamics Simulations of Rotating Core-Collapse Supernovae , 2004 .

[21]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[22]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Lin Shi,et al.  Graph-Cut-Based Segmentation of Proximal Femur from Computed Tomography Images with Shape Prior , 2015 .

[24]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Haixun Wang,et al.  Managing and mining large graphs: systems and implementations , 2012, SIGMOD Conference.

[26]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[27]  Ian T. Young,et al.  Fundamentals of Image Processing , 1998 .