ABSTRACT
Inferring gene regulatory network (GRN) is one of the major challenges in bioinformatics. A great amount of gene expression data is being produced raising the issue of GRN reconstruction. This later becomes an even more difficult task to perform when the biological dataset is very large. GRN reconstruction can be achieved through clustering. In this paper we propose a framework for gene expression data clustering for the purpose of GRN reconstruction. The proposed framework is based on a parallel BFR clustering using MapReduce programing model. Experimental results using several gene expression datasets from the literature show the effectiveness of the proposed framework
- P.D'haeseleer, "how does gene expression clustering work", Nature biotechnology, Vol 23,Num 12, December 2005. Google ScholarCross Ref
- S.owen, R.Anil, T.Dunning,E.Freidman, " Mahout in Action",2012Google Scholar
- P. Harun, E. Burak, D. P. Andy, and Y. Cetin, "Clustering of high throughput gene expression data,", Computers & OperationResearch, vol. 39, pp. 3046--3061, 2012. Google ScholarDigital Library
- P.S. Bradley, U.Fayyad, and C. Reina, "Scaling Clustering Algorithms to Large Databases", KDD-98 Proceedings, 1998.Google Scholar
- G.N.Dimitrakopoulos, I.A.Maraziotis, K.Sgarbas, "A Clustering Based Method Accelerating Gene Regulatory Network Reconstruction", 14th International Conference on Computational Science, ICCS 2014. Google ScholarCross Ref
- J.Loskovec, A.Rajaraman and J.D.Ullma, "Mining of massive Data" 2014.Google Scholar
- C.Panseand and M.Kshirsagar "Survey on modelling methods applicable to gene regulatory network".International Bioinformatics & Biosciences (IJBB) Vol.3, pp.13--32, 2013. Google ScholarCross Ref
- J. Dean and S. Ghemawat, "Mapreduce: Simplified data processing on large clusters,", pp. 137--150, 2004.Google ScholarDigital Library
- M. Souto, I. Costa, D. de Araujo, T. Ludermir, and A. Schliep, "Clustering Cancer Gene Expression Data: A Comparative Study", BMC Bioinformatics, vol. 9, pp. 497, 2008. Google ScholarCross Ref
- S.Wang, I.Pand, D.Johnson, I.Emam, F.Guitton, A.Oehmicen, and Y. Guo, "Optimising parallel R correlation matrix calculations on gene expression data using MapReduce", Wanget al. BMC Bioinformatics 2014. Google ScholarCross Ref
- A.S. Arefin, C.Riveros, R.Berretta, P.Moscato, "Computing Large-scale Distance Matrices on GPU", he 7th International Conference on Computer Science & Education ICCSE, 2012. Google ScholarCross Ref
- I. Aljarah, and S. Ludwing, "Parallel Particle Swarm Optimization Clustering Algorithm based on MapReduce Methodology", NaBIC, vol. 4, pp. 104--111, 2012. Google ScholarCross Ref
- M.Daoudi, S.Hamena, Z.Benmounah and M.Batouche, "Parallel Differential Evolution Clustering Algorithm based on MapReduce", 6th International Conference of Soft Computing and Pattern Recognition, SoCPaR 2014.Google Scholar
- V.A. Likic, M. J. McConville, T. Lithgow, and A. Bacic1, "Systems Biology: The Next Frontier for Bioinformatics", Advances in Bioinformatics Vol 2010, pp.1--10,2010.Google ScholarCross Ref
- L.V. Bijuraj, "Clustering and its Applications", Proceedings of National Conference on New Horizons in IT - NCNHIT 2013.Google Scholar
- N.S.Nithya1, Dr.K.Duraiswamy,P.Gomathy "A Survey on Clustering Techniques in Medical Diagnosis",International Journal of Computer Science Trends and Technology (IJCST) -- Vo1, 2013.Google Scholar
- H.Liu, J.Lu, "Brief Survey of K-means Clustering Algorithms", Applied Mechanics and Materials, 2010.Google Scholar
Recommendations
An Integrative Tool for Gene Regulatory Network Reconstruction Based on Microarray Data
BIBE '09: Proceedings of the 2009 Ninth IEEE International Conference on Bioinformatics and BioengineeringThe transcriptional regulation of gene expression has been known to be a key mechanism in the functioning of the cell and the gene expression is influenced by the transcriptional regulatory strengths. In this paper we extend the function of a former ...
Transcriptional gene regulatory network reconstruction through cross platform gene network fusion
PRIB'07: Proceedings of the 2nd IAPR international conference on Pattern recognition in bioinformaticsMicroarray gene expression data is used to model differential activity in Gene Regulatory Networks (GRN) to elucidate complex cellular processes, though network modeling is susceptible to errors due to both noisy nature of gene expression data and ...
Inference of Gene Regulatory Network by Bayesian Network Using Metropolis-Hastings Algorithm
ADMA '07: Proceedings of the 3rd international conference on Advanced Data Mining and ApplicationsBayesian networks are widely used to infer genes regulatory network from their transcriptional expression data. Bayesian network of the best score is usually chosen as genes regulatory model. However, without the hint from biological ground truth, and ...
Comments