skip to main content
10.1145/3090354.3090368acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbdcaConference Proceedingsconference-collections
research-article

Revisiting BFR Clustering Algorithm for Large Scale Gene Regulatory Network Reconstruction using MapReduce

Authors Info & Claims
Published:29 March 2017Publication History

ABSTRACT

Inferring gene regulatory network (GRN) is one of the major challenges in bioinformatics. A great amount of gene expression data is being produced raising the issue of GRN reconstruction. This later becomes an even more difficult task to perform when the biological dataset is very large. GRN reconstruction can be achieved through clustering. In this paper we propose a framework for gene expression data clustering for the purpose of GRN reconstruction. The proposed framework is based on a parallel BFR clustering using MapReduce programing model. Experimental results using several gene expression datasets from the literature show the effectiveness of the proposed framework

References

  1. P.D'haeseleer, "how does gene expression clustering work", Nature biotechnology, Vol 23,Num 12, December 2005. Google ScholarGoogle ScholarCross RefCross Ref
  2. S.owen, R.Anil, T.Dunning,E.Freidman, " Mahout in Action",2012Google ScholarGoogle Scholar
  3. P. Harun, E. Burak, D. P. Andy, and Y. Cetin, "Clustering of high throughput gene expression data,", Computers & OperationResearch, vol. 39, pp. 3046--3061, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P.S. Bradley, U.Fayyad, and C. Reina, "Scaling Clustering Algorithms to Large Databases", KDD-98 Proceedings, 1998.Google ScholarGoogle Scholar
  5. G.N.Dimitrakopoulos, I.A.Maraziotis, K.Sgarbas, "A Clustering Based Method Accelerating Gene Regulatory Network Reconstruction", 14th International Conference on Computational Science, ICCS 2014. Google ScholarGoogle ScholarCross RefCross Ref
  6. J.Loskovec, A.Rajaraman and J.D.Ullma, "Mining of massive Data" 2014.Google ScholarGoogle Scholar
  7. C.Panseand and M.Kshirsagar "Survey on modelling methods applicable to gene regulatory network".International Bioinformatics & Biosciences (IJBB) Vol.3, pp.13--32, 2013. Google ScholarGoogle ScholarCross RefCross Ref
  8. J. Dean and S. Ghemawat, "Mapreduce: Simplified data processing on large clusters,", pp. 137--150, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Souto, I. Costa, D. de Araujo, T. Ludermir, and A. Schliep, "Clustering Cancer Gene Expression Data: A Comparative Study", BMC Bioinformatics, vol. 9, pp. 497, 2008. Google ScholarGoogle ScholarCross RefCross Ref
  10. S.Wang, I.Pand, D.Johnson, I.Emam, F.Guitton, A.Oehmicen, and Y. Guo, "Optimising parallel R correlation matrix calculations on gene expression data using MapReduce", Wanget al. BMC Bioinformatics 2014. Google ScholarGoogle ScholarCross RefCross Ref
  11. A.S. Arefin, C.Riveros, R.Berretta, P.Moscato, "Computing Large-scale Distance Matrices on GPU", he 7th International Conference on Computer Science & Education ICCSE, 2012. Google ScholarGoogle ScholarCross RefCross Ref
  12. I. Aljarah, and S. Ludwing, "Parallel Particle Swarm Optimization Clustering Algorithm based on MapReduce Methodology", NaBIC, vol. 4, pp. 104--111, 2012. Google ScholarGoogle ScholarCross RefCross Ref
  13. M.Daoudi, S.Hamena, Z.Benmounah and M.Batouche, "Parallel Differential Evolution Clustering Algorithm based on MapReduce", 6th International Conference of Soft Computing and Pattern Recognition, SoCPaR 2014.Google ScholarGoogle Scholar
  14. V.A. Likic, M. J. McConville, T. Lithgow, and A. Bacic1, "Systems Biology: The Next Frontier for Bioinformatics", Advances in Bioinformatics Vol 2010, pp.1--10,2010.Google ScholarGoogle ScholarCross RefCross Ref
  15. L.V. Bijuraj, "Clustering and its Applications", Proceedings of National Conference on New Horizons in IT - NCNHIT 2013.Google ScholarGoogle Scholar
  16. N.S.Nithya1, Dr.K.Duraiswamy,P.Gomathy "A Survey on Clustering Techniques in Medical Diagnosis",International Journal of Computer Science Trends and Technology (IJCST) -- Vo1, 2013.Google ScholarGoogle Scholar
  17. H.Liu, J.Lu, "Brief Survey of K-means Clustering Algorithms", Applied Mechanics and Materials, 2010.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    BDCA'17: Proceedings of the 2nd international Conference on Big Data, Cloud and Applications
    March 2017
    685 pages
    ISBN:9781450348522
    DOI:10.1145/3090354

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 29 March 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader