ABSTRACT
RNA-seq contains rich information about individual even single cell, implies certain biology pattern vary in special time or space two dimensions, e.g. different life stage or environment. Byusing clustering and other computing methods, we can efficient analysis and decode those data applying to cancer diagnosis and treat, biological evolution and so on. However, RNA-seq data has features of super-high dimensions, less labeled samples and strong noise, which bring large challenges for clustering analysis. Therefore, we proposed a new clustering method GSE, which can efficient enhance the signal-to-noise ratio of input similarity matrix using diffusion process in weighted connection network to improve clustering performance. Comparing with latest clustering methods, our method has advantages in external clustering criterions NMI and ARI indicators. Meanwhile inadequacy and improved idea are given. Code can be downloaded from Git-hub.
- Shao C, Höfer T. Robust classification of single-cell transcriptome data by nonnegative matrix factorization. Bioinformatics, 2017, 33(2): 235--242. DOI= https://doi.org/10.1093/bioinformatics/btw607.Google ScholarCross Ref
- Li A, Yin X, Pan Y. Three-dimensional gene map of cancer cell types: Structural entropy minimisation principle for defining tumour subtypes. Scientific reports, 2016, 6: 20412. DOI= https://doi.org/10.1038/srep20412.Google Scholar
- Wang B, Zhu J, Pierson E, et al. Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning. Nature methods, 2017, 14(4): 414. DOI= https://doi.org/10.1038/nmeth.4207.Google Scholar
- Zhu X, Li H D, Xu Y, et al. A hybrid clustering algorithm for identifying cell types from single-cell RNA-seq data. Genes, 2019, 10(2): 98. DOI= https://doi.org/10.3390/genes10020098.Google ScholarCross Ref
- Xu C, Su Z. Identification of cell types from single-cell transcriptomes using a novel clustering method. Bioinformatics, 2015, 31(12): 1974--1980. DOI= https://doi.org/10.1093/bioinformatics/btv088.Google ScholarCross Ref
- Zhu X, Li H D, Guo L, et al. Analysis of Single-Cell RNA-seq Data by Clustering Approaches[J]. Current Bioinformatics, 2019, 14(4): 314--322. DOI= https://doi.org/10.2174/1574893614666181120095038.Google ScholarCross Ref
- Wang B, Pourshafeie A, Zitnik M, et al. Network enhancement as a general method to denoise weighted biological networks[J]. Nature communications, 2018, 9(1): 3108. DOI= https://doi.org/10.1038/s41467-018-05469-x.Google Scholar
- Biase F H, Cao X, Zhong S. Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencing. Genome research, 2014, 24(11): 1787--1796. DOI= http://dx.doi.org/10.1101/gr.177725.114.Google Scholar
- Kimmerling R J, Szeto G L, Li J W, et al. A microfluidic platform enabling single-cell RNA-seq of multigenerational lineages. Nature communications, 2016, 7: 10220. DOI= https://doi.org/10.1038/ncomms10220.Google Scholar
- Usoskin D, Furlan A, Islam S, et al. Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing. Nature neuroscience, 2015, 18(1): 145.DOI= https://doi.org/10.1038/nn.3881.Google Scholar
- Kim J K, Kolodziejczyk A A, Ilicic T, et al. Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression. Nature communications, 2015, 6: 8687.DOI= https://doi.org/10.1038/ncomms9687.Google Scholar
- Treutlein B, Brownfield D G, Wu A R, et al. Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq. Nature, 2014, 509(7500): 371. DOI= https://doi.org/10.1038/nature13173.Google ScholarCross Ref
- Blondel V D, Guillaume J L, Lambiotte R, et al. Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment, 2008, 2008(10): P10008.Google Scholar
Index Terms
- GSE: Graph similarity enhancement algorithm for single-cell RNA-seq data clustering
Recommendations
Establishment of an Integrated Computational Workflow for Single Cell RNA-Seq Dataset
ICBBB '19: Proceedings of the 2019 9th International Conference on Bioscience, Biochemistry and BioinformaticsSingle cell RNA-sequencing (scRNA-Seq) has emerged as a popular transcriptomic profiling approach to address long-standing questions on developmental biology and cancer biology. With the advent of increasing single-cell computational methods, it is not ...
Clustering of Small-Sample Single-Cell RNA-Seq Data via Feature Clustering and Selection
PRICAI 2019: Trends in Artificial IntelligenceAbstractWe present FeatClust, a software tool for clustering small sample size single-cell RNA-Seq datasets. The FeatClust approach is based on feature selection. It divides features into several groups by performing agglomerative hierarchical clustering ...
Diffusion Kernel based Fast Adaptive Clustering of Single Cell RNA-seq Data
ICBBT '19: Proceedings of the 2019 11th International Conference on Bioinformatics and Biomedical TechnologyRecently, with the advent of high throughput single-cell technologies, it has become possible to quantify the whole transcriptome of individual cells; however, it remains challenging to discover intrinsic rare cell-types from high throughput genes ...
Comments