ABSTRACT
Nowadays, omics techniques have been widely used to study cancer and other related problems, but there are many cancer subtypes in a certain type of cancer which are unclear or even completely unknown, and thus unsupervised learning are only suitable approaches to address the problem. Cluster analysis refers to the analysis process of grouping data into multiple classes composed of similar data points. In this paper, six omics datasets are used to compare three clustering methods, in order to find a more suitable clustering method for omics datasets.
- Debnath Mousumi, Prasad GBKS, Bisen PS. Molecular Diagnostics: Promises and Possibilities Dordrech Heidelberg London, Springer,2010 pp11-31.Google Scholar
- N. A. Shushkova,S. E. Novikova,V. G. Zgoda. Exosomes of Malignant Tumors: Prospects of Omiсs Diagnostics[J]. Biochemistry (Moscow), Supplement Series B: Biomedical Chemistry,2020,14(2):Google Scholar
- Youqiong Ye,Ying Jing,Liang Li,Gordon B. Mills,Lixia Diao,Hong Liu,Leng Han. Sex-associated molecular differences for cancer immunotherapy[J]. Nature Communications,2020,11(1):Google Scholar
- Jie Xiong,Wei-Li Zhao. Advances in multiple omics of natural-killer/T cell lymphoma[J]. BioMed Central,2018,11(1):Google Scholar
- Cristina Alexandra Ciocan-Cartita,Ancuța Jurj,Mihail Buse,Diana Gulei,Cornelia Braicu,Lajos Raduly,Roxana Cojocneanu,Lavinia Lorena Pruteanu,Cristina Adela Iuga,Ovidiu Coza,Ioana Berindan-Neagoe. The Relevance of Mass Spectrometry Analysis for Personalized Medicine through Its Successful Application in Cancer “Omics”[J]. International Journal of Molecular Sciences,2019,20(10):Google Scholar
- Bin Zhu,Nan Song,Ronglai Shen,Arshi Arora,Mitchell J. Machiela,Lei Song,Maria Teresa Landi,Debashis Ghosh,Nilanjan Chatterjee,Veera Baladandayuthapani,Hongyu Zhao. Integrating Clinical and Multiple Omics Data for Prognostic Assessment across Human Cancers[J]. Scientific Reports,2017,7(400405):Google Scholar
- Bin Chen,Lana Garmire,Diego F. Calvisi,Mei-Sze Chua,Robin K. Kelley,Xin Chen. Harnessing big ‘omics’ data and AI for drug discovery in hepatocellular carcinoma[J]. Nature Reviews Gastroenterology & Hepatology,2020,17(Suppl. 2):Google Scholar
- Y.-H. Taguchi. Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes[J]. BMC Bioinformatics,2018,19(Supplement_2):Google Scholar
- HUANG, XIAOHUI, YE, YUNMING, ZHANG, HAIJUN. Extensions of Kmeans-Type Algorithms: A New Clustering Framework by Integrating Intracluster Compactness and Intercluster Separation[J]. IEEE transactions on neural networks and learning systems,2014,25(8):1433-1446. DOI:10.1109/TNNLS.2013.2293795.Google Scholar
- Ashutosh Kumar Dubey,Umesh Gupta,Sonal Jain. Comparative Study of K-means and Fuzzy C-means Algorithms on The Breast Cancer Data[J]. International Journal on Advanced Science, Engineering and Information Technology,2018,8(1):Google ScholarCross Ref
- Sarica Alessia,Vaccaro Maria Grazia,Quattrone Andrea,Quattrone Aldo. A Novel Approach for Cognitive Clustering of Parkinsonisms through Affinity Propagation[J]. Algorithms,2021,14(2):Google Scholar
- Bae J H , Kim M , Lim J S , Feature Selection for Colon Cancer Detection Using K-Means Clustering and Modified Harmony Search Algorithm[J]. 2021. H. Poor, An Introduction to Signal Detection and Estimation. New York: Springer-Verlag, 1985, ch. 4.Google Scholar
- Zdeněk Šulc,Hana Řezanková. Comparison of Similarity Measures for Categorical Data in Hierarchical Clustering.[J]. Journal of Classification,2019,36(1):Google Scholar
- Fischer S , Freuling C M , T Müller, Defining objective clusters for rabies virus sequences using affinity propagation clustering[J]. Plos Neglected Tropical Diseases, 2018, 12(1):e0006182. E. H. Miller, “A note on reflector arrays (Periodical style—Accepted for publication),” IEEE Trans. Antennas Propagat., to be published.Google ScholarCross Ref
- AID SMKKONEN, EEVA SOMERKOSKI, ULLA PAASO, Genotypic variation in yellow autumn leaf colours explains aphid load in silver BIRCH[J]. The New Phytologist,2012,195(2):461-469. DOI:10.1111/j.1469-8137.2012.04156.x.Google Scholar
- Madan, Siddharth, Dana, Modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) for visual clustering[J]. Pattern Analysis & Applications Paa, 2016.C. J. Kaufman, Rocky Mountain Research Lab., Boulder, CO, private communication, May 1995.Google Scholar
- Xin Li,Wei Xiang. Object Tracking Using Meanshift Algorithm Combined with Kalman Filter on Robotic Fish[C]//Proceedings of 2015 5th International Symposium on Knowledge Acquisition and Modeling(KAM 2015).,2015:179-183.Google Scholar
- HONGBO ZHU, HAI ZHAO, DAN LIU, A new improved filter for target tracking: compressed iterative particle filter[J]. Natural science,2011,3(4):301-306. DOI:10.4236/ns.2011.34039.Google Scholar
- Zhao M , Jha A , Liu Q , Faster Mean-shift: GPU-accelerated Embedding-clustering for Cell Segmentation and Tracking[J]. 2020.Google Scholar
- Frey B J , Dueck D . Clustering by passing messages between data points[J]. Science, 315.Google Scholar
- Schwartz R , Shen W , Campbell J , INTERSPEECH 2007 Construction of a Phonotactic Dialect Corpus using Semiautomatic Annotation *. 2013.Sam Anzaroot and Andrew McCallum. 2013.Google Scholar
Index Terms
- Comparison of different clustering methods applied to omics datasets
Recommendations
A critical review of machine-learning for “multi-omics” marine metabolite datasets
AbstractDuring the last decade, genomic, transcriptomic, proteomic, metabolomic, and other omics datasets have been generated for a wide range of marine organisms, and even more are still on the way. Marine organisms possess unique and diverse ...
Highlights- Recent progress in the use and integration of “multi-omics” techniques to identify novel marine metabolites.
- The multi-omics data integration tools developed for analyzing “multi-omics” data.
- The requirement of ML for analyzing “...
Integrative clustering methods for multi‐omics data
AbstractIntegrative analysis of multi‐omics data has drawn much attention from the scientific community due to the technological advancements which have generated various omics data. Leveraging these multi‐omics data potentially provides a more ...
Three categories of integrative multi‐omics clustering methods. Multi‐omics data (e.g., DNA methylation, copy number variation, gene expression) are collected for each sample. Integrative multi‐omics clustering methods can be used to analyze such data and ...
A review on omics-based biomarkers discovery for Alzheimer's disease from the bioinformatics perspectives: Statistical approach vs machine learning approach
AbstractAlzheimer's Disease (AD) is a neurodegenerative disease that affects cognition and is the most common cause of dementia in the elderly. As the number of elderly individuals increases globally, the incidence and prevalence of AD are ...
Highlights- Omics-based techniques may be able to delineate pathways that lead to neuronal death and identify AD-associated biomarkers.
Comments