
Medical Image Analysis

Volume 65, October 2020, 101795

Multi-task multi-modal learning for joint diagnosis and prognosis of human cancers

https://doi.org/10.1016/j.media.2020.101795

Highlights

  • Consider the inherent correlation between diagnosis and prognosis tasks and propose a novel multi-task multi-modal learning framework for joint diagnosis and prognosis of human cancer.

  • Integrate histopathological image and genomic data for the diagnosis and prognosis of human cancers.

  • Conduct experiments on three cancer cohorts from the TCGA database that can validate the effectiveness of the proposed method.

  • In-depth explanation of the selected multi-modal biomarkers.

Abstract

With the tremendous development of artificial intelligence, many machine learning algorithms have been applied to the diagnosis of human cancers. Recently, rather than predicting categorical variables (e.g., stages and subtypes) as in cancer diagnosis, several prognosis prediction models based on patients’ survival information have been adopted to estimate the clinical outcome of cancer patients. However, most existing studies treat the diagnosis and prognosis tasks separately. In fact, the diagnosis information (e.g., TNM stage) indicates the extent of disease severity, which is highly correlated with patients’ survival. While the diagnosis is largely made based on histopathological images, recent studies have also demonstrated that integrative analysis of histopathological images and genomic data holds great promise for improving the diagnosis and prognosis of cancers. However, directly combining these two types of data may introduce redundant features that negatively affect the prediction performance. Therefore, it is necessary to select informative features from the derived multi-modal data. Based on the above considerations, we propose a multi-task multi-modal feature selection method for the joint diagnosis and prognosis of cancers. Specifically, we make use of the task relationship learning framework to automatically discover the relationship between the diagnosis and prognosis tasks, through which we can identify important image and genomic features for both tasks. In addition, we add a regularization term to ensure that the correlation within the multi-modal data is captured. We evaluate our method on three cancer datasets from The Cancer Genome Atlas project, and the experimental results verify that our method achieves better performance on both diagnosis and prognosis tasks than related methods.

Graphical abstract

The framework of our study consists of three steps. First, we extract imaging and eigengene features from the histopathological image and gene expression data, respectively. Second, we apply the proposed multi-task multi-modal feature selection algorithm (i.e., M2DP) to identify diagnosis- and prognosis-related features. Third, based on the selected features of each patient, we apply AdaBoost and Cox proportional hazards models for the diagnosis and prognosis prediction of cancer patients, respectively.


Introduction

Cancer is the leading cause of death in economically developed countries and the second leading cause of death in developing countries (Siegel et al., 2016). It is estimated that there will be 539.2 new cases of cancer per 10,000 people by 2025 (Siegel et al., 2016). Thus, accurate diagnosis of cancer, especially at an early stage, is particularly important. So far, many biomarkers have been shown to be sensitive for the diagnosis of cancers. For example, quite a number of cancer diagnosis models (Nir et al., 2018, Coudray et al., 2018, Gecer et al., 2018) were based on histopathological images, since these images can reveal the morphological characteristics of cells that are closely related to the aggressiveness of cancers. Besides histopathological images, it is known that genetic mutations and gene expression levels can affect the development of cancers by accelerating cell division rates (Kim and Kaelin, 2004) and modifying the tumor micro-environment (Yuan et al., 2012). Accordingly, many researchers have also used genomic features, such as gene expression signatures, to drive diagnosis (Wilhelm et al., 2002, Yang et al., 2018, Niazi and Khalid, 2016). In all these diagnosis methods, classification models are learned from training samples to predict categorical variables (e.g., TNM stage) for the testing subjects.

In addition to predicting categorical variables (i.e., TNM stage) as in cancer diagnosis, many prognosis prediction models have been adopted to perform survival analysis based on different modalities of biomarkers (Lin et al., 1993, Cheng et al., 2017, Cooperberg et al., 2015, Li et al., 2016, Zhu et al., 2016, Yi et al., 2018, Veer et al., 2002). Different from the diagnosis task, which focuses on identifying the current disease state, the prognosis task aims at predicting the expected clinical outcome of cancer patients. Among all the prognosis prediction models, the Cox proportional hazards model (Lin et al., 1993) is the most popular. Cheng et al. (2017a) and Cooperberg et al. (2015) used the Cox model to stratify cancer patients into subgroups with different predicted outcomes from histopathological images and genomic data, respectively. Besides the Cox model, two recent studies, MTLSA (Li et al., 2016) and DeepSurv (Zhu et al., 2016), were designed to model the complex relationship between the input features and clinical outcomes. Other studies, such as Yi et al. (2018), proposed a hierarchical regression model to estimate the survival risks of different patients, and experimental results on high-dimensional genomic data validated its superiority over competing methods.
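Prognosis models such as these are commonly evaluated with the concordance index (C-index), which measures how well the predicted risk scores order patients' survival times under right censoring. A minimal sketch in plain Python (the variable names are illustrative, not taken from the paper):

```python
def concordance_index(times, events, risks):
    """Harrell's concordance index for right-censored survival data.

    A pair (i, j) is comparable when the patient with the shorter
    observed time had an event (events[i] == 1); the pair is concordant
    when that patient was also assigned the higher predicted risk.
    Ties in risk count as half-concordant.
    """
    concordant, comparable = 0.0, 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # i must have the earlier time and an observed event
            if times[i] < times[j] and events[i] == 1:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    return concordant / comparable

# Toy cohort: a perfect risk ordering yields a C-index of 1.0
times  = [2.0, 5.0, 7.0, 9.0]
events = [1, 1, 0, 1]          # 0 = censored
risks  = [0.9, 0.6, 0.4, 0.1]  # higher risk -> shorter survival
print(concordance_index(times, events, risks))  # -> 1.0
```

A C-index of 0.5 corresponds to random ordering, and 1.0 to a perfect one, which is why it is the standard metric reported for the prognosis task later in the paper.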

Despite this progress, to the best of our knowledge, all existing studies have treated the diagnosis and prognosis tasks independently, without considering the inherent correlation between them. As a matter of fact, the diagnosis information indicates the extent of disease severity, which is highly correlated with patients’ clinical outcomes (Scarpa et al., 2010). For example, patients in stage II suffer from more aggressive cancers than those in stage I, and thus generally have a higher risk of short survival time. It can be expected that better prediction performance will be achieved if we learn the diagnosis and prognosis tasks jointly, since the information from one task can help predict the other.

At the same time, existing studies (Sun and Li, 2018, Yao and Huang, 2017, Shao et al., 2018, Cheng et al., 2017, Yuan et al., 2012, Huang et al., 2019) have demonstrated that the integrative analysis of images and genomic data holds great promise for cancer assessment and risk prediction. For example, Yuan et al. (2012) demonstrated that integrating lymphocyte morphology from histopathological images with gene expression signatures can significantly increase the prognosis accuracy for ER-negative breast cancer patients. Sun and Li (2018) combined pathological images with gene expression data to classify longer-term and shorter-term survivors in a breast cancer cohort. Yao and Huang (2017) developed a novel deep learning framework integrating both image and genomic data to predict the clinical outcome of cancer patients. However, direct combination of multi-modal data increases the feature dimension, which may cause the "curse of dimensionality" problem, given the limited training samples in cancer research. Thus, feature selection, which can be considered as bio-marker identification, has become an important step in the diagnosis and prognosis of cancers. Currently, most existing studies (Cheng et al., 2017, Yuan et al., 2012) first concatenated all features from histopathological images and genomic data into a long feature vector, and then applied a traditional single-modality sparse learning algorithm (e.g., LASSO) to discover the key components. However, these feature selection methods overlook the correlation within the multi-modal data, which has been widely accepted as a critical component in state-of-the-art multi-modality machine learning methods (Liu et al., 2014, Mohammadi et al., 2017).
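To make the concatenate-then-LASSO baseline concrete, the single-modality sparse selection step used by those earlier studies can be sketched with a simple iterative soft-thresholding (ISTA) solver. The data here are synthetic and the parameter choices illustrative, not the settings of any cited study:

```python
import numpy as np

def soft_threshold(x, t):
    # Proximal operator of the l1 norm: shrink toward zero by t
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def lasso_ista(X, y, lam=0.1, n_iter=500):
    """Minimize 0.5*||Xw - y||^2 + lam*||w||_1 by ISTA."""
    n, d = X.shape
    w = np.zeros(d)
    L = np.linalg.norm(X, 2) ** 2  # Lipschitz constant of the smooth part
    for _ in range(n_iter):
        grad = X.T @ (X @ w - y)
        w = soft_threshold(w - grad / L, lam / L)
    return w

# Synthetic "concatenated" feature vector: only 3 of 20 features matter
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 20))
w_true = np.zeros(20)
w_true[:3] = [2.0, -1.5, 1.0]
y = X @ w_true + 0.01 * rng.standard_normal(100)

w_hat = lasso_ista(X, y, lam=0.5)
print(np.nonzero(np.abs(w_hat) > 0.1)[0])  # indices of the selected features
```

The l1 penalty zeroes out uninformative coefficients, but as the paragraph above notes, applying it to a flat concatenation ignores which modality each feature came from.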

Inspired by the above considerations, we propose a multi-task multi-modal feature selection method (M2DP) for the joint diagnosis and prognosis of cancers. Specifically, based on the task relationship learning framework, our method can automatically derive the correlation between the diagnosis and prognosis tasks, without assuming it to be known in advance. Intuitively, exploiting such task relationships can help identify a subset of bio-markers for a specific task with the knowledge of the other related task. In addition, we also consider the association between different modalities by adding a regularization term to capture the inter-correlation between the selected imaging and genomic components.
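The full M2DP objective is not reproduced in this excerpt. As a hedged illustration only, a standard building block for this style of multi-task feature selection is the l2,1-norm penalty, whose proximal operator shrinks entire feature rows of the task-coefficient matrix, so that a feature is selected or discarded jointly across the diagnosis and prognosis tasks:

```python
import numpy as np

def prox_l21(W, t):
    """Row-wise proximal operator of t * ||W||_{2,1}.

    Each row of W holds one feature's coefficients across tasks
    (e.g., column 0 = diagnosis, column 1 = prognosis). Rows whose
    l2 norm falls below t are zeroed, dropping that feature from
    both tasks at once; surviving rows are shrunk proportionally.
    """
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    scale = np.maximum(1.0 - t / np.maximum(norms, 1e-12), 0.0)
    return W * scale

W = np.array([[3.0, 4.0],   # strong feature: row norm 5 -> kept, scaled by 0.8
              [0.3, 0.4],   # weak feature:   row norm 0.5 -> entire row zeroed
              [0.0, 2.0]])  # row norm 2 -> kept, scaled by 0.5
print(prox_l21(W, 1.0))
```

This is a generic sketch of joint row-sparse selection, not the authors' exact regularizer; M2DP additionally learns the task correlation and couples the imaging and genomic modalities.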

To evaluate the effectiveness of the proposed method, we perform experiments on three large cancer cohorts (i.e., Lung Squamous Cell Carcinoma, Breast Invasive Carcinoma and Liver Hepatocellular Carcinoma) from The Cancer Genome Atlas (TCGA). The experimental results verify that our proposed M2DP method can not only achieve better performance on both diagnosis and prognosis tasks than competing algorithms, but also help to discover useful histopathological image and genomic bio-markers for predicting the development of cancers.

Our preliminary work, which showed that using the diagnosis information alone can help achieve better prognosis prediction performance, was published at MICCAI 2019 (Shao et al., 2019). In this substantially expanded journal paper, we offer new contributions in the following aspects: 1) further demonstrating that the proposed model can also improve the prediction performance for the diagnosis task; 2) evaluating the effectiveness of the proposed method on two additional datasets (i.e., the Breast Invasive Carcinoma and Liver Hepatocellular Carcinoma datasets); 3) providing an in-depth explanation of the bio-markers identified by the proposed model; 4) visualizing the selected image features in both high and low survival risk groups; and 5) discussing the effect of the parameter σ in the proposed M2DP model.

Section snippets

Datasets.

The Cancer Genome Atlas (TCGA) is a large consortium project that has generated genomic and imaging data for thousands of tumor samples across more than 30 types of cancers (Zhu et al., 2014). In this study, we test our method on three early-stage (i.e., stage I and stage II) cancer cohorts including Lung Squamous Cell Carcinoma (LUSC), Breast Invasive Carcinoma (BRCA) and Liver Hepatocellular Carcinoma (LIHC) from TCGA, since the diagnosis and prognosis of early-stage cancer patients are

Experimental settings.

To evaluate the performance of the proposed M2DP method for the diagnosis and prognosis of cancer patients, we test it on three early-stage cancer cohorts (i.e., LUSC, BRCA and LIHC) derived from the TCGA database. For each cohort, we randomly partition it into 5 folds. Here, we enforce that the ratios of Stage I and censored patients in each fold approximate those in the whole cohort with a gap of at most ±0.05, and we show the ratios of Stage I and censored patients in each fold in Tables S4-S6 in
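The stratified partition described above can be approximated by dealing each stratum (e.g., each (stage, censoring) combination) round-robin across the folds, which keeps every fold's label ratios close to the cohort's. This is an illustrative sketch under that assumption, not the authors' exact procedure:

```python
import random
from collections import defaultdict

def stratified_folds(labels, k=5, seed=0):
    """Assign sample indices to k folds so that each label's
    proportion per fold stays close to its cohort-wide proportion.

    `labels` can be any hashable stratum, e.g. a (stage, censored) pair.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for idx, lab in enumerate(labels):
        by_label[lab].append(idx)
    folds = [[] for _ in range(k)]
    for lab, members in by_label.items():
        rng.shuffle(members)
        for pos, idx in enumerate(members):
            folds[pos % k].append(idx)   # round-robin deal per stratum
    return folds

# Toy cohort: 60 Stage I patients (indices 0-59), 40 Stage II (60-99)
labels = ["I"] * 60 + ["II"] * 40
folds = stratified_folds(labels)
for f in folds:
    ratio = sum(labels[i] == "I" for i in f) / len(f)
    print(round(ratio, 2))   # -> 0.6 for every fold (cohort ratio is 0.6)
```

Because each stratum's size here divides evenly by k, every fold matches the cohort ratio exactly; with uneven strata the deviation stays within one sample per stratum, comfortably inside the ±0.05 gap the paper enforces.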

The effect of the parameter σ in M2DP model.

In the proposed M2DP model, we fix the parameter σ in the weight of the survival loss (i.e., shown in Eq. (2)) as a constant (σ=1.5) that is larger than 1. In this section, we investigate the effect of tuning σ in the M2DP model. Specifically, we vary the parameter σ over {0.5, 1.5, 2, 2.5}, and record the corresponding accuracies and concordance indexes for the diagnosis and prognosis tasks, respectively. The results are shown in Table 8. As can be seen from Table 8, on one hand, the

Conclusion

In this paper, we develop M2DP, an effective multi-task multi-modal feature selection method that can jointly identify diagnosis and prognosis associated bio-markers from both histopathological image and gene expression data. The main advantage of our approach is its capability of utilizing the inherent correlation within different tasks to guide the feature selection process, which can more accurately diagnose cancer stage and predict the clinical outcome for different types of cancer

CRediT authorship contribution statement

Wei Shao: Conceptualization, Methodology, Writing - original draft. Tongxin Wang: Methodology. Liang Sun: Methodology. Tianhan Dong: Validation. Zhi Han: Writing - original draft. Zhi Huang: Conceptualization. Jie Zhang: Writing - review & editing. Daoqiang Zhang: Supervision. Kun Huang: Supervision.

Declaration of Competing Interest

Dear Editor-in-Chief:

We would like to submit the enclosed manuscript "Multi-task Multi-modal Learning for Joint Diagnosis and Prognosis of Human Cancers", which we were invited to submit to Medical Image Analysis as an extension of our MICCAI 2019 paper (i.e., Diagnosis-Guided Multi-modal Feature Selection for Prognosis Prediction of Lung Squamous Cell Carcinoma).

All authors have approved this submission and confirm there are no conflicts of interest with the requested reviewers.

Best regards,

Wei

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Nos. 61902183, 61876082, 61861130366, 61732006) and National Key R&D Programme of China (Grant Nos. 2018YFC2001600, 2018YFC2001602), the Royal Society-Academy of Medical Sciences Newton Advanced Fellowship (No. NAF\R1\180371), and the IU Precision Health Initiative Program.

References (49)

  • T. Ashton et al., Oxidative phosphorylation as an emerging target in cancer therapy, Clin. Cancer Res. (2018)

  • H.C. Chen et al., Assessment of performance of survival prediction models for cancer prognosis, BMC Med. Res. Methodol. (2012)

  • J. Cheng et al., Identification of topological features in renal tumor microenvironment associated with patient survival, Bioinformatics (2017)

  • J. Cheng et al., Integrative analysis of histopathological images and genomic data predicts clear cell renal cell carcinoma prognosis, Cancer Res. (2017)

  • N. Coudray et al., Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning, Nat. Med. (2018)

  • C. Denkert et al., Tumor-associated lymphocytes as an independent predictor of response to neoadjuvant chemotherapy in breast cancer, J. Clin. Oncol. (2010)

  • Z. Huang et al., SALMON: Survival analysis learning with multi-omics neural networks on breast cancer, Front. Genet. (2019)

  • Y. Kim et al., Role of VHL gene mutation in human cancer, J. Clin. Oncol. (2004)

  • M. Lerman et al., The 630-kb lung cancer homozygous deletion region on human chromosome 3p21.3: identification and evaluation of the resident candidate tumor suppressor genes, Cancer Res. (2000)

  • Y. Li et al., A multi-task learning formulation for survival analysis, Proceedings of the SIGKDD International Conference on Knowledge Discovery and Data Mining (2016)

  • D. Lin et al., Checking the Cox model with cumulative sums of martingale-based residuals, Biometrika (1993)

  • D.Y. Lin et al., The robust inference for the Cox proportional hazards model, J. Am. Stat. Assoc. (1989)

  • M. Liu et al., Joint binary classifier learning for ECOC-based multi-class classification, IEEE Trans. Pattern Anal. Mach. Intell. (2015)

  • Y. Liu et al., Cancer and innate immune system interactions: translational potentials for cancer immunotherapy, J. Immunother. (1997)
1 Wei Shao is now working in the School of Medicine, Indiana University, USA.
