skip to main content
10.1145/3107411.3108196acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
poster

Neopepsee: Accurate Genome-level Prediction of Neoantigens by Harnessing Sequence and Amino Acid Immunogenicity Information

Authors Info & Claims
Published:20 August 2017Publication History

ABSTRACT

In cancer genomics, next generation sequencing data are usually used to detect somatic driver mutations for identifying the cause of tumorigenesis. However, non-driver somatic mutations, or passenger mutations, can also play an important role in cancer cell survival and treatment by generating aberrant short peptide sequences as known as "neoantigens". Tumor-specific mutations form novel immunogenic peptides called neoantigens, which could be important to the recent promising outcomes of cancer immunotherapy, including immune checkpoint blockade. Many studies have tried to identify various sequence characteristics for prediction of immunogenicity; however, practical applications rely on a single predicted value (MHC-I binding affinity) with an arbitrary cut-off. Here, we developed Neopepsee, a method that applies a machine learning to predict personal neoantigen with next generation sequencing data. Neopepsee not only automates the entire computational procedure for immunogenicity prediction from raw data but also improves accuracy by harnessing 10 different features for classification, including conventional MHC-I and T-cell receptor binding affinity and amino acid characteristics (e.g., hydrophobicity, polarity and charge). Additionally, we found that protein sequence similarity to known pathogenic epitopes is a novel strong feature for classification. Tests with validated epitope datasets and independently proven neoantigens confirmed the remarkable improvement in accuracy. Application of Neopepsee to 224 public stomach adenocarcinoma data predicted neoantigens, whose burden is strongly correlated with patient prognosis. By providing a convenient platform with better accuracy, Neopepsee will be of many uses in cancer immunotherapy research, such as in developing predictive biomarkers and in designing personalized cancer vaccines.

Index Terms

  1. Neopepsee: Accurate Genome-level Prediction of Neoantigens by Harnessing Sequence and Amino Acid Immunogenicity Information

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ACM-BCB '17: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics
      August 2017
      800 pages
      ISBN:9781450347228
      DOI:10.1145/3107411

      Copyright © 2017 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 August 2017

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      ACM-BCB '17 Paper Acceptance Rate42of132submissions,32%Overall Acceptance Rate254of885submissions,29%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader