Poster Presentation HUPO 2019 - 18th Human Proteome Organization World Congress

FusionPro, a versatile proteogenomic tool for identification of novel fusion transcripts and their potential translation products in cells (#615)

Chae-Yeon Kim 1 , Keun Na 1 , Saeram Park 1 , Seul-Ki Jeong 1 , Jin-Young Cho 1 , Heon Shin 1 , Min Jung Lee 1 , Gyoonhee Han 2 , Young-Ki Paik 1
  1. Yonsei Proteome Research Center, Yonsei University, Seoul, Seodaemun-gu, Republic of Korea
  2. Department of Pharmacy, College of Pharmacy, Yonsei University, Seoul, Seodaemun-gu, Republic of Korea

Fusion proteoforms are translation products derived from gene fusion. Although very rare, the fusion proteoforms play important roles in biomedical science. For example, fusion proteoforms influence the development of tumors by serving as cancer markers or cell cycle regulators. Although numerous studies have reported bioinformatics tools that can predict fusion transcripts, few proteogenomic tools are available that can predict and identify proteoforms. In this study, we develop a versatile proteogenomic tool “FusionPro”, which facilitates the identification of fusion transcripts and their potential translatable peptides. FusionPro provides an independent gene fusion prediction module and can build sequence databases for annotated fusion proteoforms. FusionPro shows greater sensitivity than the available fusion finders when analyzing simulated or real RNA sequencing datasets. We use FusionPro to identify 18 fusion junction peptides and three potential fusion-derived peptides by MS/MS-based analysis of leukemia cell lines (Jurkat and K562) and ovarian cancer tissues from the Clinical Proteomic Tumor Analysis Consortium. Among the identified fusion proteins, we molecularly validate two fusion junction isoforms and a translation product of FAM133B:CDK6. Moreover, sequence analysis suggests that the fusion protein participates in the cell cycle progression. In addition, our prediction results indicate that fusion transcripts often have multiple fusion junctions and that these fusion junctions tend to be distributed in a non-random pattern at both the chromosome and gene levels. Thus, FusionPro allows users to detect various types of fusion translation products using a transcriptome-informed approach and to gain a comprehensive understanding of the formation and biological roles of fusion proteoforms.