蛋白质组学数据揭示可变剪接蛋白质变体
DOI:
作者:
作者单位:

河北医科大学中西医结合研究所

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金


Proteomics Data Reveals Alternative Splicing Proteoforms
Author:
Affiliation:

Institute of Integrated Chinese and western Hebei Medical University Medicine

Fund Project:

The National Natural Science Foundation of China

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    可变剪接使同一个基因产生多种不同的转录本和蛋白质,增加蛋白质多样性和功能复杂性。转录组学和蛋白质组学是鉴定可变剪接的两种主要途径,分别通过分析RNA测序数据和蛋白质测序数据来鉴定可变剪接事件和相应蛋白质产物。尽管RNA水平的可变剪接研究已经取得了一定的进展,但是对于剪接蛋白亚型的检测和区分仍然不足。本文综述了近年来对可变剪接及其生物功能,可变剪接在不同水平检测的研究进展,详细介绍利用“自下而上”的蛋白质组学数据鉴定可变剪接蛋白变体的两种主要方法,序列数据库构建和蛋白质鉴定算法开发。鉴定不同的可变剪接蛋白质有利于我们认识更全面的蛋白质的功能,对发现相关生物标志物和关键药物靶点具有重要意义。

    Abstract:

    Alternative splicing is an important regulatory mechanism in organisms, influencing the expression of genes involved in processes such as drug metabolism, pathway activation, and apoptosis. It refers to the process of removing introns from precursor mRNA and joining the remaining exons to produce mature mRNA. During this process, different combinations of exons can result in multiple mature mRNAs. This process is known as alternative splicing. Alternative splicing allows the same gene to produce different transcript variants and protein isoforms, increasing protein diversity and functional complexity. Transcriptomics and proteomics are two main approaches for identifying alternative splicing events. Transcriptomics identifies alternative splicing by analyzing differences between RNA sequencing data and reference sequences in databases. This method relies on the development of modern sequencing technologies. It also depends on increasingly improved splicing identification algorithms. Examples of these algorithms include alignment mapping and sequencing data quality control. The other approach is proteomic data analysis, which identifies corresponding protein products. We consider alternative splicing events more meaningful when they can be detected at the protein level. Alternative splicing proteoforms can be identified using bottom-up proteomics based on mass spectrometry. Due to the high sequence similarity between these alternative splicing proteoforms, general proteomic data analysis pipelines do not achieve good discrimination between them. To improve the identification of proteoforms and obtain differentiation information for different isoforms in proteomic data, two strategies have been developed for improving data processing, as shown in the figure: the construction of special databases and targeted identification algorithms. We believe that this potential protein isoform information may play a crucial role in life science research. In terms of databases, it is not enough to only use ordinary public databases for searching. To ensure the discovery of as many isoforms as possible, the method of constructing sample-specific databases assisted by RNA sequencing data has been widely used, which can increase the probability of detecting proteoforms. Another key strategy is the improvement of protein identification algorithms. Traditional identification algorithms often struggle to distinguish between highly similar or mutually inclusive proteoforms. To address the complex identification of alternative splicing proteoforms, several inference algorithms have been developed, which are combined with existing search engines to better characterize and detect alternative splicing proteoforms. These include peptide grouping (PeptideClassifier, SEPepQuant, GpGrouper), peptide quantitative correlation (PQPQ, PeCorA, COPF, SpliceVista), machine learning (IsoSVM, Re-Fraction, LibSVM), and major splice isoform theory (ASV-ID). Such methods have shown promising results in focusing on alternative splicing proteoforms. When using these algorithms, we should try different ones based on actual situations. Additionally, the performance of these algorithms is limited by the quality of the input data. To ensure reliable identification, it is also essential to perform proper peptide identification and quality control at the front end. In general, the detection and differentiation of spliced protein isoforms are still inadequate, requiring continued attention. This article reviews recent research progress on alternative splicing and its biological functions, as well as the detection of alternative splicing at different levels, and introduces the main methods for identifying alternative splicing proteoforms using bottom-up proteomic data. Identifying different alternative splicing proteoforms helps us understand the comprehensive functions of proteins and is of great significance for discovering related biomarkers and key drug targets.

    参考文献
    相似文献
    引证文献
引用本文

吴怡颖,孔德志,张炜.蛋白质组学数据揭示可变剪接蛋白质变体[J].生物化学与生物物理进展,,():

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-03-18
  • 最后修改日期:2024-07-01
  • 接受日期:2024-07-01
  • 在线发布日期:
  • 出版日期: