High-accuracy Splice Site Prediction Based on Statistical Difference Table and Weighted Voting
CSTR:
Author:
Affiliation:

1.Hunan Engineering & Technology Research Center for Agricultural Big Data Analysis & Decision-making, Hunan Agricultural University, Changsha, 410128, China;2.Orient Science &Technology College, Hunan Agricultural University, Changsha, 410128, China

Clc Number:

Fund Project:

This work was supported by grants from The National Natural Science Foundation of China (61701177), Hunan Provincial Natural Science Foundation of China (2018JJ3225) and Scientific Research Project of Hunan Province Education Office (17A096).

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    High-accuracy splice site recognition based on machine learning is the key to eukaryotic genome annotation. In this paper, we used chi-square test to determine the window size of sequences, and constructed a chi-square statistical difference table to extract the positional features, and combined with the frequencies of dinucleotides to characterize sequences. For the problem that the positive and negative samples of splice sites are extremely imbalanced, 10 SVM classifiers based on the equal proportion of positive and negative samples were built for weighted voting, which effectively solved the imbalanced pattern classification problem. Independent testing results in HS3D dataset showed that the prediction accuracy of donor and acceptor sites were 93.39% and 90.46% respectively, obviously higher than that of the compared methods. The positional features based on the chi-square statistical difference table can effectively characterize DNA sequences, and have application prospects in signal site recognition of molecular sequences.

    Reference
    Related
    Cited by
Get Citation

ZENG Ying, CHEN Yuan, YUAN Zhe-Ming. High-accuracy Splice Site Prediction Based on Statistical Difference Table and Weighted Voting[J]. Progress in Biochemistry and Biophysics,2019,46(5):496-503

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:October 15,2018
  • Revised:March 21,2019
  • Adopted:March 25,2019
  • Online: May 22,2019
  • Published: May 20,2019
Article QR Code