Alignment-free comparison is a recently developed approach to sequence comparison that is computationally efficient and well suited to sequences with low identity. It has been applied successfully to DNA analysis. When applied to proteins, however, its accuracy is lower, because protein sequences are built from 20 residue types and are therefore more complex than DNA sequences. To address this, residues are clustered into a small number of groups according to the similarity of their physicochemical properties. Such simplified alphabets reduce the complexity of protein sequences while retaining the key information they encode, and thereby improve the accuracy of alignment-free comparison.
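As a rough illustration of the idea, the following Python sketch maps a protein sequence onto a reduced alphabet and then compares two sequences by their k-mer (word) frequencies without aligning them. The six-group clustering, the function names, the choice of k = 2, and the cosine distance are all assumptions made for this example; they are not the grouping or the distance measure used in the paper.

```python
from collections import Counter
from math import sqrt

# Hypothetical reduced alphabet: each key is the representative letter for a
# group of residues with broadly similar physicochemical properties.
# This grouping is an illustrative assumption, not the one from the paper.
GROUPS = {
    "A": "AVLIMC",  # aliphatic / hydrophobic
    "F": "FWYH",    # aromatic
    "S": "STNQ",    # polar, uncharged
    "K": "KR",      # positively charged
    "D": "DE",      # negatively charged
    "G": "GP",      # small / conformationally special
}

# Lookup table from each of the 20 residues to its group representative.
REDUCE = {aa: rep for rep, members in GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Rewrite a protein sequence in the reduced alphabet ('X' for unknowns)."""
    return "".join(REDUCE.get(aa, "X") for aa in seq.upper())


def kmer_counts(seq, k=2):
    """Count all overlapping words of length k in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cosine_distance(a, b):
    """1 - cosine similarity between two k-mer count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    if norm == 0:
        return 1.0  # at least one sequence is shorter than k
    return 1.0 - dot / norm


def alignment_free_distance(seq1, seq2, k=2):
    """Compare two proteins by k-mer composition over the reduced alphabet."""
    return cosine_distance(
        kmer_counts(reduce_sequence(seq1), k),
        kmer_counts(reduce_sequence(seq2), k),
    )
```

With this sketch, two sequences that differ only by substitutions within a group, such as "AVLK" and "VLIR", reduce to the same string and therefore have a distance of approximately zero, which is the sense in which a simplified alphabet lowers sequence complexity while keeping the coarse physicochemical pattern.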
LI Jing, LI Feng-Bo, WANG Wei. Simplification of Protein Sequence and Alignment-free Sequence Analysis[J]. Progress in Biochemistry and Biophysics,2006,33(12):1215-1222