1)电子科技大学生命科学与技术学院,信息生物学研究中心,成都 611731;2)首都医科大学附属北京友谊医院药学部,北京 100050;3)北京市临床药学研究所,北京 100035
国家自然科学基金(82130112)资助项目。
1)School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 611731, China;2)Department of Pharmacy, Beijing Friendship Hospital, Capital Medical University, Beijing 100050, China;3)Beijing Institute of Clinical Pharmacy, Beijing 100035, China
This work was supported by a grant from The National Nature Scientific Foundation of China (82130112).
核酸适配体(nucleic acid aptamer)是一类能够高效、特异识别蛋白质、小分子和完整细胞等多种靶标的单链寡核苷酸分子,凭借其分子质量小、化学与热稳定性强、易于修饰且免疫原性低等优势,在诊断、治疗与分析检测领域展现出广阔的应用前景。本文首先对现有核酸适配体数据库进行了系统梳理,随后概述了高通量SELEX数据的模拟筛选流程;接着综述了从序列特征提取到二级和三级结构预测的各类生物信息学方法;然后评述了分子对接、分子动力学及结合自由能计算结合机器学习与深度学习技术在核酸适配体-靶标相互作用预测中的应用;最后总结了基于理性设计、生成式AI与进化算法的计算设计策略,并列举了精准诊断、生物传感和食品安全等典型案例。通过多层次、多方法的整合,本文为核酸适配体的发现、设计与应用提供了一套全面的生物信息学工具与思路,并对未来智能闭环和多模态融合的研究方向进行了 展望。
Nucleic acid aptamers represent a class of single-stranded oligonucleotides capable of high-affinity and specific binding to diverse targets, including proteins, small molecules, cells, and metal ions. Their advantages over antibodies—such as simpler synthesis, lower immunogenicity, superior stability, and easier modification—have positioned them as powerful tools in therapeutics, diagnostics, and biosensing. This review systematically surveys the integral role of bioinformatics and artificial intelligence (AI) in modern aptamer development, spanning from in silico selection and structural prediction to the generative design of novel aptamer sequences. The application of high-throughput SELEX (HT-SELEX) has greatly accelerated the discovery of aptamers, but also introduced computational challenges in processing large-scale sequencing data. Bioinformatics pipelines now routinely include tools like AptaPLEX and AptaSuite for preprocessing raw reads, including demultiplexing, adapter trimming, and quality filtering. Subsequent analytical steps involve clustering-based tools (e.g., FASTAptamer, AptaCLUSTER) to identify enriched sequences, and motif discovery algorithms (such as AptaTRACE and MPBind) that uncover conserved sequence-structure patterns associated with binding functionality. These approaches allow researchers to move beyond manual curation and extract meaningful candidates from complex selection rounds. Accurate prediction of secondary and tertiary structures is essential for understanding aptamer function and interaction mechanisms. Conventional tools, including RNAfold and Mfold, employ thermodynamics-based models to predict RNA folding, yet often struggle with pseudoknots and non-canonical pairs. Recent advances in deep learning—exemplified by SPOT-RNA, E2Efold, and UFold—have significantly improved prediction accuracy by leveraging neural networks trained on large structural datasets. For tertiary structure, methods range from fragment assembly (Rosetta FARFAR2) and homology modeling (RNAComposer) to deep learning-aided approaches such as AlphaFold-RNA and RoseTTAFoldNA. While these tools offer new insights, predicting structures for short, flexible aptamers remains non-trivial. Predicting aptamer–target interactions draws on both physics-based and data-driven approaches. Molecular docking programs—AutoDock Vina, ZDOCK, and MDockPP—provide initial binding poses, which can be refined using molecular dynamics simulations (with GROMACS, AMBER, or NAMD) and free energy perturbation techniques to estimate binding affinity. Complementarily, machine learning models are increasingly employed to predict interactions from sequence and structural features. Early efforts used hand-engineered features with classifiers like SVM and random forest, while contemporary deep learning models (AptaNet, AptaBERT, PAIR) utilize pre-trained language models to capture intricate sequence-binding relationships with superior generalization. Perhaps the most transformative development is the use of generative AI for de novo aptamer design. Conditional variational autoencoders (e.g., RaptGen), generative adversarial networks (e.g., AptaDesigner), and diffusion models (e.g., AptaDiff) can generate novel aptamer sequences conditioned on target properties or desired binding affinities. Reinforcement learning and evolutionary algorithms, including Monte Carlo tree search (Apta-MCTS) and NSGA-II, support multi-objective optimization toward high specificity, stability, and low immunogenicity. These approaches mark a paradigm shift from selective discovery to intentional design, greatly expanding the functional sequence space. Aptamers designed via these computational strategies are increasingly applied in biomedical and environmental fields, including as targeted therapeutics, diagnostic biosensors, and agents in food safety monitoring. Nonetheless, key challenges persist: data scarcity and heterogeneity, model interpretability, and experimental validation bottlenecks. Future progress will depend on standardized data sharing, improved explainable AI, and the integration of computational design with high-throughput experimental screening—ultimately enabling robust, clinically viable aptamer technologies.
刘上华,张洪旗,刘入铭,曾红娟,邓科君,鄢丹,汤丽霞,林昊.面向核酸适配体的人工智能方法与应用[J].生物化学与生物物理进展,,():
复制

扫码关注 生物化学与生物物理进展 ® 2025 网站版权 ICP:京ICP备05023138号-1 京公网安备 11010502031771号
