
PMCID

PMC12871374

Machine learning enables efficient and effective affinity maturation of nanobodies.

bioRxiv : the preprint server for biology

Authors	Steffanie Paul, Edward Harvey, James Osei-Owusu, Aaron Kollasch, Adam Riesselman, Conor McMahon, Artem Gazizov, Murali Anuganti, Filmawit Belay, Minh Kieu, Haisun Zhu, Robert Hollingsworth, Wade Harper, Deborah Moshinsky, Andre Teixeira, Debora Marks, Andrew Kruse
Abstract	Antibodies can bind their targets with exquisite potency and selectivity, due in part to the large surface areas of antibody-target protein-protein interactions. Despite the very large size and diversity of synthetic libraries, sorting alone tends to yield binders with modest affinities. By analogy to affinity maturation in the natural immune system, these initial hits are typically affinity-matured to achieve high-affinity binding. However, affinity maturation campaigns can be laborious, often requiring multiple selection rounds and strategies for each clone to be optimized. Here, we investigated whether one could accelerate the discovery of optimized binders using machine learning on sequencing data from single selection sorts of affinity maturation yeast-display campaigns. Our results show that sparse sequencing data from a single sorting round can predict sequences that are enriched after multiple rounds. We also find that linear models outperform deep neural networks and semi-supervised approaches in ranking validated affinity-enhancing substitutions. Linear models are also more interpretable, offering insights into residue preferences that can be leveraged for further engineering. We use our models to design and select optimized nanobody binders to relaxin family peptide receptor 1 (RXFP1), yielding multiple improved binders, including three sub-nanomolar binders, the best of which exhibits a ~2500-fold improvement over the wild type (WT).
Year of Publication	2026
Journal	bioRxiv : the preprint server for biology
Date Published	01/2026
ISSN	2692-8205
DOI	10.64898/2026.01.11.698911
PubMed ID	41648218
