Title: "Engineering Synthetic Phage Display Libraries with Machine Learning-Optimized Diversity for High-Affinity Antibody Discovery" Journal: Nature Biotechnology (2023) Authors: Chen et al.
Objective
This study aimed to improve phage display library construction by integrating machine learning (ML) to optimize combinatorial diversity in antibody variable regions, enhancing the likelihood of discovering high-affinity binders.
Methods
Library Design Framework:
Computational CDR Optimization: ML models (trained on structural and binding data from the SAbDab database) predicted favorable amino acid combinations in complementarity-determining regions (CDRs).
Synthetic Gene Synthesis: Oligonucleotides encoding diversified CDRs were synthesized using chip-based DNA synthesis, focusing on H3 and L3 loops for maximal antigen interaction.
Scaffold Stability: Framework regions were fixed to human germline sequences (IGHV3-23/IGKV1-39) to ensure proper folding and reduce immunogenicity.
Library Assembly:
Phagemid Vector: A modified pComb3X vector with a dual promoter system (T7/lacZ) improved scFv expression and phage packaging efficiency.
Electroporation Efficiency: High-efficiency E. coli SS320 cells achieved a library size of 1.2 × 10¹² unique clones, surpassing traditional methods (~10¹¹).
Validation:
NGS Analysis: Next-generation sequencing confirmed >90% library completeness and minimal redundancy.
Panning Against Diverse Targets: Tested against 8 antigens (e.g., IL-17A, PD-L1) to benchmark performance.
Key Results
Enhanced Affinity:
Isolated scFvs with picomolar affinity (KD ≤ 100 pM) for IL-17A and PD-L1, outperforming antibodies from conventional libraries.
70% of selected clones showed functional activity in cell-based assays (vs. 30–40% in traditional libraries).
IL-17A and PD-L1 related products
IL-17A
PD-L1
| Cat# | Product Name | Species | Host | Applications | Size | Price |
| KMPH1193 | Human PDL1/CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH1194 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH1195 | Human CD274/PDL1 Protein, His Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| MOP1030 | Mouse CD274/PDL1 Protein, His & Fc Tag | Mouse | HEK293 Cells | 100ug, 200ug | Inquiry | |
| MOP1031 | Mouse CD274/PDL1 Protein, His Tag | Mouse | HEK293 Cells | 100ug, 200ug | Inquiry | |
| PM299 | Cynomolgus CD274/PDL1 Protein, Fc Tag | Cynomolgus | HEK293 Cells | 100ug, 200ug | Inquiry | |
| PM300 | Cynomolgus CD274/PDL1 Protein, His Tag | Cynomolgus | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5881 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5882 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5883 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5884 | Human CD274/PDL1 Protein, Strep Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5885 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5886 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5887 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5888 | Human CD274/PDL1 Protein, His & Fc & Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5889 | Human CD274/PDL1 Protein, Fc & Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5890 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5891 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5892 | Human CD274/PDL1 Protein, Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5929 | Human CD274/PDL1 Protein, His & Fc & Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5930 | Human CD274/PDL1 Protein, Fc & Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH5931 | Human CD274/PDL1 Protein, Avi Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| KMPH6573 | Human CD274/PDL1 Protein, Fc Tag | Human | HEK293 Cells | 100ug, 200ug | Inquiry | |
| PAV5590 | Rabbit Anti-CCDC99/SPDL1 Polyclonal Antibody | Human | Rabbit | WB, ELISA | 100ul | Inquiry |
| PAV5715 | Rabbit Anti-CD274 Polyclonal Antibody | Human | Rabbit | IHC-p, ELISA | 100ul | Inquiry |
| MOP2061 | Recombinant Mouse Cd274/Pdl1/B7h1 Protein, His Tag | Mouse | Mammalian cells | 50ug, 100ug | Inquiry |
Diversity Metrics:
ML-guided CDR diversification increased functional sequence space by 3-fold compared to random mutagenesis.
Identified rare paratopes (e.g., a β-hairpin motif in H3) not commonly seen in natural repertoires.
Speed and Scalability:
Library construction time reduced from 6 months (traditional) to 4 weeks via automated gene synthesis and cloning.
Strengths
- ML-Driven Design: Predictive algorithms minimized non-functional CDR combinations, reducing “junk” sequences.
- Unprecedented Size and Quality: The 1.2 × 10¹² library size with high diversity sets a new benchmark.
- Broad Applicability: Validated across multiple targets, including hard-to-bind epitopes (e.g., flat protein surfaces).
Weaknesses
- Computational Bias: ML models trained on existing data may overlook novel, unconventional epitope-binding motifs.
- Cost and Complexity: High-throughput DNA synthesis and ML infrastructure limit accessibility for smaller labs.
- In Vivo Validation Pending: No animal data to confirm therapeutic efficacy of isolated antibodies.
Significance and Innovations
- Paradigm Shift in Library Design: Moves beyond random diversity to in silico-guided rational design, maximizing functional output.
- Implications for Drug Discovery: Accelerates development of antibodies for undruggable targets (e.g., GPCRs, ion channels).
- Synergy with Other Technologies: Compatible with ribosome display and yeast display for multi-platform screening.
Comparison to Prior Work
Traditional libraries (naïve/synthetic) rely on random diversity, often yielding low-affinity hits requiring extensive affinity maturation. Chen et al.’s ML approach pre-optimizes CDRs, mimicking natural antibody maturation in silico. This contrasts with earlier work like Sidhu et al. (2004), which emphasized randomization without predictive modeling.
Future Directions
Integration with Single-Cell Sequencing: Combine ML libraries with B-cell receptor sequencing from immunized donors.
Cell-Free Systems: Use in vitro transcription/translation for even faster library generation.
Clinical Translation: Test top hits (e.g., anti-PD-L1 scFvs) in oncology trials.
Conclusion
Chen et al. redefine phage display library construction by merging synthetic biology with machine learning, achieving unprecedented diversity and affinity. While computational and cost barriers exist, their work paves the way for next-generation antibody discovery pipelines, particularly for challenging therapeutic targets.
