Publications
bioRxivJan 2023 DOI:
10.1101/2023.01.30.526201

Meta Learning Improves Robustness and Performance in Machine Learning-Guided Protein Engineering

Minot, Mason; Reddy, Sai T.
Product Used
Genes
Abstract
Machine learning-guided protein engineering continues to rapidly progress, however, collecting large, well-labeled data sets remains time and resource intensive. Directed evolution and protein engineering studies often require extensive experimental processes to eliminate noise and fully label high-throughput protein sequence-function data. Meta learning methods established in other fields (e.g. computer vision and natural language processing) have proven effective in learning from noisy data, given the availability of a small data set with trusted labels and thus could be applied for protein engineering. Here, we generate yeast display antibody mutagenesis libraries and screen them for target antigen binding followed by deep sequencing. Meta learning approaches are able to learn under high synthetic and experimental noise as well as in under labeled data settings, typically outperforming baselines significantly and often requiring a fraction of the training data. Thus, we demonstrate meta learning may expedite and improve machine learning-guided protein engineering.Availability and implementationThe code used in this study is publicly available athttps://github.com/LSSI-ETH/meta-learning-for-protein-engineering.Graphical Abstract
Product Used
Genes

Related Publications