TY - JOUR
T1 - A machine-learning-guided mutagenesis platform for accelerated discovery of novel functional proteins
AU - Saito, Yutaka
AU - Oikawa, Misaki
AU - Nakazawa, Hikaru
AU - Niide, Teppei
AU - Kameda, Tomoshi
AU - Tsuda, Koji
AU - Umetsu, Mitsuo
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2018/3/16
Y1 - 2018/3/16
N2 - Molecular evolution based on mutagenesis is widely used in protein engineering. However, optimal proteins are often difficult to obtain due to a large sequence space that requires high costs for screening experiments. Here, we propose a novel approach that combines molecular evolution with machine learning. In this approach, we conduct two rounds of mutagenesis where an initial library of protein variants is used to train a machine-learning model to guide mutagenesis for the second-round library. This enables to prepare a small library suited for screening experiments with high enrichment of functional proteins. We demonstrated a proof-of-concept of our approach by altering the reference green fluorescent protein (GFP) so that its fluorescence is changed to yellow while improving its fluorescence intensity. Using 155 and 78 variants for the initial and the second-round libraries, respectively, we successfully obtained a number of proteins showing yellow fluorescence, 12 of which had better fluorescence performance than the reference yellow fluorescent protein (YFP). These results show the potential of our approach as a powerful platform for accelerated discovery of functional proteins.
AB - Molecular evolution based on mutagenesis is widely used in protein engineering. However, optimal proteins are often difficult to obtain due to a large sequence space that requires high costs for screening experiments. Here, we propose a novel approach that combines molecular evolution with machine learning. In this approach, we conduct two rounds of mutagenesis where an initial library of protein variants is used to train a machine-learning model to guide mutagenesis for the second-round library. This enables to prepare a small library suited for screening experiments with high enrichment of functional proteins. We demonstrated a proof-of-concept of our approach by altering the reference green fluorescent protein (GFP) so that its fluorescence is changed to yellow while improving its fluorescence intensity. Using 155 and 78 variants for the initial and the second-round libraries, respectively, we successfully obtained a number of proteins showing yellow fluorescence, 12 of which had better fluorescence performance than the reference yellow fluorescent protein (YFP). These results show the potential of our approach as a powerful platform for accelerated discovery of functional proteins.
UR - http://www.scopus.com/inward/record.url?scp=85095653491&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095653491&partnerID=8YFLogxK
U2 - 10.1101/282996
DO - 10.1101/282996
M3 - Article
AN - SCOPUS:85095653491
JO - [No source information available]
JF - [No source information available]
ER -