Protocol to use protein language models predicting and following experimental validation of function-enhancing variants of thymine-N-glycosylase

oleh: Yan He, Xibin Zhou, Fajie Yuan, Xing Chang

Format: Article
Diterbitkan: Elsevier 2024-09-01

Deskripsi

Summary: Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for “zero-shot” enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis. This protocol holds promise for streamlining the engineering of gene editing tools, delivering improved activity while minimizing the experimental workload.For complete details on the use and execution of this protocol, please refer to He et al.1 : Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics.