ThermalProGAN: A sequence-based thermally stable protein generator trained using unpaired data.

Huang HL, Weng CH, Nordling TEM, Liou YF

J Bioinform Comput Biol 21 (1) 2350008 [2023-02-00; online 2023-03-31]

The synthesis of proteins with novel desired properties is challenging but sought after by the industry and academia. The dominating approach is based on trial-and-error inducing point mutations, assisted by structural information or predictive models built with paired data that are difficult to collect. This study proposes a sequence-based unpaired-sample of novel protein inventor (SUNI) to build ThermalProGAN for generating thermally stable proteins based on sequence information. The ThermalProGAN can strongly mutate the input sequence with a median number of 32 residues. A known normal protein, 1RG0, was used to generate a thermally stable form by mutating 51 residues. After superimposing the two structures, high similarity is shown, indicating that the basic function would be conserved. Eighty four molecular dynamics simulation results of 1RG0 and the COVID-19 vaccine candidates with a total simulation time of 840[Formula: see text]ns indicate that the thermal stability increased. This proof of concept demonstrated that transfer of a desired protein property from one set of proteins is feasible. Availability and implementation: The source code of ThermalProGAN can be freely accessed at with an MIT license. The website is Supplementary information: Supplementary data are available on Github.

Category: Health

Type: Journal article

PubMed 36999645

DOI 10.1142/S0219720023500087

Crossref 10.1142/S0219720023500087

Publications 9.2.2