Unraveling dynamic protein structures by two- dimensional infrared spectra with a pretrained machine learning modelWu, F (Wu, Fan); Huang, Y (Huang, Yan); Yang, GK (Yang, Guokun); Ye, S (Ye, Sheng); Mukamel, S (Mukamel, Shaul); Jiang, J (Jiang, Jun)
Proceedings of the National Academy of Sciences of the United States of America, 2024, Volume 121, 2409257121.
Dynamic protein structures are crucial for deciphering their diverse biological functions. Two- dimensional infrared (2DIR) spectroscopy stands as an ideal tool for tracing rapid conformational evolutions in proteins. However, linking spectral characteristics to dynamic structures poses a formidable challenge. Here, we present a pretrained machine learning model based on 2DIR spectra analysis. This model has learned signal features from approximately 204,300 spectra to establish a "spectrum- structure" correlation, thereby tracing the dynamic conformations of proteins. It excels in accurately predicting the dynamic content changes of various secondary structures and demonstrates universal transferability on real folding trajectories spanning timescales from microseconds to milliseconds. Beyond exceptional predictive performance, the model offers attention - based spectral explanations of dynamic conformational changes. Our 2DIR-based pretrained model is anticipated to provide unique insights into the dynamic structural information of proteins in their native environments.
|