TY - JOUR
T1 - Protein-ligand binding affinity prediction based on profiles of intermolecular contacts
AU - Wang, Debby D.
AU - Chan, Moon Tong
N1 - Publisher Copyright:
© 2022 The Author(s)
PY - 2022/1
Y1 - 2022/1
N2 - As a key element in structure-based drug design, binding affinity prediction (BAP) for putative protein-ligand complexes can be efficiently achieved by the incorporation of structural descriptors and machine-learning models. However, developing concise descriptors that will lead to accurate and interpretable BAP remains a difficult problem in this field. Herein, we introduce the profiles of intermolecular contacts (IMCPs) as descriptors for machine-learning-based BAP. IMCPs describe each group of protein-ligand contacts by the count and average distance of the group members, and collaborate closely with classical machine-learning models. Performed on multiple validation sets, IMCP-based models often result in better BAP accuracy than those originating from other similar descriptors. Additionally, IMCPs are simple and concise, and easy to interpret in model training. These descriptors highly conclude the structural information of protein-ligand complexes and can be easily updated with personalized profile features. IMCPs have been implemented in the BAP Toolkit on github ( https://github.com/debbydanwang/BAP).
AB - As a key element in structure-based drug design, binding affinity prediction (BAP) for putative protein-ligand complexes can be efficiently achieved by the incorporation of structural descriptors and machine-learning models. However, developing concise descriptors that will lead to accurate and interpretable BAP remains a difficult problem in this field. Herein, we introduce the profiles of intermolecular contacts (IMCPs) as descriptors for machine-learning-based BAP. IMCPs describe each group of protein-ligand contacts by the count and average distance of the group members, and collaborate closely with classical machine-learning models. Performed on multiple validation sets, IMCP-based models often result in better BAP accuracy than those originating from other similar descriptors. Additionally, IMCPs are simple and concise, and easy to interpret in model training. These descriptors highly conclude the structural information of protein-ligand complexes and can be easily updated with personalized profile features. IMCPs have been implemented in the BAP Toolkit on github ( https://github.com/debbydanwang/BAP).
KW - Computer-aided drug design
KW - Intermolecular contact profiles
KW - Machine learning
KW - Protein-ligand binding affinity
KW - Scoring function
UR - http://www.scopus.com/inward/record.url?scp=85125588178&partnerID=8YFLogxK
U2 - 10.1016/j.csbj.2022.02.004
DO - 10.1016/j.csbj.2022.02.004
M3 - Article
AN - SCOPUS:85125588178
VL - 20
SP - 1088
EP - 1096
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -