Predicting complexation performance between cyclodextrins and guest molecules by integrated machine learning and molecular modeling techniques.
10.1016/j.apsb.2019.04.004
- Author:
Qianqian ZHAO
1
;
Zhuyifan YE
1
;
Yan SU
1
;
Defang OUYANG
1
Author Information
1. State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China.
- Publication Type:Journal Article
- Keywords:
Binding free energy;
Cyclodextrin;
Deep learning;
Ketoprofen;
LightGBM;
Machine learning;
Molecular modeling;
Random forest
- From:
Acta Pharmaceutica Sinica B
2019;9(6):1241-1252
- CountryChina
- Language:English
-
Abstract:
Most pharmaceutical formulation developments are complex and ideal formulations are generally obtained after extensive experimentation. Machine learning is increasingly advancing many aspects in modern society and has achieved significant success in multiple subjects. Current research demonstrated that machine learning can be adopted to build up high-accurate predictive models in drugs/cyclodextrins (CDs) systems. Molecular descriptors of compounds and experimental conditions were employed as inputs, while complexation free energy as outputs. Results showed that the light gradient boosting machine provided significantly improved predictive performance over random forest and deep learning. The mean absolute error was 1.38 kJ/mol and squared correlation coefficient was 0.86. The evaluation of relative importance of molecular descriptors further demonstrated the key factors affecting molecular interactions in drugs/CD systems. In the specific ketoprofen-CD systems, machine learning model showed better predictive performance than molecular modeling calculation, while molecular simulation could provide structural, dynamic and energetic information. The integration of machine learning and molecular simulation could produce synergistic effect for interpreting and predicting pharmaceutical formulations. In conclusion, the developed predictive models were able to quickly and accurately predict the solubilizing capacity of CD systems. Current research has taken an important step toward the application of machine learning in pharmaceutical formulation design.