DivyaKarade/Deep-learning-classification-based-model-for-screening-compounds-with-hERG-inhibitory-activity

Developing a Deep learning classification-based model for screening pharmaceutical compounds with hERG inhibitory activity (cardiotoxicity) and using the model to screen CAS antiviral database to identify compounds with cardiotoxicity potential. The data is derived from "Drug Discovery Hackathon 2020: PS ID: DDT2-13" (https://innovateindia.mygov.in/ddh2020/problem-statements/) Details related to the project can also be derived from: (https://youtu.be/7tqaPmYQmCM) Note: The solution for the above problem statement is solved with Deep learning classification based model instead of linear discriminant analysis model as written in the problem statement. Details of the project: In silico prediction of cardiotoxicity with high sensitivity and specificity for potential drug molecules would be of immense value. Hence, building a classification-based machine learning models, capable of efficiently predicting cardiotoxicity will be critical. A data set of diverse pharmaceutical compounds with hERG channel inhibitory activity (blocker/non-blocker) is provided. The SMILES notations of all compounds are given. The set of compounds divided into a training set and a test set using 70:30 ratios. Simple, reproducible and easily transferable classification models developed from the training set compounds using 2D descriptors. The models were validated based on the test set compounds. The models is having the following quality: Training Set: ROC AUC for training set: 0.977280 Classification accuracy for training set: 0.986058 Precision for training set: 0.993124 Sensitivity/Recall for training set: 0.990235 F1 score for training set: 0.991677 Confusion matrix: [[ 892 33] [ 47 4766]] Test set: ROC AUC for test set: 0.649767 Classification accuracy for test set: 0.813670 Precision for test set: 0.883061 Sensitivity/Recall for test set: 0.990235 F1 score for test set: 0.889050 Confusion matrix: [[ 165 243] [ 215 1835]] The best model was also used to classify CAS antiviral

27
/ 100
Experimental

This project helps pharmaceutical researchers and computational chemists quickly screen potential drug candidates for cardiotoxicity. By inputting chemical structures (SMILES notation), it predicts whether a compound is likely to inhibit the hERG channel, a common cause of heart-related side effects. The output is a classification of compounds as either hERG blockers or non-blockers, allowing for early identification of problematic molecules in drug discovery.

No commits in the last 6 months.

Use this if you need an initial, high-throughput computational screen to identify drug candidates with potential cardiotoxicity early in the discovery process.

Not ideal if you require definitive experimental validation of hERG inhibition or a detailed mechanistic understanding of a compound's cardiotoxic potential.

drug-discovery cardiotoxicity-screening medicinal-chemistry pharmacology in-silico-screening
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 6 / 25

How are scores calculated?

Stars

14

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Oct 02, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/DivyaKarade/Deep-learning-classification-based-model-for-screening-compounds-with-hERG-inhibitory-activity"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.