daisybio/data-leakage-ppi-prediction

Code associated with the paper 'Cracking the blackbox of deep sequence-based protein-protein interaction prediction'

27
/ 100
Experimental

This project critically evaluates existing computational methods for predicting protein-protein interactions (PPIs) based on protein sequences. It takes established PPI datasets and protein sequence information, then re-analyzes their performance under strict conditions to reveal if their high accuracy is due to learning from true biological signals or from hidden data similarities. This tool is designed for computational biologists and biochemists who develop or assess deep learning models for PPI prediction.

No commits in the last 6 months.

Use this if you are a researcher in bioinformatics or computational biology looking to understand the limitations and potential biases in current deep sequence-based protein-protein interaction prediction models.

Not ideal if you are looking for a new, robust method to predict protein-protein interactions without a focus on critically evaluating existing models' methodologies.

protein-protein interaction bioinformatics computational biology model validation sequence analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 4 / 25

How are scores calculated?

Stars

27

Forks

1

Language

C++

License

GPL-3.0

Last pushed

Jan 08, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/daisybio/data-leakage-ppi-prediction"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.