omarperacha/ps4-dataset

The largest open-source dataset for Protein Single Sequence Secondary Structure prediction.

Score: 27 / 100 (Experimental)

This dataset and toolkit help protein scientists and bioinformaticians develop and evaluate models that predict a protein's secondary structure from its amino acid sequence alone. A model takes an amino acid sequence as input and assigns each residue a secondary-structure class (alpha-helix, beta-sheet, etc.). It is used by researchers working on protein function and drug discovery.
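As a rough illustration of what evaluating such a model involves, the sketch below computes per-residue accuracy between a predicted and a reference secondary-structure string. The function name, the example sequence, and the one-letter labels (H for helix, E for strand, C for coil) are assumptions made for illustration; this page does not state the dataset's file format or label set.

```python
def per_residue_accuracy(predicted: str, reference: str) -> float:
    """Fraction of residues whose predicted class matches the reference.

    Both arguments are strings with one label per residue. The label
    alphabet is an assumption here, not taken from the dataset.
    """
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("cannot score an empty sequence")
    matches = sum(p == r for p, r in zip(predicted, reference))
    return matches / len(reference)


# Hypothetical example: one label per amino acid in the sequence.
sequence = "MKTAYIAK"   # made-up amino acid sequence
reference = "CHHHHEEC"  # made-up reference labels
predicted = "CHHHEEEC"  # made-up model output

score = per_residue_accuracy(predicted, reference)
print(f"{score:.3f}")
```

Here 7 of the 8 positions agree, so the score is 0.875. Real evaluations typically average this over many proteins.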

No commits in the last 6 months.

Use this if you are a researcher or bioinformatician building or improving computational models for predicting protein secondary structure.

Not ideal if you simply want to predict secondary structure for a few sequences without developing your own model; for that, use the provided Hugging Face Space.

protein-science bioinformatics structural-biology drug-discovery protein-modeling
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 6 / 25


Stars: 14
Forks: 1
Language: Python
License: CC0-1.0
Last pushed: Dec 29, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/omarperacha/ps4-dataset"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
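The same request can be made from Python with only the standard library. This is a minimal sketch: the helper names are hypothetical, the URL pattern is inferred from the single curl example above, and because the response schema is not documented on this page the body is returned as raw text rather than parsed.

```python
import urllib.parse
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (pattern assumed from the one example shown)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(url: str, timeout: float = 10.0) -> str:
    """Fetch the URL and return the response body as text.

    The response format is not documented here, so no parsing is attempted.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


url = build_quality_url("ml-frameworks", "omarperacha", "ps4-dataset")
print(url)

# Uncomment to perform the request (counts against the daily limit):
# print(fetch_quality(url))
```

Without a key, requests are limited to 100 per day, so cache the response if you poll from a script.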