vam-sin/CATHe
Deep Learning tool trained on protein sequence embeddings from protein language models to accurately detect remote homologues for CATH superfamilies
This tool helps researchers in structural biology and bioinformatics identify distantly related proteins, known as remote homologues. You provide a protein sequence (in FASTA format) and it predicts which CATH superfamily it belongs to. This is ideal for scientists working on protein function, evolution, and drug discovery who need to classify proteins into structural and evolutionary families.
No commits in the last 6 months.
Use this if you need to accurately identify distant evolutionary relationships and classify novel protein sequences into CATH superfamilies, even when sequence similarity is very low.
Not ideal if you are looking for tools to analyze highly similar protein sequences or prefer traditional homology search methods like BLAST without machine learning augmentation.
Stars
15
Forks
4
Language
Python
License
MIT
Category
Last pushed
Jun 22, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/vam-sin/CATHe"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DeepRank/deeprank2
An open-source deep learning framework for data mining of protein-protein interfaces or...
sacdallago/biotrainer
Biological prediction models made simple.
jonathanking/sidechainnet
An all-atom protein structure dataset for machine learning.
a-r-j/ProteinWorkshop
Benchmarking framework for protein representation learning. Includes a large number of...
songlab-cal/tape
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised...