zjunlp/OntoProtein
[ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding
This project helps biological researchers and computational biologists analyze protein sequences by incorporating the comprehensive knowledge from Gene Ontology (GO). It takes raw protein sequence data and GO definitions as input to generate an enhanced protein language model. This model outputs improved predictions for various protein-related tasks, such as protein function prediction, secondary structure prediction, and contact prediction, making it valuable for scientists working with protein data.
151 stars. No commits in the last 6 months.
Use this if you need more accurate predictions for protein functions, structures, or interactions by leveraging detailed biological knowledge during protein sequence analysis.
Not ideal if you are looking for a simple protein sequence alignment tool or a solution that does not require incorporating complex biological ontology data.
Stars
151
Forks
22
Language
Python
License
MIT
Category
Last pushed
Mar 10, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zjunlp/OntoProtein"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ziqizhang/jate
JATE - Just Automatic Term Extraction (in Python)
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability...
mmmaurer/elfen
A python package to efficiently extract linguistic features for text/NLP datasets
strangetom/ingredient-parser
A tool to parse recipe ingredients into structured data