krishnanlab/txt2onto
Code for classifying unstructured text to tissue ontology terms using natural language processing and machine learning.
This tool helps scientists and researchers systematically categorize genomic samples based on their descriptive text, even when the descriptions are inconsistent or non-standard. You input raw, unstructured text descriptions of biological samples, and it outputs standardized tissue and cell-type annotations from a structured ontology like UBERON, along with prediction probabilities. It's designed for anyone managing or analyzing large datasets of human genomic samples.
No commits in the last 6 months.
Use this if you need to standardize and enrich your genomic sample metadata with precise tissue and cell-type annotations from free-text descriptions.
Not ideal if your samples lack any descriptive text metadata or if you require annotations for tissues/cell types not covered by the 346 UBERON terms included.
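To make the input and output shape described above concrete, here is a toy sketch: free text goes in, and a score per UBERON term comes out. It is not txt2onto's model or API. The keyword lists, the `predict` function, and the softmax scoring are all hypothetical stand-ins chosen only for illustration.

```python
import math
import re

# Hypothetical keyword lists keyed by UBERON ID. Illustrative only:
# these are not the features or the term set used by txt2onto.
TERM_KEYWORDS = {
    "UBERON:0002107": {"liver", "hepatic", "hepatocyte"},  # liver
    "UBERON:0000955": {"brain", "cortex", "hippocampus"},  # brain
    "UBERON:0000178": {"blood", "pbmc", "plasma"},         # blood
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def predict(text):
    """Return {term_id: score}; scores are softmax-normalized keyword hit counts."""
    tokens = tokenize(text)
    counts = {
        term: sum(tok in keywords for tok in tokens)
        for term, keywords in TERM_KEYWORDS.items()
    }
    exps = {term: math.exp(count) for term, count in counts.items()}
    total = sum(exps.values())
    return {term: value / total for term, value in exps.items()}


# An unstructured sample description, as it might appear in sample metadata.
scores = predict("Primary hepatocytes isolated from adult human liver biopsy")
best = max(scores, key=scores.get)
print(best, round(scores[best], 3))
```

A real classifier would learn its weights from labeled sample descriptions rather than rely on hand-written keywords; the sketch only shows the text-in, term-plus-score-out contract.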
Stars: 27
Forks: 7
Language: Python
License: BSD-3-Clause
Category: (not shown)
Last pushed: Aug 21, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/krishnanlab/txt2onto"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
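The curl request above can also be issued from Python with the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the path follows a category/owner/repo pattern (inferred from the single example URL) and that the response body is JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the path pattern is inferred from one example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """GET the endpoint and decode the body as JSON (assumed format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


url = quality_url("nlp", "krishnanlab", "txt2onto")
print(url)
# Uncomment to perform the request; it counts against the daily quota.
# data = fetch_quality("nlp", "krishnanlab", "txt2onto")
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the 100 keyless requests per day.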
Higher-rated alternatives
GaoQ1/rasa_nlu_gq
Turn natural language into structured data (supports Chinese; includes N custom models for different scenarios and tasks)
yongzhuo/nlp_xiaojiang
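Natural language processing (NLP): Xiaojiang bot (retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet...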
textgain/grasp
Essential NLP & ML, short & fast pure Python code
boat-group/fancy-nlp
NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your...
changwookjun/nlp-paper
NLP Paper