Lipairui/textgo
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
This tool helps data analysts and researchers prepare raw text data for analysis, convert it into numerical formats, and perform tasks like finding similar documents or automatically categorizing content. It takes unstructured text in English or Chinese, cleans it up, transforms it, and then outputs structured data representations or classification labels. Anyone working with large volumes of text for insights, search, or automation would find this useful.
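The clean, represent, and compare steps described above can be illustrated with a minimal standard-library sketch. This is not textgo's API: every function name here (`preprocess`, `bag_of_words`, `cosine_similarity`, `most_similar`) is hypothetical, the representation is a plain term-frequency count, and the tokenizer only handles English-style alphanumeric words (Chinese text would need a word segmenter first).

```python
import math
import re
from collections import Counter


def preprocess(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bag_of_words(tokens):
    """Represent a token list as a sparse term-frequency mapping."""
    return Counter(tokens)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        # An empty document has no direction, so treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, documents):
    """Return (index, score) of the document closest to the query."""
    q = bag_of_words(preprocess(query))
    scores = [cosine_similarity(q, bag_of_words(preprocess(d))) for d in documents]
    best = max(range(len(documents)), key=scores.__getitem__)
    return best, scores[best]


if __name__ == "__main__":
    docs = [
        "The cat sat on the mat.",
        "Stock prices fell sharply today.",
        "A cat and a dog played.",
    ]
    print(most_similar("cat sat on mat", docs))
```

A real pipeline would likely swap the raw counts for TF-IDF weights or embeddings, but the flow stays the same: normalize the text, map it to vectors, then rank by a similarity score.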
No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly clean, represent, compare, search, or classify text data in either English or Chinese.
Not ideal if you primarily work with highly specialized linguistic analysis or require deep, interpretive semantic understanding beyond common NLP tasks.
Stars: 45
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Mar 27, 2022
Commits (30d): 0
Dependencies: 8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Lipairui/textgo"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
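The same request can be sketched in Python with only the standard library. Two things here are assumptions rather than documented behavior: that the path follows a `quality/<category>/<owner>/<repo>` pattern (inferred from the single URL above), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; the segment layout after it is assumed.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the request URL, percent-encoding each path segment."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record for one repository (assumes a JSON response body)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call and counts against the daily quota.
    print(fetch_quality("nlp", "Lipairui", "textgo"))
```

Keeping URL construction separate from the network call makes the former easy to check offline and keeps unauthenticated requests within the daily limit during testing.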
Higher-rated alternatives
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
JayYip/m3tl
BERT for Multitask Learning
920232796/bert_seq2seq
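PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme; it now also handles automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and similar tasks, supports T5 models, and supports GPT-2 for article continuation.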
sileod/tasknet
Easy modernBERT fine-tuning and multi-task learning
graykode/toeicbert
TOEIC (Test of English for International Communication) solving using the pytorch-pretrained-BERT model.