ardauzunoglu/TRScraper
TRScraper is an application developed for use in natural language processing work; it makes it possible to do text mining on large platforms where Turkish content is posted.
TRScraper helps gather Turkish text content from large online platforms for use in natural language processing (NLP) applications. It takes URLs of Turkish content pages and outputs the extracted text data into an Excel spreadsheet. This tool is ideal for researchers, data analysts, or marketers who need to collect Turkish text for sentiment analysis, topic modeling, or other text mining tasks.
No commits in the last 6 months.
Use this if you need to extract Turkish text from major online platforms to analyze or process it further.
Not ideal if you need to scrape data from platforms that are not primarily Turkish or if you require non-textual data.
Stars: 76
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Feb 16, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ardauzunoglu/TRScraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
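The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which this page does not confirm. The URL layout (category, owner, repository) is taken from the curl example above.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names stay valid in a URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (assumption: the body is JSON)."""
    request = Request(
        build_quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("nlp", "ardauzunoglu", "TRScraper"))
```

Calling `fetch_quality("nlp", "ardauzunoglu", "TRScraper")` performs the actual request and counts against the daily request limit stated above.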
Higher-rated alternatives
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
emres/turkish-deasciifier
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
brolin59/trnlp
Natural language processing tools for Turkish
ooguz/turkce-kufur-karaliste
A blacklist for Turkish
ahmetaa/zemberek-nlp
NLP tools for Turkish.