Alperencode/DergiPark-Project

DergiPark - Web Scraping Project

20
/ 100
Experimental

This tool helps researchers, academics, or data scientists gather a large collection of Turkish academic articles. It takes the entire DergiPark website as input and outputs structured data from over 25,000 articles into formats like JSON lines and plain text. This is ideal for anyone building AI models, conducting large-scale research, or needing a comprehensive dataset of Turkish academic content.

No commits in the last 6 months.

Use this if you need to quickly obtain a complete, structured dataset of academic articles from DergiPark for research, AI model training, or data analysis.

Not ideal if you only need a few specific articles, as the project is designed for bulk extraction of the entire DergiPark database.

academic-research data-acquisition literature-review nlp-datasets scientific-publishing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Python

License

MIT

Category

scraper

Last pushed

May 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/Alperencode/DergiPark-Project"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.