nas5w/imdb-data

A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.

44
/ 100
Emerging

This project provides 50,000 movie reviews from IMDb, pre-labeled as either positive or negative, in a single JSON file. It's designed for data scientists or machine learning practitioners who need a ready-to-use dataset for training text classification models. You get movie review text and its corresponding sentiment, which you can use to build systems that automatically categorize opinions.

No commits in the last 6 months. Available on npm.

Use this if you need a pre-processed dataset of movie reviews to train a sentiment analysis model, especially when learning text classification.

Not ideal if you need movie reviews with more granular sentiment scores, specific movie metadata, or a dataset already split into training and testing sets.

sentiment-analysis text-classification natural-language-processing machine-learning-datasets
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 25 / 25
Community 14 / 25

How are scores calculated?

Stars

12

Forks

3

Language

JavaScript

License

MIT

Last pushed

Jan 03, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/nas5w/imdb-data"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.