alex-raw/imsdb_parse
Parse movie scripts for linguistic analysis
This tool helps linguists and researchers analyze movie and TV scripts by splitting them into their core components, such as dialogue, character names, and scene headings. It takes raw HTML script files as input and outputs structured XML, ready for detailed linguistic analysis. It is designed for anyone studying language patterns and structure in film and television content.
No commits in the last 6 months.
Use this if you need to reliably extract and categorize different elements from movie or TV scripts for linguistic research or content analysis.
Not ideal if you need to process scripts from a wide variety of sources beyond IMSDB, or if you require robust handling for diverse formatting styles.
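To make the HTML-in, XML-out flow described above more concrete, here is a minimal sketch of that kind of pipeline using only the Python standard library. It is not the project's actual code or API: the `script_to_xml` function, the element names (`scene`, `action`, `dialogue`), the assumption that the script text sits inside a `<pre>` element, and the simple heuristics for spotting scene headings and character cues are all illustrative assumptions.

```python
import re
import xml.etree.ElementTree as ET
from html.parser import HTMLParser


class PreText(HTMLParser):
    """Collect text found inside <pre> elements, ignoring inline tags like <b>."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "pre":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "pre" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


# Assumption: scene headings start with INT or EXT (e.g. "INT. KITCHEN - DAY").
SCENE = re.compile(r"^(INT|EXT)[./ ]")


def script_to_xml(html):
    """Hypothetical helper: classify script lines and return an XML element tree."""
    parser = PreText()
    parser.feed(html)
    parser.close()

    root = ET.Element("script")
    speaker = None  # character cue currently in effect, if any
    speech = None   # dialogue element being extended, if any
    for raw in "".join(parser.chunks).splitlines():
        line = raw.strip()
        if not line:
            # A blank line ends the current speech.
            speaker, speech = None, None
        elif SCENE.match(line):
            ET.SubElement(root, "scene").text = line
            speaker, speech = None, None
        elif speaker is None and line.isupper() and len(line.split()) <= 4:
            # Assumption: a short all-caps line outside a speech is a character cue.
            speaker = line
        elif speaker is not None:
            if speech is None:
                speech = ET.SubElement(root, "dialogue", character=speaker)
                speech.text = line
            else:
                speech.text += " " + line
        else:
            ET.SubElement(root, "action").text = line
    return root


sample = """<html><body><pre>
<b>INT. KITCHEN - DAY</b>

Rain taps on the window.

<b>          ANNA</b>
     Did you call him?

</pre></body></html>"""

print(ET.tostring(script_to_xml(sample), encoding="unicode"))
```

Heuristics this simple misclassify some lines (for example, an all-caps transition such as "FADE IN:" would be read as a character cue), which is one reason a parser tuned to a single source's formatting may not transfer well to scripts formatted differently.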
Stars: 7
Forks: not shown
Language: Python
License: GPL-3.0
Category: not shown
Last pushed: Feb 16, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/alex-raw/imsdb_parse"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
prakharchoudhary/SentimentAnalysis
A sentiment analysis model trained on the IMDb dataset, using a bag-of-words model (tokenisation)
WLXie-Tony/Movie_Review_Analysis
Official replication package for IJFE (2026). Asynchronous ETL pipeline using GPT-4o to quantify...
SkyThonk/Movie-Reviews-Sentiment-Analysis
Sentiment Analysis of Movie Reviews is either positive or negative review, the dataset which is...
farisology/SentimentAnalysis
Sentiment Analysis model using Linear SVM and collection of Tweets about Star Wars Rogue One Movie
zhiming-xu/spoiler-detector
Detect spoilers in IMDb movie reviews with deep neural network