analyticalmonk/pyspark_nlp_workshop
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
This workshop provides step-by-step guidance for setting up a cloud-based environment to process large volumes of text data. You'll learn how to load large text datasets into that environment and extract natural language processing (NLP) insights from them. It is aimed at data scientists, machine learning engineers, and NLP specialists who work with large datasets.
No commits in the last 6 months.
Use this if you need to perform natural language processing on massive text datasets and want to leverage the power of PySpark and Spark NLP in a Databricks environment.
Not ideal if you are looking for a simple, local NLP solution for small datasets or are not comfortable with cloud-based big data platforms.
Stars: 12
Forks: 2
Language: Jupyter Notebook
License: —
Category:
Last pushed: May 09, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/analyticalmonk/pyspark_nlp_workshop"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
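The same request as the curl call above can be made from Python with only the standard library. This is a minimal sketch, not an official client. It assumes the URL path follows the pattern `quality/<category>/<owner>/<repo>` (inferred from the single example URL) and that the endpoint returns JSON (the listing does not state the response format). The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path taken from the curl example; everything after it is an assumed pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality data for one repository.

    Returns parsed JSON when the body is valid JSON (an assumption about
    the API), otherwise the raw response text.
    """
    url = build_quality_url(category, owner, repo)
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("nlp", "analyticalmonk", "pyspark_nlp_workshop"))
```

Calling `fetch_quality("nlp", "analyticalmonk", "pyspark_nlp_workshop")` performs the network request and counts against the daily request limit.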
Higher-rated alternatives
dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment...
jonathandunn/text_analytics
Basic text analytics and natural language processing in Python
IBM/watson-document-co-relation
Correlate text content across documents using Watson NLU, Python NLTK and Watson Studio.
Clarifai/clarifai-pyspark
Interfaces for Unstructured data and ML pipelines with Databricks and Clarifai
umer7/Applied-Text-Mining-in-Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan