0101011/analitika

Testing Automatic Text Summarization

/ 100

Emerging

This project helps data scientists and researchers prepare raw text articles for natural language processing tasks. It takes in raw text data, tokenizes and filters it, and can enrich it with pre-trained word embeddings, outputting cleaned and augmented data in HDF5 and pickle formats. It's designed for someone who needs to process large collections of text for analysis or model training.

No commits in the last 6 months.

Use this if you need to quickly pre-process a dataset of text articles for further analysis or machine learning, and want to incorporate pre-trained embeddings and data augmentation.

Not ideal if you're looking for a complete, end-to-end text summarization solution or a user-friendly interface for non-technical users.

text-preprocessing natural-language-processing data-preparation text-analysis research-data

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

InflixOP/ContentSnap

ContentSnap is a powerful browser extension that leverages cutting-edge NLP models to summarize...

NC0DER/GreekWikipedia

A Greek abstractive summarization dataset based on Wikipedia.

AmoghPradeep/abstractive-text-summarizer

Abstractive text summarization using BART.

KhushiRajurkar/Medical-Document-Summarizer

An interactive pipeline that extracts and summarizes clinical trial PDFs using advanced NLP models

msorkhpar/wiki-entity-summarization-preprocessor

Convert Wikidata and Wikipedia raw files to filterable formats with a focus of marking Wikidata ...

Explore NLP Tools

All categories Trending NLP directory Insights