hamelsmu/ktext

Utilities for preprocessing text for deep learning with Keras

/ 100

Established

This tool helps developers prepare raw text data for deep learning models, particularly when using the Keras framework. It takes unstructured text, cleans it by removing unwanted elements like phone numbers or HTML, breaks it into individual words, and then converts these words into numerical sequences that deep learning models can understand. The primary users are machine learning engineers or data scientists working with text-based AI applications.

180 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly pre-process text data for a Keras deep learning model, especially if you have a large dataset that can fit into memory and you want to leverage parallel processing for speed.

Not ideal if your text data is too large to fit into a single computer's memory, or if you prefer using the more modern and maintained text processing layers built directly into Keras.

natural-language-processing machine-learning-engineering text-analytics deep-learning-preparation

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 18 / 25

How are scores calculated?

Stars

180

Forks

Language

Jupyter Notebook

License

MIT

Related tools

chakki-works/seqeval

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

asahi417/tner

Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...

Franck-Dernoncourt/NeuroNER

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Explore NLP Tools

All categories Trending NLP directory Insights