Lenubolim/TextDiff
Official code implementation of "TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image" in Pattern Recognition
This project improves the clarity and readability of text in images, especially in real-world scenes where the text is blurry or low-resolution. It takes an image containing scene text as input and produces a higher-resolution version in which the text is sharper and easier to read. This is useful for anyone working with images that contain text, for example in digital archiving, optical character recognition (OCR) pre-processing, or general image enhancement.
No commits in the last 6 months.
Use this if you need to enhance the resolution of text in images, making blurry or pixelated scene text clear and legible for better human comprehension or machine processing.
Not ideal if you're looking for a general image upscaling tool that doesn't prioritize text specifically, or if you need to extract text from images (for which dedicated OCR software would be more suitable).
Stars: 24
Forks: —
Language: —
License: —
Category: —
Last pushed: Apr 24, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Lenubolim/TextDiff"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
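The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the URL path follows a category/owner/repository pattern (inferred from the single curl example above) and that the endpoint returns a JSON body; neither is confirmed here. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/{path}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body as JSON.

    Assumption: the response is JSON. The response schema is not
    documented on this page, so the result is returned undecorated.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the same URL used in the curl example.
    print(quality_url("diffusion", "Lenubolim", "TextDiff"))
```

Calling `fetch_quality("diffusion", "Lenubolim", "TextDiff")` performs the network request and counts against the daily request limit, so scripts that poll many repositories should cache results.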
Higher-rated alternatives
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
bghira/SimpleTuner
A general fine-tuning kit geared toward image/video/audio diffusion models.
mcmonkeyprojects/SwarmUI
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
TheDesignFounder/DreamLayer
Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.