slgero/receipt_parser
Allow parsing Russian receipts
This tool helps you automatically extract structured information from messy, free-text descriptions of products on Russian receipts. It takes a raw product description, like "Нап.пив.ХУГАР.ГРЕЙПФ.н/ф 0.47л," and identifies the normalized product name, brand, and category. This is ideal for anyone who needs to quickly organize and analyze purchase data from Russian retail transactions, such as small business owners, accountants, or market researchers.
No commits in the last 6 months. Available on PyPI.
Use this if you need to process scanned or OCR'd Russian receipts and extract consistent product, brand, and category information, even from abbreviated or noisy text.
Not ideal if your receipts are not in Russian or if you require full receipt image parsing beyond just the product line item descriptions.
Stars
54
Forks
1
Language
Python
License
MIT
Category
Last pushed
Aug 14, 2020
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/slgero/receipt_parser"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isaacus-dev/semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically...
chatopera/Synonyms
:herb: 中文近义词:聊天机器人,智能问答工具包
CUNY-CL/wikipron
Massively multilingual pronunciation mining
jacksonllee/pylangacq
Language Acquisition Research Tools
goodmami/wn
A modern, interlingual wordnet interface for Python