JerryYLi/valhalla-nmt

Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"

35
/ 100
Emerging

This project helps machine translation researchers and practitioners improve translation quality by incorporating visual information. It takes source language text and corresponding images as input, generating more accurate target language translations. This is particularly useful for those working with datasets where visual context is crucial for understanding the meaning of text.

No commits in the last 6 months.

Use this if you are a machine translation researcher or engineer looking to experiment with and implement state-of-the-art multimodal machine translation models that leverage visual context.

Not ideal if you are a general user needing a simple, off-the-shelf translation tool without deep technical expertise or specific multimodal data.

machine-translation natural-language-processing multimodal-AI computer-vision AI-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 12 / 25

How are scores calculated?

Stars

28

Forks

4

Language

Python

License

MIT

Last pushed

Feb 19, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JerryYLi/valhalla-nmt"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.