qiyuw/WSPAlign
WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction, to appear at ACL 2023 main conference.
This tool helps language professionals accurately identify corresponding words or phrases between two sentences in different languages. You input a sentence in one language and its translated counterpart in another, and it outputs which words in the first sentence align with which words in the second. This is ideal for computational linguists, machine translation researchers, or anyone needing precise word-level mappings across languages.
No commits in the last 6 months.
Use this if you need to understand or evaluate the exact word-to-word correspondence between a source text and its translation for tasks like machine translation quality assessment or creating parallel corpora.
Not ideal if you're looking for a user-friendly translation tool or a general text analysis application without a focus on word alignment.
Stars
12
Forks
2
Language
Python
License
—
Category
Last pushed
Apr 15, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/qiyuw/WSPAlign"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
luheng/deep_srl
Code and pre-trained model for: Deep Semantic Role Labeling: What Works and What's Next
sileod/tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
loomchild/maligna
Bilingual sengence aligner
CK-Explorer/DuoSubs
Semantic subtitle aligner and merger for bilingual subtitle syncing.
coastalcph/lex-glue
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English