hexuandeng/Mono4SiMT
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
This project provides an implementation for improving real-time, simultaneous machine translation. It takes existing bilingual and monolingual text datasets, processes them, and then outputs a more accurate simultaneous translation model. This is for machine learning researchers or computational linguists working on advancing automated, live translation systems.
No commits in the last 6 months.
Use this if you are a researcher focused on improving the quality of simultaneous machine translation by leveraging monolingual data for sequence-level knowledge distillation.
Not ideal if you are looking for an out-of-the-box, user-friendly tool for translating documents or conversations; this is a research implementation, not an end-user application.
Stars
12
Forks
3
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 19, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hexuandeng/Mono4SiMT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
n-waves/multifit
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...
yxuansu/SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
alibaba-edu/simple-effective-text-matching
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of...