Babelscape/CroCoAlign

A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.

34
/ 100
Emerging

This tool helps you quickly find corresponding sentences in very long documents that are written in different languages, such as a novel translated into several languages. You provide two versions of a document, each in a different language, and it outputs a list of matching sentences. This is ideal for linguists, translators, or researchers who need to analyze parallel texts.

No commits in the last 6 months.

Use this if you need to accurately identify and link equivalent sentences across lengthy documents written in two different languages.

Not ideal if you're working with short texts, only one language, or need to align at a word or phrase level rather than full sentences.

cross-lingual content analysis translation studies linguistic research multilingual document processing parallel text generation
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

10

Forks

2

Language

Python

License

Last pushed

Sep 11, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Babelscape/CroCoAlign"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.