folia and libfolia
FoLiA is the specification of an XML-based annotation format, and libfolia is a C++ library that parses and manipulates documents in that format. They are ecosystem siblings: the library provides programmatic access to the annotated data structures that the specification defines.
About folia
proycon/folia
FoLiA: Format for Linguistic Annotation. FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl; this repository contains higher-level tools that use the library, as well as the full documentation, validation schemas, and set definitions.
This project provides FoLiA, a standardized XML-based format for storing and exchanging language resources with rich linguistic annotations. Raw text or existing annotated corpora can be represented as a structured FoLiA XML document that records the linguistic features attached to each unit of text. Linguists, computational linguists, and researchers working with annotated text data will find this useful for managing and sharing their datasets.
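To give a rough idea of what such a document looks like, the sketch below reads a small, hand-written FoLiA-style fragment using only Python's standard library. It is not the official FoLiA tooling: the sample document is simplified and has not been validated against the FoLiA schemas, the part-of-speech classes are placeholders, and `read_tokens` is a hypothetical helper name chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace used by FoLiA elements, and the standard XML namespace for xml:id.
FOLIA_NS = "http://ilk.uvt.nl/folia"
XML_NS = "http://www.w3.org/XML/1998/namespace"

# Simplified, hand-written sample (an assumption for illustration; a real
# FoLiA document also carries metadata and annotation declarations).
SAMPLE = """<FoLiA xmlns="http://ilk.uvt.nl/folia" xml:id="example" version="2.0">
  <text xml:id="example.text">
    <s xml:id="example.s.1">
      <w xml:id="example.s.1.w.1"><t>Hello</t><pos class="INTJ"/></w>
      <w xml:id="example.s.1.w.2"><t>world</t><pos class="NOUN"/></w>
    </s>
  </text>
</FoLiA>
"""


def read_tokens(xml_text):
    """Return (id, text, pos_class) for every word element in the document."""
    root = ET.fromstring(xml_text)
    tokens = []
    for word in root.iter(f"{{{FOLIA_NS}}}w"):
        text_el = word.find(f"{{{FOLIA_NS}}}t")
        pos_el = word.find(f"{{{FOLIA_NS}}}pos")
        tokens.append((
            word.get(f"{{{XML_NS}}}id"),
            text_el.text if text_el is not None else None,
            pos_el.get("class") if pos_el is not None else None,
        ))
    return tokens


if __name__ == "__main__":
    for token in read_tokens(SAMPLE):
        print(token)
```

For real corpora, a dedicated FoLiA library is the better choice, since it handles validation, set definitions, and the many annotation types that this sketch ignores.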
About libfolia
LanguageMachines/libfolia
FoLiA library for C++
This C++ library helps developers who work with linguistic data. It lets them read, modify, and create documents that adhere to the FoLiA (Format for Linguistic Annotation) standard. It is aimed at C++ developers building natural language processing applications.