lilakk/BooookScore

A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length summarization in the era of LLMs".

/ 100

Emerging

This project helps researchers and practitioners evaluate the quality of AI-generated summaries for very long texts, like entire books. You input full book texts and the AI-generated summaries, and it outputs a coherence score for each summary. This is ideal for academics and researchers studying large language models and their summarization capabilities.

130 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to systematically generate and assess the coherence of summaries for extremely long documents using various large language models.

Not ideal if you're looking for a simple tool to generate short summaries of typical articles or documents without needing detailed quality evaluation.

natural-language-processing academic-research text-summarization ai-evaluation large-language-models

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 11 / 25

How are scores calculated?

Stars

130

Forks

Language

Python

License

MIT

Higher-rated alternatives

openfactcheck-research/openfactcheck

An Open-source Factuality Evaluation Demo for LLMs

several27/FakeNewsCorpus

A dataset of millions of news articles scraped from a curated list of data sources.

Cartus/Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (resources for the...

armingh2000/FactScoreLite

FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy...

manideep2510/siamese-BERT-fake-news-detection-LIAR

Triple Branch BERT Siamese Network for fake news classification on LIAR-PLUS dataset in PyTorch

Explore NLP Tools

All categories Trending NLP directory Insights