lilakk/BooookScore

A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length summarization in the era of LLMs".

46
/ 100
Emerging

This project helps researchers and practitioners evaluate the quality of AI-generated summaries for very long texts, like entire books. You input full book texts and the AI-generated summaries, and it outputs a coherence score for each summary. This is ideal for academics and researchers studying large language models and their summarization capabilities.

130 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to systematically generate and assess the coherence of summaries for extremely long documents using various large language models.

Not ideal if you're looking for a simple tool to generate short summaries of typical articles or documents without needing detailed quality evaluation.

natural-language-processing academic-research text-summarization ai-evaluation large-language-models
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 11 / 25

How are scores calculated?

Stars

130

Forks

10

Language

Python

License

MIT

Last pushed

Oct 01, 2024

Commits (30d)

0

Dependencies

11

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/lilakk/BooookScore"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.