gersteinlab/Struc-Bench
[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naacl-short.2/
This project helps researchers and developers evaluate how well large language models generate complex, structured data in various formats. You provide test data in JSON, and it produces generated outputs (tables, HTML, or LaTeX) along with scores indicating generation quality. It's for anyone working with Large Language Models who needs to benchmark a model's ability to create structured text.
No commits in the last 6 months.
Use this if you are a machine learning researcher or developer evaluating the performance of Large Language Models (LLMs) in generating structured tabular data.
Not ideal if you are looking for a tool to generate production-ready structured data directly, as this is primarily an evaluation and benchmarking framework.
Stars: 55
Forks: 7
Language: Python
License: —
Category: —
Last pushed: Jul 31, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/gersteinlab/Struc-Bench"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
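The endpoint follows a simple path pattern, so the same call works for any repository by swapping the owner/repo segment. A minimal sketch (base path copied from the curl example above; the variable names are illustrative, and no response schema is assumed):

```shell
# Construct the quality-API URL for any repository.
# Only the trailing owner/repo segment changes between repos.
owner="gersteinlab"
repo="Struc-Bench"
url="https://pt-edge.onrender.com/api/v1/quality/transformers/${owner}/${repo}"
echo "$url"

# Anonymous access allows 100 requests/day:
# curl "$url"
```

With a free key, the same request would carry the key according to whatever auth scheme the API documents; that detail is not shown on this page, so it is omitted here.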
Higher-rated alternatives
ExtensityAI/symbolicai
A neurosymbolic perspective on LLMs
TIGER-AI-Lab/MMLU-Pro
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding...
deep-symbolic-mathematics/LLM-SR
[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation...
microsoft/interwhen
A framework for verifiable reasoning with language models.
zhudotexe/fanoutqa
Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language...