antgroup/ravig-bench
Official implementation of "RAViG-Bench: A Benchmark for Retrieval-Augmented Visually-rich Generation with Multi-modal Automated Evaluation"
This benchmark evaluates automatically generated web content, specifically HTML and CSS code that combines visual elements with information retrieved from source documents. Given the generated page code, it produces detailed reports on three axes: functional correctness (does the code work), visual quality (does the design look good), and informational fidelity (is the content accurate and complete). It's aimed at developers and researchers building and testing systems that generate visually rich web pages from retrieved information.
Use this if you need to rigorously evaluate the functional correctness, visual appeal, and informational accuracy of automatically generated HTML/CSS web content.
Not ideal if you're manually creating web pages or only need to validate simple HTML without visual design or content quality checks.
Stars
10
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 29, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/antgroup/ravig-bench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
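If you'd rather call the endpoint from Python than shell out to curl, here is a minimal sketch using the requests library. The URL and rate limits come from this page; the Bearer-token auth scheme and any response field names are assumptions, so inspect the returned JSON rather than relying on names shown here.

import requests

# Endpoint shown above; open access allows up to 100 requests/day without a key.
URL = "https://pt-edge.onrender.com/api/v1/quality/rag/antgroup/ravig-bench"

# Assumption: a free key (1,000 requests/day) is sent as a Bearer token.
# Check the service's docs for the actual auth scheme.
api_key = None  # set to your key if you have one
headers = {"Authorization": f"Bearer {api_key}"} if api_key else {}

resp = requests.get(URL, headers=headers, timeout=10)
resp.raise_for_status()  # fail loudly on 4xx/5xx (e.g. rate-limit errors)

# Print the raw JSON; the actual schema is whatever the API returns.
print(resp.json())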
Higher-rated alternatives
vectara/open-rag-eval
RAG evaluation without the need for "golden answers"
DocAILab/XRAG
XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced...
HZYAI/RagScore
⚡️ The "1-Minute RAG Audit" — Generate QA datasets & evaluate RAG systems in Colab, Jupyter, or...
AIAnytime/rag-evaluator
A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).
microsoft/benchmark-qed
Automated benchmarking of Retrieval-Augmented Generation (RAG) systems