ZhihaoZhang97/RU-AI

[WWW'25] Official repo for paper: RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

34
/ 100
Emerging

This project offers a large collection of real and machine-generated content across text, images, and audio, designed to help evaluate and build tools that detect AI-generated media. You can input various types of digital content (text documents, images, audio files) and use this dataset to train or test models that distinguish between human-created and AI-generated content. It's ideal for researchers and practitioners focused on media authenticity and combating misinformation.

Use this if you need a comprehensive, multimodal dataset to develop or benchmark machine learning models for detecting AI-generated text, images, or audio.

Not ideal if you are looking for a ready-to-use application to detect AI-generated content or if you have limited computational resources (requires significant storage and GPU power).

AI-content-detection misinformation media-authenticity digital-forensics content-verification
No Package No Dependents
Maintenance 6 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

10

Forks

1

Language

Jupyter Notebook

License

MIT

Last pushed

Nov 24, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/ZhihaoZhang97/RU-AI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.