google/magika
Fast and accurate AI powered file content types detection
Magika quickly and accurately identifies the true content type of various files, from documents and code to binary data, even when file extensions are missing or incorrect. You feed it files, and it tells you exactly what kind of content is inside, like 'Microsoft Word document' or 'Python source code'. This is ideal for security analysts, data managers, or anyone needing to categorize large collections of files for safety or organization.
10,151 stars. Used by 4 other packages. Actively maintained with 11 commits in the last 30 days. Available on PyPI.
Use this if you need to precisely identify the content type of many files to ensure proper handling, routing, or security scanning.
Not ideal if you only need to identify basic file types by extension and don't require deep content inspection or high accuracy for diverse and potentially malicious files.
Stars
10,151
Forks
495
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 03, 2026
Commits (30d)
11
Dependencies
2
Reverse dependents
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/google/magika"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related frameworks
meilfang/LMFD-PAD
Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for...
2spi/ai-v-real
Real/AI Generated Image Classifier
athen-lab/mai
Multilayer Authenticity Identifier (MAI), a CNN model that attempts to identify synthetic AI images.
Saranya-T-S/AI-Image-Detector
Deep learning-based system to detect AI-generated images using ELA, PRNU, FFT, and noise...
Linear-Fox-Labs/DePixel
Distinguish between real and AI-generated images.