Zhen-Dong/Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

Score: 42 / 100 (Emerging)

This project is a curated list of research papers on neural network quantization. Papers are organized by model structure and application scenario and tagged with keywords for the quantization method used, which makes relevant work easier to find. It is aimed at AI researchers and machine learning engineers who optimize deep learning models for efficiency.

808 stars. No commits in the last 6 months.

Use this if you are an AI researcher or machine learning engineer looking for the latest academic papers on model quantization to improve the efficiency of deep learning models.

Not ideal if you are looking for ready-to-use code implementations or tutorials on how to quantize models.

deep-learning-research model-optimization neural-network-efficiency AI-conferences large-language-models
Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 16 / 25


Stars 808
Forks 59
Language (not listed)
License MIT
Last pushed Mar 27, 2025
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Zhen-Dong/Awesome-Quantization-Papers"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
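The same request can be made from a script. The following is a minimal Python sketch, not an official client: the base URL and path are copied from the curl command above, while the helper names `build_quality_url` and `fetch_quality`, the reading of the path as category/owner/repo, and the assumption that the response body is JSON are all illustrative guesses rather than documented behavior.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, as the
    curl example suggests.
    """
    parts = (quote(p, safe="") for p in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body, ASSUMING it is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to hit the network.
    print(build_quality_url("ml-frameworks", "Zhen-Dong", "Awesome-Quantization-Papers"))
```

Keeping URL construction separate from the network call makes it easy to check the path before spending any of the 100 keyless requests per day.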