Westlake-AI/SEMA

Switch EMA: A Free Lunch for Better Flatness and Sharpness

30
/ 100
Emerging

This project helps machine learning engineers and researchers improve the performance and speed of their deep neural networks. By applying a simple modification to the Exponential Moving Average (EMA) technique, it takes your trained model parameters and outputs a more robust and efficient model. This is especially useful for those working on computer vision or natural language processing tasks who want to achieve better model generalization.

No commits in the last 6 months.

Use this if you are training deep neural networks for tasks like image classification, object detection, or language modeling and want to enhance your model's stability and generalization without significant extra computational cost.

Not ideal if you are not working with deep learning models or if your primary concern is not model generalization and training efficiency.

deep-learning-optimization computer-vision natural-language-processing model-training machine-learning-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

28

Forks

2

Language

Python

License

Apache-2.0

Last pushed

Feb 16, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Westlake-AI/SEMA"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.