programindz/lora-vit-finetuning

Fine-Tuning Google's Vision Transformer LoRA technique. Two different LoRA adapters are tuned for separate classification (food and human actions). A simple Gradio interface is implemented to run the inference.

/ 100

Experimental

This project helps classify images into categories like food items or human actions. You provide an image, and it tells you what food it is or what action a person in the image is performing. This tool is useful for anyone who needs to quickly categorize a large number of images based on their content, like a content moderator or data annotator.

No commits in the last 6 months.

Use this if you need a quick way to identify food items or human actions within images using pre-trained AI models.

Not ideal if you need to classify images into categories other than food or human actions without further customization.

image-classification content-moderation data-annotation food-recognition activity-detection

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

—

Higher-rated alternatives

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Explore Transformer Models

All categories Trending Transformer directory Insights