agneet42/robustness_depth_lang

[CVPR 2024] "On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation"

/ 100

Emerging

This project helps computer vision researchers understand how well language guidance works for tasks like monocular depth estimation. It takes an image and optional textual descriptions as input, then evaluates how robustly and accurately the system estimates the depth information for the scene. Computer vision scientists and engineers working on 3D scene understanding and perception systems would use this to evaluate the effectiveness of language-guided models.

No commits in the last 6 months.

Use this if you are a computer vision researcher evaluating the reliability and generalization capabilities of language-guided depth estimation models under various conditions.

Not ideal if you are looking for a plug-and-play solution for improving depth estimation in real-world applications without delving into research methodology and robustness analysis.

computer-vision depth-estimation language-guidance model-robustness 3D-scene-understanding

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

jayin92/Skyfall-GS

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

Tencent-Hunyuan/Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

ActiveVisionLab/gaussctrl

[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

caiyuanhao1998/Open-DiffusionGS

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D...

deepseek-ai/DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with...

Explore Diffusion Models

All categories Trending Diffusion directory Insights