
what kind of depth image is expected to have best performance with current checkpoint? #14

Open
lyc233333 opened this issue Jan 2, 2025 · 2 comments

@lyc233333

Thanks for your great work on this project!

I am trying to use it to refine depth images obtained from RGB-D/stereo cameras, but the results are not great. Could you please share more details about the depth images used in this project (format, range, precision, etc.) so I can add appropriate preprocessing?

@TychoBomer

Searching for the same thing!

@charisoudis

Hi, I am also exploring its application to RGB-D data. It seems that the model does not generalize well across different prompt-depth domains, in contrast with DepthAnything v2 (metric checkpoint). In particular, the fusion modules expect low-resolution depth maps from which they draw sparse scale-shifts and perform sparse local refinement of the mono depths. When prompted with higher-resolution depth maps, whose values contain larger local variances, fusion performs worse.
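If the fusion modules indeed expect low-resolution prompts, one workaround worth trying is to downsample a high-resolution camera depth map to a low-resolution prompt before feeding it to the model, averaging only valid (non-zero) pixels in each block so holes do not drag values down. This is a minimal sketch under assumptions: the target prompt resolution (192x256 here) and the metric-meters float32 convention are guesses and should be checked against the repo's actual prompt format; `downsample_depth` is a hypothetical helper, not part of this project.

```python
import numpy as np

def downsample_depth(depth: np.ndarray, out_h: int, out_w: int) -> np.ndarray:
    """Block-average a high-res metric depth map (float32, meters) down to a
    low-res prompt, ignoring invalid (zero) pixels inside each block.

    Blocks are defined by evenly spaced row/column boundaries, so non-integer
    downsampling ratios are handled without resampling artifacts from zeros.
    """
    h, w = depth.shape
    ys = np.linspace(0, h, out_h + 1).astype(int)  # row boundaries per block
    xs = np.linspace(0, w, out_w + 1).astype(int)  # column boundaries per block
    out = np.zeros((out_h, out_w), dtype=np.float32)
    for i in range(out_h):
        for j in range(out_w):
            block = depth[ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
            valid = block[block > 0]  # treat zeros as missing measurements
            if valid.size:
                out[i, j] = valid.mean()
    return out

# Example: a 480x640 stereo depth map down to a hypothetical 192x256 prompt.
high_res = np.full((480, 640), 2.0, dtype=np.float32)
high_res[::7, ::5] = 0.0  # simulate invalid pixels (holes)
prompt = downsample_depth(high_res, 192, 256)
```

Pre-smoothing the high-res depth (e.g. a median filter) before downsampling may also help reduce the local variance the comment above identifies as problematic.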
