Technical tricks to improve or accelerate ECON

If the reconstructed geometry is not satisfying, play with the adjustable parameters in config/econ.yaml

use_smpl: ["hand"]
- [ ]: don't use either hands or face parts from SMPL-X
- ["hand"]: only use the visible hands from SMPL-X
- ["hand", "face"]: use both visible hands and face from SMPL-X
thickness: 2cm
- could be increased accordingly in case final reconstruction xx_full.obj looks flat
k: 4
- could be reduced accordingly in case the surface of xx_full.obj has discontinous artifacts
hps_type: PIXIE
- "pixie": more accurate for face and hands
- "pymafx": more robust for challenging poses
texture_src: image
- "image": direct mapping the aligned pixels to final mesh
- "SD": use Stable Diffusion to generate full texture (TODO)

To accelerate the inference, you could

use_ifnet: False
- True: use IF-Nets+ for mesh completion ( $\text{ECON}_\text{IF}$ - Better quality, ~2min / img)
- False: use SMPL-X for mesh completion ( $\text{ECON}_\text{EX}$ - Faster speed, ~1.8min / img)

# For single-person image-based reconstruction (w/o all visualization steps, 1.5min)
python -m apps.infer -cfg ./configs/econ.yaml -in_dir ./examples -out_dir ./results -novis

Bending legs

Related issues:

ECON: #133, #5
ICON: YuliangXiu/ICON#68
TeCH: huangyangyi/TeCH#14

Reasons:

Most existing human pose estimators (HPS) have some bias towards a mean pose, which has the legs bent
The pseudo GT of HPS is obtained by fitting SMPL(-X) onto 2D landmarks, 2D landmarks has depth ambiguity, which make legs bent

Solution:

The issue of "bending legs" could be significantly reduced by refining the pose of the SMPL(-X) body using an accurate front clothed normal map directly estimated from the input image. For this, we use Sapiens.
The back clothed normal map is conditioned on the back body normal map, which is rendered from the SMPL(-X) body model. Therefore, the accuracy of the back clothed normal map benefits from the refined SMPL(-X) body model.

Potential risk:

For extremely challenging poses, ECON's normal estimator might outperform Sapiens. ECON's normal estimation is conditioned on the SMPL(-X) body model, which encodes body articulation information, leaving only the normal disparity between the body and clothing to be estimated. In contrast, Sapiens directly estimates normal maps from the input image without SMPL(-X) conditioning. Thus, Sapiens might struggle with particularly challenging or rare poses.

Final Reconstruction
w/ sapiens-normal refinement

Original ECON normal estimator

Normal Estimation
w/ sapiens-normal refinement

Original ECON normal estimator

If you use Sapiens in your research, please consider citing it.

@inproceedings{khirodkar2024_sapiens,
    title={Sapiens: Foundation for Human Vision Models},
    author={Khirodkar, Rawal and Bagautdinov, Timur and Martinez, Julieta and Zhaoen, Su and James, Austin and Selednik, Peter and Anderson, Stuart and Saito, Shunsuke},
    year={2024},
    booktitle={European Conference on Computer Vision},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tricks.md

tricks.md

Technical tricks to improve or accelerate ECON

If the reconstructed geometry is not satisfying, play with the adjustable parameters in config/econ.yaml

To accelerate the inference, you could

Bending legs

Files

tricks.md

Latest commit

History

tricks.md

File metadata and controls

Technical tricks to improve or accelerate ECON

If the reconstructed geometry is not satisfying, play with the adjustable parameters in config/econ.yaml

To accelerate the inference, you could

Bending legs