This is the official PyTorch codes for the paper.
Integrating Extra Modality Helps Segmentor Find Camouflaged Objects Well
Chengyu Fang, Chunming He, Longxiang Tang, Yuelin Zhang, Chenyang Zhu, Yuqi Shen, Chubin Chen, Guoxia Xu, Xiu Li, arXiv 2025
Abstract: Camouflaged Object Segmentation (COS) remains challenging because camouflaged objects exhibit only subtle visual differences from their backgrounds and single-modality RGB methods provide limited cues, leading researchers to explore multimodal data to improve segmentation accuracy. In this work, we presenet MultiCOS, a novel framework that effectively leverages diverse data modalities to improve segmentation performance. MultiCOS comprises two modules: Bi-space Fusion Segmentor (BFSer), which employs a state space and a latent space fusion mechanism to integrate cross-modal features within a shared representation and employs a fusion‐feedback mechanism to refine context‐specific features, and Cross-modal Knowledge Learner (CKLer), which leverages external multimodal datasets to generate pseudo‐modal inputs and establish cross‐modal semantic associations, transferring knowledge to COS models when real multimodal pairs are missing. When real multimodal COS data are unavailable, CKLer yields additional segmentation gains using only non‐COS multimodal sources. Experiments on standard COS benchmarks show that BFSer outperforms existing multimodal baselines with both real and pseudo‐modal data.
- 2025-02-21: We release a part of results, bibtex, and the preprint of full paper.
- 2025-02-10: We release this repository, the preprint of full paper will be release soon.
- Complete this repository
- Datasets
- Training
- Testing
- Results
- Citation
We achieved state-of-the-art performance on COD10K, CAMO, NC4K, CHAMELEON, and PCOD1200. More results can be found in the paper. We will release all results from different datasets when the paper is accepted.
Visual Comparison (click to expand)
If you find the code helpful in your research or work, please cite the following paper(s).
@misc{fang2025multicos,
title={Integrating Extra Modality Helps Segmentor Find Camouflaged Objects Well},
author={Chengyu Fang and Chunming He and Longxiang Tang and Yuelin Zhang and Chenyang Zhu and Yuqi Shen and Chubin Chen and Guoxia Xu and Xiu Li},
year={2025},
eprint={2502.14471},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2502.14471},
}