Welcome to share CVPR 2024 papers and code #210

amusi · 2024-02-27T03:57:45Z

[The format of the issue]
Paper name/title:
Paper link:
Code link:

iamhankai · 2024-02-27T06:02:23Z

Paper name/title: ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Paper link: https://arxiv.org/abs/2306.14525
Code link: https://parameternet.github.io/

iamhankai · 2024-02-27T06:03:21Z

Paper name/title: An Empirical Study of Scaling Law for OCR
Paper link: https://arxiv.org/abs/2401.00028
Code link: https://github.com/large-ocr-model/large-ocr-model.github.io

KuanchihHuang · 2024-02-27T06:35:04Z

Paper name/title: PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection
Paper link: https://arxiv.org/abs/2312.08371
Code link: https://github.com/kuanchihhuang/PTT

ShunyuanZheng · 2024-02-27T06:42:07Z

Paper name/title: GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper link: https://arxiv.org/abs/2312.02155
Code link: https://github.com/ShunyuanZheng/GPS-Gaussian
Project link: https://shunyuanzheng.github.io/GPS-Gaussian

huliangxiao · 2024-02-27T06:52:17Z

Paper name/title: GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper link: https://arxiv.org/abs/2312.02134
Code link: https://github.com/huliangxiao/GaussianAvatar

TIANLE233 · 2024-02-27T07:24:38Z

Paper name/title: Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Paper link: https://arxiv.org/abs/2312.04265
Code link: https://github.com/w1oves/Rein

zhuangshaobin · 2024-02-27T11:18:09Z

Paper name/title: Vlogger: Make Your Dream A Vlog
Paper link: https://arxiv.org/abs/2401.09414
Code link: https://github.com/Vchitect/Vlogger

BarqueroGerman · 2024-02-27T11:21:45Z

Paper name/title: Seamless Human Motion Composition with Blended Positional Encodings
Paper link: https://arxiv.org/abs/2402.15509
Code link: https://github.com/BarqueroGerman/FlowMDM

buaacyw · 2024-02-27T11:34:49Z

Paper name/title: GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Paper link: https://arxiv.org/abs/2311.14521
Code link: https://github.com/buaacyw/GaussianEditor

Hansxsourse · 2024-02-27T13:50:06Z

Paper name/title: UniGS: Unified Representation for Image Generation and Segmentation
Paper link: https://arxiv.org/abs/2312.01985

classification could be: Diffusion / Image Generation / Segmentation

ch3cook-fdu · 2024-02-27T15:33:56Z

Paper name/title: LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Paper link: https://arxiv.org/abs/2311.18651
Code link: https://github.com/Open3DA/LL3DA
Project link: https://ll3da.github.io/

geometry-adaptation · 2024-02-27T16:26:10Z

Paper name/title: CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update
Paper link: https://arxiv.org/pdf/2312.10908.pdf
Project link: https://clova-tool.github.io/

thaoshibe · 2024-02-27T18:29:07Z

Paper name/title: Edit One for All: Interactive Batch Image Editing
Paper link: https://arxiv.org/abs/2401.10219
Code link: https://github.com/thaoshibe/edit-one-for-all
Project page: https://thaoshibe.github.io/edit-one-for-all

Nightmare-n · 2024-02-28T01:18:17Z

Paper name/title: UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Paper link: https://arxiv.org/abs/2310.08370
Code link: https://github.com/Nightmare-n/UniPAD

DearCaat · 2024-02-28T02:41:53Z

Paper name/title: Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Paper link: https://arxiv.org/abs/2402.17228
Code link: https://github.com/DearCaat/RRT-MIL

Luffy03 · 2024-02-28T04:28:32Z

Paper name/title: VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
Paper link: https://arxiv.org/abs/2402.17300
Code link: https://github.com/Luffy03/VoCo

xb534 · 2024-02-28T06:26:00Z

Paper name/title: SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Paper link: https://arxiv.org/abs/2311.15537
Code link: https://github.com/xb534/SED

WeichenFan · 2024-02-28T07:25:55Z

Paper name/title: Link-Context Learning for Multimodal LLMs
Paper link: https://arxiv.org/pdf/2308.07891.pdf
Code link: https://github.com/isekai-portal/Link-Context-Learning/tree/main

Murrol · 2024-02-28T07:49:54Z

Paper name/title: MoMask: Generative Masked Modeling of 3D Human Motions
Paper link: https://arxiv.org/abs/2312.00063
Code link: https://github.com/EricGuo5513/momask-codes

Andy1621 · 2024-02-28T09:13:28Z

Paper name/title: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Paper link: https://arxiv.org/abs/2311.17005
Code link: https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat2

ethancohen123 · 2024-02-28T09:51:06Z

Paper name/title: ChAda-ViT: Channel Adaptive Attention for Joint Representation Learning of Heterogeneous Microscopy Images
Paper link: https://arxiv.org/abs/2311.15264
Code link: https://github.com/nicoboou/chada_vit

ingra14m · 2024-02-28T09:56:16Z

Paper name/title: Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction
Paper link: https://arxiv.org/abs/2309.13101
Code link: https://github.com/ingra14m/Deformable-3D-Gaussians
Project page: https://ingra14m.github.io/Deformable-Gaussians/

ingra14m · 2024-02-28T09:57:17Z

Paper name/title: SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
Paper link: https://arxiv.org/abs/2312.14937
Code link: https://github.com/yihua7/SC-GS
Project page: https://yihua7.github.io/SC-GS-web/

yyvhang · 2024-02-28T11:13:47Z

Paper name/title: LEMON: Learning 3D Human-Object Interaction Relation from 2D Images (Embodied AI)
Paper link: https://arxiv.org/abs/2312.08963
Code link: https://github.com/yyvhang/lemon_3d

horseee · 2024-02-28T11:26:00Z

Paper name/title: DeepCache: Accelerating Diffusion Models for Free
Paper link: https://arxiv.org/abs/2312.00858
Code link: https://github.com/horseee/DeepCache

SunzeY · 2024-02-29T17:35:44Z

Paper name/title: Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper link: https://arxiv.org/abs/2312.03818
Code link: https://github.com/SunzeY/AlphaCLIP

yinanhe · 2024-03-01T04:55:51Z

Paper name/title: VBench: Comprehensive Benchmark Suite for Video Generative Models
Paper link: https://arxiv.org/abs/2311.17982
Code link: https://github.com/Vchitect/VBench
Project Page: https://vchitect.github.io/VBench-project/

shikiw · 2024-03-01T05:52:08Z

Paper name/title: OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Paper link: https://arxiv.org/abs/2311.17911
Code link: https://github.com/shikiw/OPERA

jameslahm · 2024-03-01T06:23:49Z

Paper name/title: RepViT: Revisiting Mobile CNN From ViT Perspective
Paper link: https://arxiv.org/abs/2307.09283
Code link: https://github.com/THU-MIG/RepViT

lixinustc · 2024-03-02T05:46:55Z

Paper name/title: SeD: Semantic-Aware Discriminator for Image Super-Resolution
Paper link: https://arxiv.org/abs/2402.19387
Code link: https://github.com/lbc12345/SeD

HyeonHo99 · 2024-03-25T17:19:03Z

Paper name/title: VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper link: https://arxiv.org/abs/2312.00845
Code link: https://github.com/HyeonHo99/Video-Motion-Customization
Project Page: https://video-motion-customization.github.io/

xiuqhou · 2024-03-26T01:58:16Z

Paper name/title: Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Paper link: https://arxiv.org/abs/2403.16131
Code link: https://github.com/xiuqhou/Salience-DETR

zhangce01 · 2024-03-28T07:37:05Z

Paper name/title: HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Paper link: https://arxiv.org/abs/2403.12033
Code link: https://github.com/zhangce01/HiKER-SGG
Project page: https://zhangce01.github.io/HiKER-SGG/

cjerry1243 · 2024-03-30T01:48:11Z

Paper name/title: Learning from Synthetic Human Group Activities
Paper link: https://arxiv.org/abs/2306.16772
Code link: https://github.com/cjerry1243/M3Act
Project page: https://cjerry1243.github.io/M3Act/

chen-si-jia · 2024-04-05T08:32:38Z

Paper name/title: Delving into the Trajectory Long-tail Distribution for Muti-object Tracking
Paper link: https://arxiv.org/abs/2403.04700
Code link: https://github.com/chen-si-jia/Trajectory-Long-tail-Distribution-for-MOT

Vegetebird · 2024-04-07T14:01:36Z

Paper name/title: Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
Paper link: https://arxiv.org/pdf/2311.12028.pdf
Code link: https://github.com/NationalGAILab/HoT

cherishleon · 2024-04-08T05:37:13Z

Paper name/title: FairCLIP: Harnessing Fairness in Vision-Language Learning
Paper link: https://arxiv.org/abs/2403.19949
Code link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairCLIP
Project Page: https://ophai.hms.harvard.edu/datasets/harvard-fairvlmed10k/

QinYang79 · 2024-04-08T08:09:23Z

Paper name/title: Noisy-Correspondence Learning for Text-to-Image Person Re-identification
Paper link: https://arxiv.org/pdf/2308.09911.pdf
Code link: https://github.com/QinYang79/RDE

littlepure2333 · 2024-04-12T09:46:26Z

Paper name/title: A Cross-Subject Brain Decoding Framework
Project Page: https://littlepure2333.github.io/MindBridge/
Paper link: https://arxiv.org/abs/2404.07850
Code link: https://github.com/littlepure2333/MindBridge

Osilly · 2024-04-16T18:51:55Z

Paper name/title: A General and Efficient Training for Transformer via Token Expansion
Paper link: https://arxiv.org/abs/2404.00672
Code link: https://github.com/Osilly/TokenExpansion

YuqiYang213 · 2024-04-17T11:03:54Z

Paper name/title: Multi-Task Dense Prediction via Mixture of Low-Rank Experts
Paper link: https://arxiv.org/abs/2403.17749
Code link: https://github.com/YuqiYang213/MLoRE

YuqiYang213 · 2024-04-18T13:26:10Z

Paper name/title: Traffic Scene Parsing through the TSP6K Dataset
Paper link: https://arxiv.org/pdf/2303.02835.pdf
Code link: https://github.com/PengtaoJiang/TSP6K

dahyun-kang · 2024-04-20T09:53:17Z

Paper name/title: Contrastive Mean-Shift Learning for Generalized Category Discovery
Paper link: https://arxiv.org/abs/2404.09451
Code link: https://github.com/sua-choi/CMS
Project page: https://postech-cvlab.github.io/cms/

littlepure2333 · 2024-04-23T11:44:28Z

Paper name/title: A Cross-Subject Brain Decoding Framework
Project Page: https://littlepure2333.github.io/MindBridge/
Paper link: https://arxiv.org/abs/2404.07850
Code link: https://github.com/littlepure2333/MindBridge

Sorry, the title should be:
MindBridge: A Cross-Subject Brain Decoding Framework

pablomm · 2024-04-24T17:24:54Z

Paper name/title: Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Paper link: https://arxiv.org/abs/2403.14291
Code link: https://github.com/vpulab/ovam

Dayan-Guan · 2024-04-29T11:11:29Z

Paper name/title: Efficient Test-Time Adaptation of Vision-Language Models
Paper link: https://arxiv.org/abs/2403.18293
Code link: https://github.com/kdiAAA/TDA

TQTQliu · 2024-04-29T13:42:18Z

Paper name/title: Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Paper link: https://arxiv.org/abs/2404.17528
Code link: https://github.com/TQTQliu/GeFu
Project page: https://gefucvpr24.github.io/

2y7c3 · 2024-05-09T03:34:39Z

Paper name/title: Adversarial Score Distillation: When score distillation meets GAN
Arxiv link: https://arxiv.org/abs/2312.00739 (updating)
Paper link: https://2y7c3.github.io/pdfs/asd.pdf
Code link: https://github.com/2y7c3/ASD

ZhaoChuyang · 2024-05-11T10:02:24Z

Paper name/title: MS-DETR: Efficient DETR Training with Mixed Supervision
Paper link: https://arxiv.org/pdf/2401.03989
Code link: https://github.com/Atten4Vis/MS-DETR

youngLBW · 2024-05-13T03:41:32Z

Paper name/title: DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors
Paper link: https://arxiv.org/abs/2312.16837
Project page: https://younglbw.github.io/DiffusionGAN3D-homepage
Code link: https://github.com/youngLBW/DiffusionGAN3D

demo4ai · 2024-05-19T16:44:19Z

Paper name/title: BlockGCN: Redefining Topology Awareness for Skeleton-Based Action Recognition
Paper link: https://www.researchgate.net/publication/379411619_BlockGCN_Redefining_Topology_Awareness_for_Skeleton-Based_Action_Recognition
Code link: https://github.com/ZhouYuxuanYX/BlockGCN

jaewonalive · 2024-05-26T03:35:56Z

Paper name/title: PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor
Paper link: https://arxiv.org/abs/2403.06668
Code link: https://github.com/jaewonalive/PeerAiD

LQY404 · 2024-06-05T05:52:24Z

Paper name/title: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Project link: https://torrvision.com/clip_as_rnn/
Code link: https://github.com/kevin-ssy/CLIP_as_RNN

LQY404 · 2024-06-06T01:38:27Z

Paper name/title: ASAM: Boosting Segment Anything Model with Adversarial Tuning
Project link: https://asam2024.github.io/
Code link: https://github.com/luckybird1994/ASAM

caiyuanhao1998 · 2024-06-07T04:32:14Z

Paper name/title: Structure-Aware Sparse-View X-ray 3D Reconstruction
Paper link: https://arxiv.org/abs/2311.10959
Code link: https://github.com/caiyuanhao1998/SAX-NeRF

Nicholas0228 · 2024-06-11T11:48:25Z

Paper name/title: CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
Paper link: https://arxiv.org/abs/2403.11162
Code link: https://github.com/Nicholas0228/Revelio

Linwei-Chen · 2024-06-27T08:46:30Z

Paper name/title: CVPR 2024 Poster (Highlight): Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Paper link: https://arxiv.org/abs/2403.05369
Code link: https://github.com/Linwei-Chen/FADC

gswycf · 2024-06-28T14:52:59Z

Paper name/title: SignGraph: A Sign Sequence is Worth Graphs of Nodes
Paper link: https://openaccess.thecvf.com/content/CVPR2024/papers/Gan_SignGraph_A_Sign_Sequence_is_Worth_Graphs_of_Nodes_CVPR_2024_paper.pdf
Code link: https://github.com/gswycf/SignGraph

xxayt · 2024-09-25T12:12:13Z

Paper name/title: Holistic Features are almost Sufficient for Text-to-Video Retrieval
Paper link: https://openaccess.thecvf.com/content/CVPR2024/papers/Tian_Holistic_Features_are_almost_Sufficient_for_Text-to-Video_Retrieval_CVPR_2024_paper.pdf
Code link: https://github.com/ruc-aimc-lab/TeachCLIP

WentaoTan · 2024-12-06T08:40:44Z

Paper name/title: Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID
Paper link: https://arxiv.org/pdf/2405.04940
Code link: https://github.com/WentaoTan/MLLM4Text-ReID
