Skip to content

Latest commit

 

History

History
1130 lines (671 loc) · 42 KB

File metadata and controls

1130 lines (671 loc) · 42 KB

Awesome-CVPR2024-Low-Level-VisionAwesome

整理汇总下今年CVPR底层视觉(Low-Level Vision)相关的论文和代码,括超分辨率,图像去雨,图像去雾,去模糊,去噪,图像恢复,图像增强,图像去摩尔纹,图像修复,图像质量评价,插帧,图像/视频压缩等任务,具体如下。

欢迎star,fork和PR~

Please feel free to star, fork or PR if helpful~

Related Collections(相关整理)

参考或转载请注明出处

CVPR2024官网:https://cvpr.thecvf.com/Conferences/2024

CVPR接收论文列表:https://cvpr.thecvf.com/Conferences/2024/AcceptedPapers

CVPR完整论文库:https://openaccess.thecvf.com/CVPR2024

开会时间:2024年6月17日-6月21日

论文接收公布时间:2024年2月27日

【Contents】

1.超分辨率(Super-Resolution)

AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution

A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution

APISR: Anime Production Inspired Real-World Anime Super-Resolution

Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder

Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model

CAMixerSR: Only Details Need More “Attention”

CFAT: Unleashing Triangular Windows for Image Super-resolution

Continuous Optical Zooming: A Benchmark for Arbitrary-Scale Image Super-Resolution in Real World

CoSeR: Bridging Image and Language for Cognitive Super-Resolution

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution

CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data

Diffusion-based Blind Text Image Super-Resolution

DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF

Image Processing GNN: Breaking Rigidity in Super-Resolution

Latent Modulated Function for Computational Optimal Continuous Image Representation

Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution

Learning Large-Factor EM Image Super-Resolution with Generative Priors

Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

Navigating Beyond Dropout: An Intriguing Solution towards Generalizable Image Super-Resolution

Neural Super-Resolution for Real-time Rendering with Radiance Demodulation

Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution

SeD: Semantic-Aware Discriminator for Image Super-Resolution

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

SinSR: Diffusion-Based Image Super-Resolution in a Single Step

Super-Resolution Reconstruction from Bayer-Pattern Spike Streams

Text-guided Explorable Image Super-resolution

Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts

Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary

Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer

Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution

Video Super-Resolution

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention

2.图像去雨(Image Deraining)

Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining

3.图像去雾(Image Dehazing)

A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint

Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing

ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing

SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving

Video Dehazing

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

4.去模糊(Deblurring)

A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning

AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring

Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring

ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation

LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network

Mitigating Motion Blur in Neural Radiance Fields with Events and Frames

Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring

Motion Blur Decomposition with Cross-shutter Guidance

Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization

Spike-guided Motion Deblurring with Unknown Modal Spatiotemporal Alignment

Unsupervised Blind Image Deblurring Based on Self-Enhancement

Video Deblurring

Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring

EVS-assisted Joint Deblurring Rolling-Shutter Correction and Video Frame Interpolation through Sensor Inverse Modeling

Frequency-aware Event-based Video Deblurring for Real-World Motion Blur

Latency Correction for Event-guided Deblurring and Frame Interpolation

5.去噪(Denoising)

LAN: Learning to Adapt Noise for Image Denoising

LED: A Large-scale Real-world Paired Dataset for Event Camera Denoising

Robust Image Denoising through Adversarial Frequency Mixup

Real-World Mobile Image Denoising Dataset with Efficient Baselines

SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder

Transfer CLIP for Generalizable Image Denoising

Unmixing Diffusion for Self-Supervised Hyperspectral Image Denoising

ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images

6.图像恢复(Image Restoration)

Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration

Boosting Image Restoration via Priors from Pre-trained Models

CoDe: An Explicit Content Decoupling Framework for Image Restoration

Deep Equilibrium Diffusion Restoration with Parallel Sampling

Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

Distilling Semantic Priors from SAM to Efficient Image Restoration Models

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

Image Restoration by Denoising Diffusion Models With Iteratively Preconditioned Guidance

Improving Image Restoration through Removing Degradations in Textual Representations

Learning Degradation-unaware Representation with Prior-based Latent Transformations for Blind Face Restoration

Learning Diffusion Texture Priors for Image Restoration

Look-Up Table Compression for Efficient Image Restoration

Multimodal Prompt Perceiver: Empower Adaptiveness Generalizability and Fidelity for All-in-One Image Restoration

PFStorer: Personalized Face Restoration and Super-Resolution

Restoration by Generation with Constrained Priors

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

WaveFace: Authentic Face Restoration with Efficient Frequency Recovery

Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration

7.图像增强(Image Enhancement)

Color Shift Estimation-and-Correction for Image Enhancement

Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance

Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring

FlowIE:Efficient Image Enhancement via Rectified Flow

Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving

Robust Depth Enhancement via Polarization Prompt Fusion Tuning

Specularity Factorization for Low Light Enhancement

Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach

ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images

Zero-Reference Low-Light Enhancement via Physical Quadruple Priors

Video Enhancement

Binarized Low-light Raw Video Enhancement

UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement

8.图像修复(Inpainting)

Amodal Completion via Progressive Mixed Context Diffusion

Brush2Prompt: Contextual Prompt Generator for Object Inpainting

Choose What You Need: Disentangled Representation Learning for Scene Text Recognition Removal and Editing

Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting

Shadow-Enlightened Image Outpainting

Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting

Video Inpainting

AVID: Any-Length Video Inpainting with Diffusion Model

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

9.高动态范围成像(HDR Imaging)

CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

Deep Video Inverse Tone Mapping Based on Temporal Clues

Generating Content for HDR Deghosting from Frequency View

HDRFlow: Real-Time HDR Video Reconstruction with Large Motions

Perceptual Assessment and Optimization of HDR Image Rendering

Towards HDR and HFR Video from Rolling-Mixed-Bit Spikings

Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network

Zero-Shot Structure-Preserving Diffusion Model for High Dynamic Range Tone Mapping

10.图像质量评价(Image Quality Assessment)

Blind Image Quality Assessment Based on Geometric Order Learning

Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement

Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment

CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration

Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment

Deep Generative Model based Rate-Distortion for Image Downscaling Assessment

Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment

KVQ: Kwai Video Quality Assessment for Short-form Videos

Learned Scanpaths Aid Blind Panoramic Video Quality Assessment

Modular Blind Video Quality Assessment

On the Content Bias in Fréchet Video Distance

PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

11.插帧(Frame Interpolation)

Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images

IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation

Perception-Oriented Video Frame Interpolation via Asymmetric Blending

Sparse Global Matching for Video Frame Interpolation with Large Motion

SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation

TTA-EVF: Test-Time Adaptation for Event-based Video Frame Interpolation via Reliable Pixel and Sample Estimation

Video Frame Interpolation via Direct Synthesis with the Event-based Reference

Video Interpolation with Diffusion Models

12.视频/图像压缩(Video/Image Compression)

Boosting Neural Representations for Videos with a Conditional Decoder

C3: High-performance and low-complexity neural compression from a single image or video

Generative Latent Coding for Ultra-Low Bitrate Image Compression

How Far Can We Compress Instant-NGP-Based NeRF?

Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis

Learned Lossless Image Compression based on Bit Plane Slicing

Towards Backward-Compatible Continual Learning of Image Compression

Video Compression

Task-Aware Encoder Control for Deep Video Compression

Low-Latency Neural Stereo Streaming

Neural Video Compression with Feature Modulation

13.压缩图像质量增强(Compressed Image Quality Enhancement)

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain

14.图像去反光(Image Reflection Removal)

Language-guided Image Reflection Separation

Revisiting Singlelmage Reflection Removal in the Wild

15.图像去阴影(Image Shadow Removal)

HomoFormer: Homogenized Transformer for Image Shadow Removal

16.图像上色(Image Colorization)

Automatic Controllable Colorization by Imagination

Generative Quanta Color Imaging

Learning Inclusion Matching for Animation Paint Bucket Colorization

17.图像和谐化(Image Harmonization)

Relightful Harmonization: Lighting-aware Portrait Background Replacement

Video Harmonization with Triplet Spatio-Temporal Variation Patterns

18.视频稳相(Video Stabilization)

3D Multi-frame Fusion for Video Stabilization

Harnessing Meta-Learning for Improving Full-Frame Video Stabilization

19.图像融合(Image Fusion)

Equivariant Multi-Modality Image Fusion

MRFS: Mutually Reinforcing Image Fusion and Segmentation

Neural Spline Fields for Burst Image Fusion and Layer Separation

Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion

Revisiting Spatial-Frequency Information Integration from a Hierarchical Perspective for Panchromatic and Multi-Spectral Image Fusion

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

Task-Customized Mixture of Adapters for General Image Fusion

20.其他任务(Others)

Close Imitation of Expert Retouching for Black-and-White Photography

Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

Dual Prior Unfolding for Snapshot Compressive Imaging

Dual-Camera Smooth Zoom on Mobile Phones

Dual-scale Transformer for Large-scale Single-Pixel Imaging

Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal

Language-driven All-in-one Adverse Weather Removal

Learning to Remove Wrinkled Transparent Film with Polarized Prior

Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering

Misalignment-Robust Frequency Distribution Loss for Image Transformation

NB-GTR: Narrow-Band Guided Turbulence Removal

NightCC: Nighttime Color Constancy via Adaptive Channel Masking

On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation

ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

RecDiffusion: Rectangling for Image Stitching with Diffusion Models

Residual Denoising Diffusion Models

Real-Time Exposure Correction via Collaborative Transformations and Adaptive Sampling

Rolling Shutter Correction with Intermediate Distortion Flow Estimation

SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image

Seeing Motion at Nighttime with an Event Camera

Shadow Generation for Composite Image Using Diffusion Model

Spatio-Temporal Turbulence Mitigation: A Translational Perspective

Improving Spectral Snapshot Reconstruction with Spectral-Spatial Rectification

持续更新~