A comprehensive collection of IQA papers, datasets and codes. We also provide PyTorch implementations of mainstream metrics in IQA-PyTorch
Tip
📚✍️ Feel free to submit a pull request to add a paper you think deserves to be featured in this repository! Your contributions are incredibly appreciated 🙌 and will help make this project even better for everyone!
Thank you for your support! 🫶🤗🎉
Related Resources:
- Awesome Image Aesthetic Assessment and Cropping. A curated list of resources including papers, datasets, and relevant links to aesthetic evaluation and cropping.
Quality assessment with spatial context and local structures
[Arxiv 2024]Grounding-IQA: Grounding Multimodal Language Model for Image Quality Assessment, Chen et al. Bibtex[Eurographics 2024]Enhancing image quality prediction with self-supervised visual masking, Uğur et al. Github | Project | Bibtex[Arxiv 2024]SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning, Chen et al. Github | Bibtex[ACM MM2024]Q-Ground: Image Quality Grounding with Large Multi-modality Models, Chen et al. Bibtex | Github
All IQA types unified in a single model
[ECCV 2024]PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts, Chen et al. Bibtex[ICML 2024]Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels, Wu et al. Github | Bibtex
Human readable IQA, mostly with large language models
[Arxiv 2025]Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model, Li et al. Github | Bibtex[Arxiv 2025]Q-Insight: Understanding Image Quality via Visual Reinforcement Learning, Li et al. Github | Bibtex[TPAMI 2024]Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs, Zhang et al. Github | Bibtex[Arxiv 2024]VisualCritic: Making LMMs Perceive Visual Quality Like Humans, Huang et al. Bibtex[ECCV 2024]A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment, Wu et al. Github | Bibtex[ECCV 2024]Towards Open-ended Visual Quality Comparison, Wu et al. Github | Bibtex[ECCV 2024]Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models, You et al. Github | Project | Bibtex[Arxiv 2024]Descriptive Image Quality Assessment in the Wild, You et al. Github | Project | Bibtex[CVPR 2024]Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models, Wu et al. Github | Bibtex
- ✨
[CVPR 2024 (best paper)]Rich Human Feedback for Text-to-Image Generation, Liang et al. Github | Bibtex [ICLR 2024]Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision, Wu et al. Github | Bibtex[ICCV 2023]TIFA: Text-to-Image Faithfulness Evaluation with Question Answering, Hu et al. Github | Bibtex | Project[NeurIPS 2023]ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation, Xu et al. Github | Bibtex[ICCV2023]Better Aligning Text-to-Image Models with Human Preference, Wu et al. Github | Github(HPSv2) | Bibtex[NeurIPS 2023]Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation, Yuval et al. Github | Bibtex[TCSVT2023]A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment, Li et al. Github | Bibtex
-
[Arxiv 2024]Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment, Guan et al. Bibtex -
[Arxiv 2024]Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare, Zhe et al. Bibtex -
[Arxiv 2024]Quality-aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment, Agnolucci et al. Github | Bibtex -
[ACCV 2024]ATTIQA: Generalizable Image Quality Feature Extractor using Attribute-aware Pretraining, Kwon et al. Bibtex -
[CVPR 2024]Blind Image Quality Assessment Based On Geometric Order Learning, Shin et al. Github | Bibtex -
[CVPR 2024]Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment, Li et al. Bibtex -
[CVPR 2024]Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement , Xu et al. | Bibtex -
[WACV2024]ARNIQA: Learning Distortion Manifold for Image Quality Assessment, Agnolucci et al. Github | Bibtex -
[TIP2023]TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment, Chen et al. Github | Bibtex -
[ICCV2023]Test Time Adaptation for Blind Image Quality Assessment, Roy et al. Github | Bibtex -
[CVPR2023]Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild, Saha et al. Bibtex | Github -
[CVPR2023]Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective, Zhang et al. Github | Bibtex -
[CVPR2023]Quality-aware Pre-trained Models for Blind Image Quality Assessment, Zhao et al. Bibtex -
[AAAI2023]Exploring CLIP for Assessing the Look and Feel of Images, Wang et al. Github | Bibtex -
[AAAI2023]Data-Efficient Image Quality Assessment with Attention-Panel Decoder, Qin et al. Github | Bibtex -
[TPAMI2022]Continual Learning for Blind Image Quality Assessment , Zhang et al. Github | Bibtex -
[TIP2022]No-Reference Image Quality Assessment by Hallucinating Pristine Features, Chen et al. Github | Bibtex -
[TIP2022]VCRNet: Visual Compensation Restoration Network for No-Reference Image Quality Assessment, Pan et al. Github | Bibtex -
[TMM2022]GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment, Sun et al. Github | Bibtex -
[CVPR2021]Troubleshooting Blind Image Quality Models in the Wild, Wang et al. Github | Bibtex
| Paper Link | Method | Type | Published | Code | Keywords |
|---|---|---|---|---|---|
| arXiv | MANIQA | NR | CVPRW2022 | Official | Transformer, multi-dimension attention, dual branch |
| arXiv | TReS | NR | WACV2022 | Official | Transformer, relative ranking, self-consistency |
| KonIQ++ | NR | BMVC2021 | Official | Multi-task with distortion prediction | |
| arXiv | MUSIQ | NR | ICCV2021 | Official / Pytorch | Multi-scale, transformer, Aspect Ratio Preserved (ARP) resizing |
| arXiv | CKDN | NR | ICCV2021 | Official | Degraded reference, Conditional knowledge distillation (related to HIQA) |
| HyperIQA | NR | CVPR2020 | Official | Content-aware hyper network | |
| arXiv | Meta-IQA | NR | CVPR2020 | Official | Meta-learning |
| arXiv | GIQA | NR | ECCV2020 | Official | Generated image |
| arXiv | PI | NR | 2018 PIRM Challenge | Project | 1/2 * (NIQE + (10 - NRQM)). |
| arXiv | HIQA | NR | CVPR2018 | Project | Hallucinated reference |
| arXiv | BPSQM | NR | CVPR2018 | Pixel-wise quality map | |
| arXiv | RankIQA | NR | ICCV2017 | Github | Pretrain on synthetically ranked data |
| CNNIQA | NR | CVPR2014 | PyTorch | First CNN-based NR-IQA | |
| arXiv | UNIQUE | NR | TIP2021 | Github | Combine synthetic and authentic image pairs |
| arXiv | DBCNN | NR | TCSVT2020 | Official | Two branches for synthetic and authentic distortions |
| SFA | NR | TMM2019 | Official | Aggregate ResNet50 features of multiple cropped patches | |
| pdf/arXiv | PQR | NR/Aesthetic | TIP2019 | Official1/Official2 | Unify different type of aesthetic labels |
| arXiv | WaDIQaM (deepIQA) | NR/FR | TIP2018 | PyTorch | Weighted average of patch qualities, shared FR/NR models |
| NIMA | NR | TIP2018 | PyTorch/Tensorflow | Squared EMD loss | |
| MEON | NR | TIP2017 | Multi-task: distortion learning and quality prediction | ||
| arXiv | dipIQ | NR | TIP2017 | download | Similar to RankIQA |
| arXiv | NRQM (Ma) | NR | CVIU2017 | Project | Traditional, Super resolution |
| arXiv | FRIQUEE | NR | JoV2017 | Official | Authentically Distorted, Bag of Features |
| IEEE | HOSA | NR | TIP2016 | Matlab download | Traditional |
| ILNIQE | NR | TIP2015 | Official | Traditional | |
| BRISQUE | NR | TIP2012 | Official | Traditional | |
| BLIINDS-II | NR | TIP2012 | Official | ||
| CORNIA | NR | CVPR2012 | Matlab download | Codebook Representation | |
| NIQE | NR | SPL2012 | Official | Traditional | |
| DIIVINE | NR | TIP2011 | Official |
[ECCV2022]Shift-tolerant Perceptual Similarity Metric, Ghildyal et al. Github | Bibtex[BMVC2022]Content-Diverse Comparisons improve IQA, Thong et al. Bibtex[ACM MM2022]Quality Assessment of Image Super-Resolution: Balancing Deterministic and Statistical Fidelity, Zhou et al. Github | Bibtex
| Paper Link | Method | Type | Published | Code | Keywords |
|---|---|---|---|---|---|
| arXiv | AHIQ | FR | CVPR2022 NTIRE workshop | Official | Attention, Transformer |
| arXiv | JSPL | FR | CVPR2022 | Official | semi-supervised and positive-unlabeled (PU) learning |
| arXiv | CVRKD | NAR | AAAI2022 | Official | Non-Aligned content reference, knowledge distillation |
| arXiv | IQT | FR | CVPRW2021 | PyTorch | Transformer |
| arXiv | A-DISTS | FR | ACMM2021 | Official | |
| arXiv | DISTS | FR | TPAMI2021 | Official | |
| arXiv | LPIPS | FR | CVPR2018 | Project | Perceptual similarity, Pairwise Preference |
| arXiv | PieAPP | FR | CVPR2018 | Project | Perceptual similarity, Pairwise Preference |
| arXiv | WaDIQaM | NR/FR | TIP2018 | Official | |
| arXiv | JND-SalCAR | FR | TCSVT2020 | JND (Just-Noticeable-Difference) | |
| QADS | FR | TIP2019 | Project | Super-resolution | |
| FSIM | FR | TIP2011 | Project | Traditional | |
| VIF/IFC | FR | TIP2006 | Project | Traditional | |
| MS-SSIM | FR | Project | Traditional | ||
| SSIM | FR | TIP2004 | Project | Traditional | |
| PSNR | FR | Traditional |
[CVPR2025]Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment, Behrad et al. Github | Bibtex[ACMMM 2024]AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception, Huang et al. Project | Github | Bibtex[Arxiv 2024]AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception, Huang et al. Github | Bibtex[ECCV2024]Scaling Up Personalized Aesthetic Assessment via Task Vector Arithmetic, Yun et al. Bibtex | Project[CVPR2023]VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining, Ke et al. Bibtex[CVPR2023]Towards Artistic Image Aesthetics Assessment: a Large-scale Dataset and a New Method, Yi et al. Github | Bibtex
Aims to predict the Image Intrinsic Scale, i.e. the scale at which an image shows the best quality
[ICCV 2025]Image Intrinsoc Scale Assessment: Bridging the Gap Between Quality and Resolution, Hosu et al. Bibtex | Github
[ECCV 2024]Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures, He et al. Github | Bibtex[CVPR 2023]Learning a Deep Color Difference Metric for Photographic Images, Chen et al. Github | Bibtex
[CVPR2024]DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer, Chen et al. Bibtex | Project
[NeurIPS 2023]Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment , Wu et al. Bibtex | Github
[Arxiv 2024]ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images, Zhu et al. Bibtex | Github
[Arxiv 2024]Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics, Gushchin et al. Bibtex | Github | Project[NeurIPS 2022]Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop, Zhang et al. Bibtex | Github
| Title | Method | Published | Code | Keywords |
|---|---|---|---|---|
| arXiv | NiNLoss | ACMM2020 | Official | Norm-in-Norm Loss |
| Paper Link | Dataset Name | Type | Published | Website | Images | Annotations |
|---|---|---|---|---|---|---|
| arXiv | UHD-IQA | NR | ECCVW2024 | Project | 6k (~3840x2160) | 20 ratings per image |
| arXiv | PaQ-2-PiQ | NR | CVPR2020 | Official github | 40k, 120k patches | 4M |
| CVF | SPAQ | NR | CVPR2020 | Offical github | 11k (smartphone) | |
| arXiv | KonIQ-10k | NR | TIP2020 | Project | 10k from YFCC100M | 1.2M |
| arXiv | AADB | NR/Aesthentic | ECCV2016 | Official github | 10k images (8500/500/1000), 11 attributes | |
| arXiv | CLIVE | NR | TIP2016 | Project | 1200 | 350k |
| AVA | NR / Aesthentic | CVPR2012 | Github/Project | 250k (60 categories) | ||
| arXiv | PIPAL | FR | ECCV2020 | Project | 250 | 1.13M |
| arXiv | KADIS-700k | FR | arXiv | Project | 140k pristine / 700k distorted | 30 ratings (DCRs) per image. |
| IEEE | KADID-10k | FR | QoMEX2019 | Project | 81 | 10k distortions |
| Waterloo-Exp | FR | TIP2017 | Project | 4744 | 94k distortions | |
| MDID | FR | PR2017 | --- | 20 | 1600 distortions | |
| TID2013 | FR | SP2015 | Project | 25 | 3000 distortions | |
| LIVEMD | FR | ACSSC2012 | Project | 15 pristine images | two successive distortions | |
| CSIQ | FR | Journal of Electronic Imaging 2010 | --- | 30 | 866 distortions | |
| TID2008 | FR | 2009 | Project | 25 | 1700 distortions | |
| LIVE IQA | FR | TIP2006 | Project | 29 images, 780 synthetic distortions | ||
| link | IVC | FR | 2005 | --- | 10 | 185 distortions |
| Paper Title | Dataset Name | Type | Published | Website | Images | Annotations |
|---|---|---|---|---|---|---|
| arXiv | BAPPS(LPIPS) | FR | CVPR2018 | Project | 187.7k | 484k |
| arXiv | PieAPP | FR | CVPR2018 | Project | 200 images | 2.3M |