Skip to content

isLinXu/paper-list

Repository files navigation

paper-listGitHub starsGitHub forksGitHub watchersBuild StatusimgGitHub repo sizeGitHub language countGitHub last commitGitHubimg


Paper-List-DAILY
Automatically Update Papers Daily in list

Updated on 2025.06.18

paper_list

Table of Contents
  1. Classification
  2. Object Detection
  3. Semantic Segmentation
  4. Object Tracking
  5. Action Recognition
  6. Pose Estimation
  7. Image Generation
  8. LLM
  9. Scene Understanding
  10. Depth Estimation
  11. Audio Processing
  12. Multimodal
  13. Anomaly Detection
  14. Transfer Learning
  15. Optical Flow
  16. Reinforcement Learning
  17. Graph Neural Networks

Classification

Publish Date Title Authors PDF Code
2025-06-17 DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification Matt Poyser et.al. 2506.14667 null
2025-06-17 Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning Prabhav Sanga et.al. 2506.14515 null
2025-06-17 Compositional Attribute Imbalance in Vision Datasets Jiayi Chen et.al. 2506.14418 null
2025-06-17 One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification Renao Yan et.al. 2506.14176 null
2025-06-17 SeqPE: Transformer with Sequential Position Encoding Huayang Li et.al. 2506.13277 null
2025-06-15 Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs Lu Chen et.al. 2506.12875 null
2025-06-15 Medical Argument Mining: Exploitation of Scarce Data Using NLI Systems Maitane Urruela et.al. 2506.12823 null
2025-06-15 Cross-architecture universal feature coding via distribution alignment Changsheng Gao et.al. 2506.12737 null
2025-06-15 Unsupervised Contrastive Learning Using Out-Of-Distribution Data for Long-Tailed Dataset Cuong Manh Hoang et.al. 2506.12698 null
2025-06-15 Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context Samarth Singhal et.al. 2506.12683 null
2025-06-14 OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification Wenxiao Cai et.al. 2506.12610 null
2025-06-14 DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification Darryl Ho et.al. 2506.12585 null
2025-06-14 MVP-CBM:Multi-layer Visual Preference-enhanced Concept Bottleneck Model for Explainable Medical Image Classification Chunjiang Wang et.al. 2506.12568 null
2025-06-14 PLD: A Choice-Theoretic List-Wise Knowledge Distillation Ejafa Bassam et.al. 2506.12542 null
2025-06-13 GeistBERT: Breathing Life into German NLP Raphael Scheible-Schmitt et.al. 2506.11903 null
2025-06-13 Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression Kuniko Paxton et.al. 2506.11627 null
2025-06-13 Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments Deliang Jin et.al. 2506.11615 null
2025-06-13 Black-Box Edge AI Model Selection with Conformal Latency and Accuracy Guarantees Anders E. Kalør et.al. 2506.11391 null
2025-06-12 SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification Ali Waqas et.al. 2506.10699 null
2025-06-13 PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis Marzieh Oghbaie et.al. 2506.10669 link
2025-06-12 Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance Chun Liu et.al. 2506.10459 null
2025-06-12 Can We Infer Confidential Properties of Training Data from LLMs? Penguin Huang et.al. 2506.10364 null
2025-06-12 Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages Ali Almutairi et.al. 2506.10292 null
2025-06-11 FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification Jun Bai et.al. 2506.10207 null
2025-06-11 Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers Natanael Lucena et.al. 2506.10119 null
2025-06-11 DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding Bin Guo et.al. 2506.10084 null
2025-06-11 Evidential Deep Learning with Spectral-Spatial Uncertainty Disentanglement for Open-Set Hyperspectral Domain Generalization Amirreza Khoshbakht et.al. 2506.09460 null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 ScalableHD: Scalable and High-Throughput Hyperdimensional Computing Inference on Multi-Core CPUs Dhruv Parikh et.al. 2506.09282 null
2025-06-10 Hyperbolic Dual Feature Augmentation for Open-Environment Peilin Yu et.al. 2506.08906 null
2025-06-10 Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification Matthias Beckmann et.al. 2506.08761 null
2025-06-12 InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba Yuhang Wang et.al. 2506.08735 null
2025-06-10 Biologically Inspired Deep Learning Approaches for Fetal Ultrasound Image Classification Rinat Prochii et.al. 2506.08623 null
2025-06-10 mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks Luel Hagos Beyene et.al. 2506.08400 null
2025-06-10 An Adaptive Method Stabilizing Activations for Enhanced Generalization Hyunseok Seung et.al. 2506.08353 null
2025-06-11 Hyperspectral Image Classification via Transformer-based Spectral-Spatial Attention Decoupling and Adaptive Gating Guandong Li et.al. 2506.08324 null
2025-06-09 TokenBreak: Bypassing Text Classification Models Through Token Manipulation Kasimir Schulz et.al. 2506.07948 null
2025-06-09 MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification Iustin Sirbu et.al. 2506.07801 null
2025-06-09 Improving Memory Efficiency for Training KANs via Meta Learning Zhangchi Zhao et.al. 2506.07549 null
2025-06-09 Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks Shakir Yousefi et.al. 2506.07500 null
2025-06-08 Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification Jintao Yan et.al. 2506.07328 null
2025-06-08 A Stable Whitening Optimizer for Efficient Neural Network Training Kevin Frans et.al. 2506.07254 null
2025-06-08 Hierarchical Feature-level Reverse Propagation for Post-Training Neural Networks Ni Ding et.al. 2506.07188 null
2025-06-08 CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação Washington Cunha et.al. 2506.07169 null
2025-06-08 pFedSOP : Accelerating Training Of Personalized Federated Learning Using Second-Order Optimization Mrinmay Sen et.al. 2506.07159 null
2025-06-07 Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry Mahdi Salmani et.al. 2506.06933 null
2025-06-06 Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias Yuanzhe Hu et.al. 2506.06280 null
2025-06-06 FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound Minglang Chen et.al. 2506.06054 null
2025-06-06 Enhancing Orthopox Image Classification Using Hybrid Machine Learning and Deep Learning Models Alejandro Puente-Castro et.al. 2506.06007 null
2025-06-06 LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles Egil Rønningstad et.al. 2506.05976 null
2025-06-06 Integer Binary-Range Alignment Neuron for Spiking Neural Networks Binghao Ye et.al. 2506.05679 null
2025-06-05 FRAME: Pre-Training Video Feature Representations via Anticipation and Memory Sethuraman TV et.al. 2506.05543 null
2025-06-05 Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum Snir Hordan et.al. 2506.05530 null
2025-06-05 Robustness Evaluation for Video Models with Reinforcement Learning Ashwin Ramesh Babu et.al. 2506.05431 null
2025-06-05 Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts Zhong Ji et.al. 2506.04673 null
2025-06-04 Deep Learning for Absorption-Image Analysis Jacob Morrey et.al. 2506.04517 null
2025-06-04 KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products Zixuan Xia et.al. 2506.04432 null
2025-06-04 Benchmarking Time-localized Explanations for Audio Classification Models Cecilia Bolaños et.al. 2506.04391 null
2025-06-04 Hierarchical Text Classification Using Contrastive Learning Informed Path Guided Hierarchy Neeraj Agrawal et.al. 2506.04381 null
2025-06-04 Recent Advances in Medical Image Classification Loan Dao et.al. 2506.04129 null
2025-06-04 Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation Mingxuan Xia et.al. 2506.03857 null
2025-06-04 RhoDARTS: Differentiable Quantum Architecture Search with Density Matrix Simulations Swagat Kumar et.al. 2506.03697 null
2025-06-04 Directional Non-Commutative Monoidal Embeddings for MNIST Mahesh Godavarti et.al. 2506.03472 null
2025-06-03 RoNFA: Robust Neural Field-based Approach for Few-Shot Image Classification with Noisy Labels Nan Xiang et.al. 2506.03461 null
2025-06-02 Quantifying task-relevant representational similarity using decision variable correlation Yu et.al. 2506.02164 null
2025-06-02 Towards Better Generalization and Interpretability in Unsupervised Concept-Based Models Francesco De Santis et.al. 2506.02092 null
2025-06-02 OD3: Optimization-free Dataset Distillation for Object Detection Salwa K. Al Khatib et.al. 2506.01942 null
2025-06-02 Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$ -Smoothness Thomas Pethick et.al. 2506.01913 null
2025-06-02 Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research Jennifer Haase et.al. 2506.01839 null
2025-06-02 mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection Dominik Macko et.al. 2506.01702 null
2025-06-02 Data Pruning by Information Maximization Haoru Tan et.al. 2506.01701 null
2025-06-02 Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data Zixiao Zhu et.al. 2506.01621 null
2025-06-02 Speed-up of Vision Transformer Models by Attention-aware Token Filtering Takahiro Naruko et.al. 2506.01519 null
2025-06-02 A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification Kamal Basha S et.al. 2506.01445 null
2025-05-30 Optimal Weighted Convolution for Classification and Denosing Simone Cammarasana et.al. 2505.24558 null
2025-05-30 SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification Zheng Wang et.al. 2505.24380 null
2025-05-30 Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification Maciej Wielgosz et.al. 2505.24375 null
2025-05-30 GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models Gilles Quentin Hacheme et.al. 2505.24340 null
2025-05-30 Provably Improving Generalization of Few-Shot Models with Synthetic Data Lan-Cuong Nguyen et.al. 2505.24190 null
2025-05-30 FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System Bhawana Chhaglani et.al. 2505.24115 null
2025-05-30 Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting Chen Huang et.al. 2505.24088 null
2025-05-29 BIRD: Behavior Induction via Representation-structure Distillation Galen Pogoncheff et.al. 2505.23933 null
2025-05-29 Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need Qiang Wang et.al. 2505.23744 null
2025-05-29 Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds Andrew Chang et.al. 2505.23509 link
2025-05-29 MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification Yang Qiao et.al. 2505.23365 null
2025-05-29 DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification Daoxi Cao et.al. 2505.23341 null
2025-05-29 Deep Modeling and Optimization of Medical Image Classification Yihang Wu et.al. 2505.23040 link
2025-05-28 Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification Sylvey Lin et.al. 2505.22926 null
2025-05-28 Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision Ziyue Kang et.al. 2505.22701 null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 null
2025-05-28 Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets Dongyue Li et.al. 2505.21930 null
2025-05-28 Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Mehrdad Noori et.al. 2505.21844 null
2025-05-27 MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis Yitong Li et.al. 2505.21698 null
2025-05-27 Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts Yuxin Zhu et.al. 2505.21324 null
2025-05-27 Making Every Event Count: Balancing Data Efficiency and Accuracy in Event Camera Subsampling Hesam Araghi et.al. 2505.21187 null
2025-05-27 Information-Theoretic Complementary Prompts for Improved Continual Text Classification Duzhen Zhang et.al. 2505.20933 null
2025-05-27 Evidential Deep Active Learning for Semi-Supervised Classification Shenkai Zhao et.al. 2505.20691 null
2025-05-26 UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models Xueyan Zhang et.al. 2505.20154 null
2025-05-26 Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases Cheng-Yu Tai et.al. 2505.20149 null
2025-05-26 Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models Antti Koskela et.al. 2505.19969 null
2025-05-26 Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning Run Gu et.al. 2505.19940 null
2025-05-26 Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models Mobina Mansoori et.al. 2505.19779 link
2025-05-26 Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments Junming Liu et.al. 2505.19699 null
2025-05-26 Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models Rui Cai et.al. 2505.19616 null
2025-05-26 Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning Jiyu Hu et.al. 2505.19522 null
2025-05-26 DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models Niloufar Alipour Talemi et.al. 2505.19373 null
2025-05-25 Remote Sensing Image Classification with Decoupled Knowledge Distillation Yaping He et.al. 2505.19111 null
2025-05-24 MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images Han Li et.al. 2505.18741 null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 null
2025-05-23 KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches Mingquan Feng et.al. 2505.17919 null
2025-05-23 Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation Teruki Sano et.al. 2505.17579 null
2025-05-23 Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning Cheng Peng et.al. 2505.17436 null
2025-05-23 EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion Zichuan Yang et.al. 2505.17367 null
2025-05-22 Extending Dataset Pruning to Object Detection: A Variance-based Approach Ryota Yagi et.al. 2505.17245 null
2025-05-23 TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation Yuhui Zhang et.al. 2505.16923 null
2025-05-22 Incremental Sequence Classification with Temporal Consistency Lucas Maystre et.al. 2505.16548 null
2025-05-22 Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification Amirreza Mahbod et.al. 2505.16338 null
2025-05-22 Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings Arjhun Swaminathan et.al. 2505.16313 link
2025-05-22 Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces Preeti Mehta et.al. 2505.16253 null
2025-05-22 When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification Zirui Pang et.al. 2505.16149 null
2025-05-21 Small Language Models in the Real World: Insights from Industrial Text Classification Lujun Li et.al. 2505.16078 null
2025-05-21 GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection Mariia Seleznova et.al. 2505.16017 null
2025-05-21 Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers Mehran Zoravar et.al. 2505.15997 null
2025-05-21 Large Language Models as Computable Approximations to Solomonoff Induction Jun Wan et.al. 2505.15784 null
2025-05-21 FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models Zhen Sun et.al. 2505.15644 null
2025-05-21 SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks Iuliia Kotseruba et.al. 2505.15628 link
2025-05-21 Aligning Explanations with Human Communication Jacopo Teneggi et.al. 2505.15626 null
2025-05-21 Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification Conghao Xiong et.al. 2505.15504 null
2025-05-21 Adaptive Temperature Scaling with Conformal Prediction Nikita Kotelevskii et.al. 2505.15437 null
2025-05-21 Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification Bernardin Ligan et.al. 2505.15334 null
2025-05-21 Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data Adib Bazgir et.al. 2505.15132 null
2025-05-20 Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications Fadel M. Megahed et.al. 2505.14918 null
2025-05-20 Solving MNIST with a globally trained Mixture of Quantum Experts Paolo Alessandro Xavier Tognini et.al. 2505.14789 null
2025-05-20 Guarded Query Routing for Large Language Models Richard Šléher et.al. 2505.14524 null
2025-05-20 PRL: Prompts from Reinforcement Learning Paweł Batorski et.al. 2505.14412 null
2025-05-20 Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach Inder Pal Singh et.al. 2505.14333 link
2025-05-20 HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing Shamsuddeen Hassan Muhammad et.al. 2505.14311 null
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 link
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 null
2025-05-20 Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification Yibo Gao et.al. 2505.14049 null
2025-05-20 A Challenge to Build Neuro-Symbolic Video Agents Sahil Shah et.al. 2505.13851 null
2025-05-19 Synthetic-Powered Predictive Inference Meshi Bashari et.al. 2505.13432 null
2025-05-20 Unlabeled Data or Pre-trained Model: Rethinking Semi-Supervised Learning and Pretrain-Finetuning Song-Lin Li et.al. 2505.13317 null
2025-05-19 A Physics-Inspired Optimizer: Velocity Regularized Adam Pranav Vaidhyanathan et.al. 2505.13196 null
2025-05-19 Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision Pengcheng Pan et.al. 2505.13191 null
2025-05-19 Learning to Adapt to Position Bias in Vision Transformer Classifiers Robert-Jan Bruintjes et.al. 2505.13137 link
2025-05-19 When majority rules, minority loses: bias amplification of gradient descent François Bachoc et.al. 2505.13122 null
2025-05-19 Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification Xiao Wu et.al. 2505.13039 null
2025-05-19 EPIC: Explanation of Pretrained Image Classification Networks via Prototype Piotr Borycki et.al. 2505.12897 link
2025-05-19 Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran et.al. 2505.12789 null
2025-05-19 An approach based on class activation maps for investigating the effects of data augmentation on neural networks for image classification Lucas M. Dorneles et.al. 2505.12581 null
2025-05-16 Energy efficiency analysis of Spiking Neural Networks for space applications Paolo Lunghi et.al. 2505.11418 null
2025-05-16 Harnessing Photon Indistinguishability in Quantum Extreme Learning Machines Malo Joly et.al. 2505.11238 null
2025-05-16 CheX-DS: Improving Chest X-ray Image Classification with Ensemble Learning Based on DenseNet and Swin Transformer Xinran Li et.al. 2505.11168 null
2025-05-16 Privacy-Aware Lifelong Learning Ozan Özdenizci et.al. 2505.10941 null
2025-05-16 MCU: Improving Machine Unlearning through Mode Connectivity Yingdan Shi et.al. 2505.10859 null
2025-05-15 CLIP Embeddings for AI-Generated Image Detection: A Few-Shot Study with Lightweight Classifier Ziyang Ou et.al. 2505.10664 null
2025-05-15 Research of the Variational Shadow Quantum Circuit Based on the Whale Optimization Algorithm in Image Classification Shuang Wu et.al. 2505.09994 null
2025-05-14 Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting Chen-Yu Liu et.al. 2505.09395 null
2025-05-14 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Bingxin Ke et.al. 2505.09358 link
2025-05-17 PrePrompt: Predictive prompting for class incremental learning Libo Huang et.al. 2505.08586 link
2025-05-13 Convolutional Spiking Neural Network for Image Classification Mikhail Kiselev et.al. 2505.08514 null
2025-05-13 CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets Aidar Amangeldi et.al. 2505.08259 null
2025-05-13 Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification Xiaoshuo Yan et.al. 2505.08173 null
2025-05-13 MoKD: Multi-Task Optimization for Knowledge Distillation Zeeshan Hayder et.al. 2505.08170 null
2025-05-12 Hierarchical Sparse Attention Framework for Computationally Efficient Classification of Biological Cells Elad Yoshai et.al. 2505.07661 null
2025-05-12 Synthetic Similarity Search in Automotive Production Christoph Huber et.al. 2505.07256 null
2025-05-12 Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models Yan Xie et.al. 2505.07209 null
2025-05-12 KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification Hajar Sakai et.al. 2505.07162 null
2025-05-11 A Vision-Language Foundation Model for Leaf Disease Identification Khang Nguyen Quoc et.al. 2505.07019 null
2025-05-11 Image Classification Using a Diffusion Model as a Pre-Training Model Kosuke Ukita et.al. 2505.06890 null
2025-05-11 NeuRN: Neuro-inspired Domain Generalization for Image Classification Hamd Jalil et.al. 2505.06881 null
2025-05-11 Active Learning for Multi-class Image Classification Thien Nhan Vo et.al. 2505.06825 null
2025-05-10 FNBench: Benchmarking Robust Federated Learning against Noisy Labels Xuefeng Jiang et.al. 2505.06684 link
2025-05-10 The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification Arezoo Hatefi et.al. 2505.06624 null
2025-05-09 Adapting a Segmentation Foundation Model for Medical Image Classification Pengfei Gu et.al. 2505.06217 null
2025-05-09 Towards Robust Few-Shot Text Classification Using Transformer Architectures and Dual Loss Strategies Xu Han et.al. 2505.06145 null
2025-05-09 Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification Leon Eshuijs et.al. 2505.06032 link
2025-05-09 Efficient Quantum Convolutional Neural Networks for Image Classification: Overcoming Hardware Constraints Peter Röseler et.al. 2505.05957 null
2025-05-09 Achieving 3D Attention via Triplet Squeeze and Excitation Block Maan Alhazmi et.al. 2505.05943 null
2025-05-09 Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes Youngjoon Lee et.al. 2505.05798 null
2025-05-09 Variational Bayesian Logistic Tensor Regression with Application to Image Recognition Yunzhi Jin et.al. 2505.05730 null
2025-05-08 V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models Guilherme Vieira Neto et.al. 2505.05659 link
2025-05-08 KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification Qianbo Zang et.al. 2505.05583 link
2025-05-08 Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It Marvin F. da Silva et.al. 2505.05409 null
2025-05-08 Quantum Surrogate-Driven Image Classifier: A Gradient-Free Approach to Avoid Barren Plateaus Yichen Xie et.al. 2505.05249 null
2025-05-08 Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models Wei Peng et.al. 2505.05189 null
2025-05-08 CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models Mengjun Yi et.al. 2505.05130 null
2025-05-08 Direct Image Classification from Fourier Ptychographic Microscopy Measurements without Reconstruction Navya Sonal Agarwal et.al. 2505.05054 null
2025-05-07 ORXE: Orchestrating Experts for Dynamically Configurable Efficiency Qingyuan Wang et.al. 2505.04850 null
2025-05-07 Label-efficient Single Photon Images Classification via Active Learning Zili Zhang et.al. 2505.04376 null
2025-05-07 FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning Sanghyeon Park et.al. 2505.04223 null
2025-05-06 Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment João Alves et.al. 2505.03554 null
2025-05-06 Noisy HQNNs: A Comprehensive Analysis of Noise Robustness in Hybrid Quantum Neural Networks Tasnim Ahmed et.al. 2505.03378 null
2025-05-06 A Vision-Language Model for Focal Liver Lesion Classification Song Jian et.al. 2505.03350 null
2025-05-06 Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices Tasnim Shahriar et.al. 2505.03303 null
2025-05-06 Survey of Abstract Meaning Representation: Then, Now, Future Behrooz Mansouri et.al. 2505.03229 null
2025-05-06 seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models Hafez Ghaemi et.al. 2505.03176 null
2025-05-06 Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control Sajjad Rezvani Boroujeni et.al. 2505.03134 null
2025-05-05 Bayesian Robust Aggregation for Federated Learning Aleksandr Karakulev et.al. 2505.02490 null
2025-05-06 Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets Wei Liu et.al. 2505.02118 null
2025-05-03 Backdoor Attacks Against Patch-based Mixture of Experts Cedric Chan et.al. 2505.01811 null
2025-05-03 Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge Florian Schmid et.al. 2505.01747 null
2025-05-03 CLOG-CD: Curriculum Learning based on Oscillating Granularity of Class Decomposed Medical Image Classification Asmaa Abbas et.al. 2505.01741 null
2025-05-02 TActiLE: Tiny Active LEarning for wearable devices Massimo Pavan et.al. 2505.01160 null
2025-04-30 Towards Improved Cervical Cancer Screening: Vision Transformer-Based Classification and Interpretability Khoa Tuan Nguyen et.al. 2504.21340 null
2025-04-28 AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection Jianbo Gao et.al. 2504.21044 null
2025-04-29 Photonic Quantum Convolutional Neural Networks with Adaptive State Injection Léo Monbroussou et.al. 2504.20989 null
2025-04-30 DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition Yanghui Song et.al. 2504.20948 link
2025-04-29 MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification Yichu Xu et.al. 2504.20509 null
2025-04-28 DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes Junlin Guo et.al. 2504.20303 null
2025-04-28 GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Mingqian He et.al. 2504.19898 null
2025-04-28 Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications Zhilin Lu et.al. 2504.19806 null
2025-04-28 Explaining Vision GNNs: A Semantic and Visual Analysis of Graph-based Image Classification Nikolaos Chaidos et.al. 2504.19682 null
2025-04-28 Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs Muhammad Sabih et.al. 2504.19659 null
2025-04-28 Neural network task specialization via domain constraining Roman Malashin et.al. 2504.19592 null
2025-04-28 GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability Sehyeong Jo et.al. 2504.19414 null
2025-04-27 Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype Anyong Qin et.al. 2504.19074 null
2025-04-26 Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting Zhyar Rzgar K Rostam et.al. 2504.19021 null
2025-04-26 A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification Junichiro Niimi et.al. 2504.18884 link
2025-04-26 IoT Botnet Detection: Application of Vision Transformer to Classification of Network Flow Traffic Hassan Wasswa et.al. 2504.18781 null
2025-04-25 Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models Patrick Müller et.al. 2504.18510 null
2025-04-25 Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training Hiroki Naganuma et.al. 2504.18454 null
2025-04-25 Passive All-Optical Nonlinear Neuron Activation via PPLN Nanophotonic Waveguides Wujie Fu et.al. 2504.18145 null
2025-04-25 DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification Guohao Huo et.al. 2504.18046 null
2025-04-24 Disaggregated Deep Learning via In-Physics Computing at Radio Frequency Zhihui Gao et.al. 2504.17752 null
2025-04-24 Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction Farhad Pourkamali-Anaraki et.al. 2504.17655 null
2025-04-24 Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled yet Hard-to-Learn Samples in Noisy Data Weiran Pan et.al. 2504.17474 null
2025-04-24 Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks Tran Thuy Nga Truong et.al. 2504.17346 null
2025-04-24 Evaluating and Mitigating Bias in AI-Based Medical Text Generation Xiuying Chen et.al. 2504.17279 null
2025-04-24 Group Downsampling with Equivariant Anti-aliasing Md Ashiqur Rahman et.al. 2504.17258 link
2025-04-24 Multi-Modal Traffic Analysis: Integrating Time-Series Forecasting, Accident Prediction, and Image Classification Nivedita M et.al. 2504.17232 null
2025-04-23 A Diff-Attention Aware State Space Fusion Model for Remote Sensing Classification Wenping Ma et.al. 2504.16665 null
2025-04-23 Streetscape Analysis with Generative AI (SAGAI): Vision-Language Assessment and Mapping of Urban Scenes Joan Perez et.al. 2504.16538 null
2025-04-24 An Effective Gram Matrix Characterizes Generalization in Deep Networks Rubing Yang et.al. 2504.16450 null
2025-04-23 FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing Hariseetharam Gunduboina et.al. 2504.16433 null
2025-04-22 CLIP-IT: CLIP-based Pairing for Histology Images Classification Banafsheh Karimian et.al. 2504.16181 null
2025-04-22 Automated Bug Report Prioritization in Large Open-Source Projects Riley Pierson et.al. 2504.15912 null
2025-04-22 Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases Modhurita Mitra et.al. 2504.15829 null
2025-04-22 DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers Xuyang Zhong et.al. 2504.15827 null
2025-04-22 HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification Hongxing Peng et.al. 2504.15612 null
2025-04-22 LLM-based Semantic Augmentation for Harmful Content Detection Elyas Meguellati et.al. 2504.15548 null
2025-04-21 Feeding LLM Annotations to BERT Classifiers at Your Own Risk Yucheng Lu et.al. 2504.15432 null
2025-04-21 Dynamic 3D KAN Convolution with Adaptive Grid Optimization for Hyperspectral Image Classification Guandong Li et.al. 2504.15155 null
2025-04-21 Application of Sensitivity Analysis Methods for Studying Neural Network Models Jiaxuan Miao et.al. 2504.15100 null
2025-04-21 Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks Keisuke Murota et.al. 2504.14995 null
2025-04-21 ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages Zhoujie Qian et.al. 2504.14825 null
2025-04-21 What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale Xiaoyong Yuan et.al. 2504.14815 null
2025-04-21 A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm Kazuhisa Fujita et.al. 2504.14814 null
2025-04-19 Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation Muhammad Haseeb Aslam et.al. 2504.14307 null
2025-04-19 Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation Johannes Spoecklberger et.al. 2504.14231 null
2025-04-19 Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization Huiyi Chen et.al. 2504.14200 null
2025-04-19 ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification Hai Pham-Ngoc et.al. 2504.14139 null
2025-04-18 Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models Junjie Yang et.al. 2504.13825 null
2025-04-18 CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Yang Yue et.al. 2504.13820 link
2025-04-18 Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis Zhu Zhu et.al. 2504.13754 null
2025-04-18 Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration Gianluca Carloni et.al. 2504.13717 null
2025-04-18 Word Embedding Techniques for Classification of Star Ratings Hesham Abdelmotaleb et.al. 2504.13653 null
2025-04-18 Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification Pengxiang Gao et.al. 2504.13608 null
2025-04-18 MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework Zhenkai Qin et.al. 2504.13574 null
2025-04-18 Bayesian continual learning and forgetting in neural networks Djohan Bonnet et.al. 2504.13569 null
2025-04-17 Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification Muhammad Ahmad et.al. 2504.13242 null
2025-04-17 Perception Encoder: The best visual embeddings are not at the output of the network Daniel Bolya et.al. 2504.13181 null
2025-04-17 Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image Classification Guandong Li et.al. 2504.13045 null
2025-04-17 Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification Reek Majumder et.al. 2504.12644 null
2025-04-16 GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image Classification Thanh-Dung Le et.al. 2504.12484 null
2025-04-16 FLIP Reasoning Challenge Andreas Plesner et.al. 2504.12256 null
2025-04-16 Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision Linhao Qu et.al. 2504.12132 null
2025-04-16 Exploring Video-Based Driver Activity Recognition under Noisy Labels Linjuan Fan et.al. 2504.11966 link
2025-04-17 Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification Yue Li et.al. 2504.11793 null
2025-04-15 The Pontryagin Maximum Principle for Training Convolutional Neural Networks Sebastian Hofmann et.al. 2504.11647 null
2025-04-15 Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey Siteng Ma et.al. 2504.11588 null
2025-04-15 Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models Gergely D. Németh et.al. 2504.11216 null
2025-04-15 Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification Zhenyu Yang et.al. 2504.10916 null
2025-04-15 Progressive Rock Music Classification Arpan Nagar et.al. 2504.10821 null
2025-04-15 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification Guandong Li et.al. 2504.10795 null
2025-04-14 Quantum Image Classification: Experiments on Utility-Scale Quantum Computers Hrant Gharibyan et.al. 2504.10595 null
2025-04-14 LEMUR Neural Network Dataset: Towards Seamless AutoML Arash Torabi Goodarzi et.al. 2504.10552 null
2025-04-13 An Efficient Quantum Classifier Based on Hamiltonian Representations Federico Tiblias et.al. 2504.10542 null
2025-04-14 Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning LeiLei Ma et.al. 2504.09990 null
2025-04-14 GFT: Gradient Focal Transformer Boris Kriuk et.al. 2504.09852 null
2025-04-13 PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification Pengfei Wang et.al. 2504.09502 null
2025-04-13 InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection Lin Zhu et.al. 2504.09448 null
2025-04-13 Sparse Deformable Mamba for Hyperspectral Image Classification Lincoln Linlin Xu et.al. 2504.09446 null
2025-04-12 Cycle Training with Semi-Supervised Domain Adaptation: Bridging Accuracy and Efficiency for Real-Time Mobile Scene Detection Huu-Phong Phan-Nguyen et.al. 2504.09297 null
2025-04-12 Sparse Hybrid Linear-Morphological Networks Konstantinos Fotopoulos et.al. 2504.09289 null
2025-04-12 Mixture of Group Experts for Learning Invariant Representations Lei Kang et.al. 2504.09265 null
2025-04-12 Langformers: Unified NLP Pipelines for Language Models Rabindra Lamsal et.al. 2504.09170 null
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 Hypergraph Vision Transformers: Images are More than Nodes, More than Edges Joshua Fixelle et.al. 2504.08710 null
2025-04-11 Integrated ensemble of BERT- and features-based models for authorship attribution in Japanese literary works Taisei Kanda et.al. 2504.08527 null
2025-04-11 An Early Experience with Confidential Computing Architecture for On-Device Model Protection Sina Abdollahi et.al. 2504.08508 null
2025-04-11 The inherent convolution property of quantum neural networks Guangkai Qu et.al. 2504.08487 null
2025-04-11 A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification Kerol Djoumessi et.al. 2504.08481 null
2025-04-11 FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Cheng-Yu Hsieh et.al. 2504.08368 null
2025-04-11 Comparative Analysis of Different Methods for Classifying Polychromatic Sketches Fahd Baba et.al. 2504.08186 null
2025-04-11 Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks Erin Carson et.al. 2504.07835 null
2025-04-10 Traversal Learning Coordination For Lossless And Efficient Distributed Learning Erdenebileg Batbaatar et.al. 2504.07471 null
2025-04-09 Identifying regions of interest in whole slide images of renal cell carcinoma Mohammed Lamine Benomar et.al. 2504.07313 null
2025-04-09 A new training approach for text classification in Mental Health: LatentGLoss Korhan Sevinç et.al. 2504.07245 null
2025-04-09 Deep Learning for Cardiovascular Risk Assessment: Proxy Features from Carotid Sonography as Predictors of Arterial Damage Christoph Balada et.al. 2504.06680 null
2025-04-08 Memory-Modular Classification: Learning to Generalize with Memory Replacement Dahyun Kang et.al. 2504.06021 null
2025-04-08 Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-Gradients Alessio Mora et.al. 2504.05822 null
2025-04-08 DefMamba: Deformable Visual State Space Model Leiye Liu et.al. 2504.05794 null
2025-04-08 Layer-Aware Embedding Fusion for LLMs in Text Classifications Jiho Gwak et.al. 2504.05764 null
2025-04-07 REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding Sakib Reza et.al. 2504.05491 null
2025-04-07 Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability Mohammad Hossein Najafi et.al. 2504.05483 null
2025-04-07 Explaining Low Perception Model Competency with High-Competency Counterfactuals Sara Pohland et.al. 2504.05254 null
2025-04-07 Federated Learning for Medical Image Classification: A Comprehensive Benchmark Zhekai Zhou et.al. 2504.05238 null
2025-04-07 Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data Charco Hui et.al. 2504.05020 null
2025-04-07 RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model Congcong Wen et.al. 2504.04988 null
2025-04-06 Your Image Generator Is Your New Private Dataset Nicolo Resmini et.al. 2504.04582 null
2025-04-06 Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification Shijian Wang et.al. 2504.04510 null
2025-04-06 Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image Classification Guandong Li et.al. 2504.04463 null
2025-04-05 A Comparative Study of Explainable AI Methods: Model-Agnostic vs. Model-Specific Approaches Keerthi Devireddy et.al. 2504.04276 null
2025-04-05 GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models Hengyu Luo et.al. 2504.04155 null
2025-04-05 Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology Images Andrei-Alexandru Preda et.al. 2504.04130 null
2025-04-04 Adaptive Classification of Interval-Valued Time Series Wan Tian et.al. 2504.03318 null
2025-04-04 Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction Junlang Qian et.al. 2504.03159 null
2025-04-03 HQViT: Hybrid Quantum Vision Transformer for Image Classification Hui Zhang et.al. 2504.02730 null
2025-04-03 LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection YiMing Yu et.al. 2504.02280 null
2025-04-02 Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs Lewis Matheson Creed et.al. 2504.02163 null
2025-04-02 A thorough benchmark of automatic text classification: From traditional approaches to large language models Washington Cunha et.al. 2504.01930 link
2025-04-02 A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning Yuyang Qiu et.al. 2504.01839 null
2025-04-02 A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines Calvin Kinateder et.al. 2504.01798 null
2025-04-02 Token Pruning in Audio Transformers: Optimizing Performance and Decoding Patch Importance Taehan Lee et.al. 2504.01690 link
2025-04-02 All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning Zheng Yang et.al. 2504.01396 null
2025-04-01 TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification Kimia haghjooei et.al. 2504.01228 null
2025-04-01 PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image Classification Salim Khazem et.al. 2504.01214 link
2025-04-01 Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems Rachmad Vidya Wicaksana Putra et.al. 2504.00957 null
2025-04-01 Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models Alireza Aghabagherloo et.al. 2504.00638 null
2025-04-01 Geometric Median Matching for Robust k-Subset Selection from Noisy Data Anish Acharya et.al. 2504.00564 null
2025-03-31 NoProp: Training Neural Networks without Back-propagation or Forward-propagation Qinyu Li et.al. 2503.24322 null
2025-03-31 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Yingrui Ji et.al. 2503.24182 null
2025-03-31 PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization Alexis Guichemerre et.al. 2503.24135 link
2025-03-31 Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification Chenqi Guo et.al. 2503.24017 null
2025-03-31 FlexiMo: A Flexible Remote Sensing Foundation Model Xuyang Li et.al. 2503.23844 null
2025-03-31 Expanding-and-Shrinking Binary Neural Networks Xulong Shi et.al. 2503.23709 link
2025-03-31 WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation Zhengyi Zhao et.al. 2503.23673 null
2025-03-30 Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification Guandong Li et.al. 2503.23472 null
2025-03-30 KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters Haiduo Huang et.al. 2503.23379 link
2025-03-29 Optimizing Distributed Training Approaches for Scaling Neural Networks Vishnu Vardhan Baligodugula et.al. 2503.23186 null
2025-03-28 Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models YangTian Yan et.al. 2503.22205 link
2025-03-28 Route-and-Aggregate Decentralized Federated Learning Under Communication Errors Weicai Li et.al. 2503.22186 null
2025-03-27 On Large Multimodal Models as Open-World Image Classifiers Alessandro Conti et.al. 2503.21851 link
2025-03-27 Bayesian Pseudo Posterior Mechanism for Differentially Private Machine Learning Robert Chew et.al. 2503.21528 null
2025-03-27 Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures Deependra Singh et.al. 2503.21465 link
2025-03-27 Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records Noah Losch et.al. 2503.21349 null
2025-03-27 Improving $(α, f)$ -Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance Mario García-Márquez et.al. 2503.21244 link
2025-03-27 Neural Architecture Search by Learning a Hierarchical Search Space Mehraveh Javan Roshtkhari et.al. 2503.21061 null
2025-03-26 TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models Caspar Meijer et.al. 2503.20952 link
2025-03-26 VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers Ching-Yao Chen et.al. 2503.20246 null
2025-03-26 BeLightRec: A lightweight recommender system enhanced with BERT Manh Mai Van et.al. 2503.20206 null
2025-03-25 Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders Paul Koch et.al. 2503.19947 null
2025-03-25 Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification Daniel G. P. Petrini et.al. 2503.19945 null
2025-03-25 Extensions of regret-minimization algorithm for optimal design Youguang Chen et.al. 2503.19874 null
2025-03-25 VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models Suhas G Hegde et.al. 2503.19530 null
2025-03-25 LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text Weizhi Chen et.al. 2503.19311 null
2025-03-25 Face Spoofing Detection using Deep Learning Najeebullah et.al. 2503.19223 link
2025-03-24 Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation DeShin Hwa et.al. 2503.18862 null
2025-03-24 Latent Space Class Dispersion: Effective Test Data Quality Assessment for DNNs Vivek Vekariya et.al. 2503.18799 null
2025-03-24 Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Nina Shvetsova et.al. 2503.18637 null
2025-03-24 Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification Zequn Zeng et.al. 2503.18483 null
2025-03-24 Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Junsong Li et.al. 2503.18432 null
2025-03-24 Sun-Shine: A Large Language Model for Tibetan Culture Cheng Huang et.al. 2503.18288 null
2025-03-23 Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry Chi-Ning Chou et.al. 2503.18114 null
2025-03-23 What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images Dongheng Lin et.al. 2503.17899 null
2025-03-21 Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis Gary Y. Li et.al. 2503.17475 null
2025-03-21 Leveraging Text-to-Image Generation for Handling Spurious Correlation Aryan Yazdan Parast et.al. 2503.17226 null
2025-03-21 CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images Tonmoy Hossain ana Miaomiao Zhang et.al. 2503.17162 null
2025-03-21 Beyond Accuracy: What Matters in Designing Well-Behaved Models? Robin Hesse et.al. 2503.17110 null
2025-03-21 Symbolic Audio Classification via Modal Decision Tree Learning Enrico Marzano et.al. 2503.17018 null
2025-03-21 EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision Xiaofeng Mao et.al. 2503.16975 null
2025-03-21 City2Scene: Improving Acoustic Scene Classification with City Features Yiqiang Cai et.al. 2503.16862 null
2025-03-20 MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification Moshiur Rahman Tonmoy et.al. 2503.16628 null
2025-03-20 PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification Sharon Peled et.al. 2503.16284 link
2025-03-20 CLS-RL: Image Classification with Rule-Based Reinforcement Learning Ming Li et.al. 2503.16188 null
2025-03-20 Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models Mario Sanz-Guerrero et.al. 2503.16022 link
2025-03-20 Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation Clive Tinashe Marimo et.al. 2503.15969 null
2025-03-19 Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification Yuqing Zhang et.al. 2503.15731 null
2025-03-20 Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification ZhengLin Lai et.al. 2503.15469 link
2025-03-19 Test-Time Backdoor Detection for Object Detection Models Hangtao Zhang et.al. 2503.15293 null
2025-03-19 Efficient allocation of image recognition and LLM tasks on multi-GPU system Marcin Lawenda et.al. 2503.15252 null
2025-03-19 Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks Yuting Guo et.al. 2503.15169 null
2025-03-19 ARC: Anchored Representation Clouds for High-Resolution INR Classification Joost Luijmes et.al. 2503.15156 null
2025-03-19 Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models Tingxiu Chen et.al. 2503.14966 null
2025-03-19 Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification Zhong Ji et.al. 2503.14938 null
2025-03-18 RAT: Boosting Misclassification Detection Ability without Extra Data Ge Yan et.al. 2503.14783 null
2025-03-18 LipShiFT: A Certifiably Robust Shift-based Vision Transformer Rohan Menon et.al. 2503.14751 null
2025-03-18 Utilization of Neighbor Information for Image Classification with Different Levels of Supervision Gihan Jayatilaka et.al. 2503.14500 null
2025-03-17 Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition Atharva Agashe et.al. 2503.13763 null
2025-03-17 Micro Text Classification Based on Balanced Positive-Unlabeled Learning Lin-Han Jia et.al. 2503.13562 null
2025-03-17 Escaping Plato's Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes Nhi Pham et.al. 2503.13429 null
2025-03-17 Do Vision Models Develop Human-Like Progressive Difficulty Understanding? Zeyi Huang et.al. 2503.13058 null
2025-03-16 Domain Generalization for Improved Human Activity Recognition in Office Space Videos Using Adaptive Pre-processing Partho Ghosh et.al. 2503.12678 null
2025-03-16 Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance Anthony Lamelas et.al. 2503.12617 null
2025-03-16 Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy Jian-Ping Mei et.al. 2503.12497 null
2025-03-16 GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing Zilun Zhang et.al. 2503.12490 null
2025-03-16 Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation Edgar Heinert et.al. 2503.12453 null
2025-03-16 MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification Jianwei Zhao et.al. 2503.12401 null
2025-03-15 TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification Ans Munir et.al. 2503.12206 null
2025-03-15 Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification Ahcen Aliouat et.al. 2503.11954 null
2025-03-14 Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification Tobias Morocutti et.al. 2503.11363 null
2025-03-14 PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models Mayank Nautiyal et.al. 2503.11360 null
2025-03-14 APLA: A Simple Adaptation Method for Vision Transformers Moein Sorkhei et.al. 2503.11335 null
2025-03-14 Open-Set Plankton Recognition Joona Kareinen et.al. 2503.11318 null
2025-03-14 MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery Yansheng Li et.al. 2503.11219 null
2025-03-14 Falcon: A Remote Sensing Vision-Language Foundation Model Kelu Yao et.al. 2503.11070 null
2025-03-13 $(\varepsilon, δ)$ Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees Juan Felipe Gomez et.al. 2503.10945 null
2025-03-13 Learning Interpretable Logic Rules from Deep Vision Models Chuqin Geng et.al. 2503.10547 null
2025-03-13 Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification Rajiv Krishnakumar et.al. 2503.10510 null
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 PS3C: An Ensemble-Based Two-Step Framework for Classification of Pep Smear Cell Images Theo Di Piazza et.al. 2503.10312 link
2025-03-13 Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions Hsuvas Borkakoty et.al. 2503.10294 null
2025-03-13 A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification Barış Büyüktaş et.al. 2503.10262 null
2025-03-13 Interpretable Image Classification via Non-parametric Part Prototype Learning Zhijie Zhu et.al. 2503.10247 null
2025-03-13 Multiplicative Learning Han Kim et.al. 2503.10144 null
2025-03-13 Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text Avinash Patil et.al. 2503.10095 null
2025-03-13 Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild Damien Teney et.al. 2503.10065 null
2025-03-12 Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching Nannan Wu et.al. 2503.09587 null
2025-03-12 Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework Bakary Badjie et.al. 2503.09504 null
2025-03-12 ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation Tobias Christian Nauen et.al. 2503.09399 null
2025-03-12 Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity Daniel Jiménez-López et.al. 2503.09365 null
2025-03-12 Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X Katharina Prasse et.al. 2503.09361 null
2025-03-12 Bayesian Test-Time Adaptation for Vision-Language Models Lihua Zhou et.al. 2503.09248 null
2025-03-12 Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information Youngju Joung et.al. 2503.09068 null
2025-03-12 Discovering Influential Neuron Path in Vision Transformers Yifan Wang et.al. 2503.09046 null
2025-03-11 KAN-Mixers: a new deep learning architecture for image classification Jorge Luiz dos Santos Canuto et.al. 2503.08939 null
2025-03-12 MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification Jiangping Wen et.al. 2503.08581 null
2025-03-11 Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview Ahmad Chaddad et.al. 2503.08420 null
2025-03-11 Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification Susu Sun et.al. 2503.08384 null
2025-03-11 Tangentially Aligned Integrated Gradients for User-Friendly Explanations Lachlan Simpson et.al. 2503.08240 null
2025-03-11 EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification Saad Sohail et.al. 2503.08239 null
2025-03-11 Identification of Star Clusters in M31 from PAndAS Images Based on Deep Learning Baisong Zhang et.al. 2503.08130 null
2025-03-11 LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking Yan Yan et.al. 2503.07968 null
2025-03-12 Measuring directional bias amplification in image captions using predictability Rahul Nair et.al. 2503.07878 null
2025-03-10 Fair Text Classification via Transferable Representations Thibaud Leteno et.al. 2503.07691 null
2025-03-10 Keeping Representation Similarity in Finetuning for Medical Image Analysis Wenqiang Zu et.al. 2503.07399 null
2025-03-10 Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification Kexin Di et.al. 2503.07396 null
2025-03-10 Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs Gonzalo Mancera et.al. 2503.07384 null
2025-03-10 Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification Thomas Boucher et.al. 2503.07294 null
2025-03-10 A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding Bingchen Liu et.al. 2503.07202 null
2025-03-10 Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization Ziqing Xu et.al. 2503.06982 null
2025-03-10 Task Vector Quantization for Memory-Efficient Model Merging Youngeun Kim et.al. 2503.06921 null
2025-03-10 MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification Xiangyan Qu et.al. 2503.06847 null
2025-03-09 Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals Hanze Li et.al. 2503.06473 null
2025-03-09 M $^3$ amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification Mingxiang Cao et.al. 2503.06446 null
2025-03-07 Similarity-Based Domain Adaptation with LLMs Jie He et.al. 2503.05281 null
2025-03-07 Spatial Context-Driven Positive Pair Sampling for Enhanced Histopathology Image Classification Willmer Rafell Quinones Robles et.al. 2503.05170 null
2025-03-07 Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy Ruixi Lin et.al. 2503.05157 null
2025-03-07 Grouped Sequential Optimization Strategy -- the Application of Hyperparameter Importance Assessment in Deep Learning Ruinan Wang et.al. 2503.05106 null
2025-03-06 HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model Xuheng Cai et.al. 2503.04996 null
2025-03-06 Label Distribution Learning-Enhanced Dual-KNN for Text Classification Bo Yuan et.al. 2503.04869 null
2025-03-06 Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification Van Bach Nguyen et.al. 2503.04463 null
2025-03-06 WeakSupCon: Weakly Supervised Contrastive Learning for Encoder Pre-training Bodong Zhang et.al. 2503.04165 null
2025-03-04 Measurement noise scaling laws for cellular representation learning Gokul Gowri et.al. 2503.02726 null
2025-03-04 XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification Xiaoyu Zheng et.al. 2503.02619 null
2025-03-04 Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques Mustafa Majeed Abd Zaid et.al. 2503.02510 null
2025-03-06 Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer Yujiao Yang et.al. 2503.02495 null
2025-03-04 Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts Tong Liang et.al. 2503.02248 null
2025-03-04 Sharpness-Aware Minimization: General Analysis and Improved Rates Dimitris Oikonomou et.al. 2503.02225 null
2025-03-03 Mathematical Foundation of Interpretable Equivariant Surrogate Models Jacopo Joy Colombini et.al. 2503.01942 null
2025-03-03 Visual-RFT: Visual Reinforcement Fine-Tuning Ziyu Liu et.al. 2503.01785 link
2025-03-03 Mamba base PKD for efficient knowledge compression José Medina et.al. 2503.01727 null
2025-03-04 SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting Ali Caglayan et.al. 2503.01181 null
2025-03-03 Large Language Models for Healthcare Text Classification: A Systematic Review Hajar Sakai et.al. 2503.01159 null
2025-03-03 Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning Jiuyang Dong et.al. 2502.21130 null
2025-02-28 Comparative study of the ansätze in quantum language models Jordi Del Castillo et.al. 2502.20744 null
2025-02-28 Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness Hao Xuan et.al. 2502.20604 null
2025-02-27 In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models Hu Wang et.al. 2502.20516 null
2025-02-27 Online Meta-learning for AutoML in Real-time (OnMAR) Mia Gerber et.al. 2502.20279 null
2025-03-03 Gradient-Guided Annealing for Domain Generalization Aristotelis Ballas et.al. 2502.20162 link
2025-02-27 QPM: Discrete Optimization for Globally Interpretable Image Classification Thomas Norrenbrock et.al. 2502.20130 link
2025-02-27 ProAPO: Progressively Automatic Prompt Optimization for Visual Classification Xiangyan Qu et.al. 2502.19844 link
2025-02-27 Text classification using machine learning methods Bogdan Oancea et.al. 2502.19801 null
2025-02-27 InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models Shuchang Zhou et.al. 2502.19777 null
2025-02-27 Learning Mask Invariant Mutual Information for Masked Image Modeling Tao Huang et.al. 2502.19718 null
2025-02-27 Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model Yimin Zhu et.al. 2502.19700 null
2025-02-27 Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification Yimin Zhu et.al. 2502.19699 null
2025-02-27 A Residual Multi-task Network for Joint Classification and Regression in Medical Imaging Junji Lin et.al. 2502.19692 null
2025-02-26 I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning Stephan Rabanser et.al. 2502.19335 null
2025-02-26 Active Few-Shot Learning for Text Classification Saeed Ahmadnia et.al. 2502.18782 null
2025-02-25 Enhancing Image Classification with Augmentation: Data Augmentation Techniques for Improved Image Classification Saorj Kumar et.al. 2502.18691 null
2025-02-25 Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT Hediyeh Baban et.al. 2502.18653 null
2025-02-25 MedKAN: An Advanced Kolmogorov-Arnold Network for Medical Image Classification Zhuoqin Yang et.al. 2502.18416 null
2025-02-26 A Fusion Model for Art Author Identification Based on Convolutional Neural Networks and Transformers Zhenyu Wang et.al. 2502.18083 null
2025-02-25 MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification Varun Vashisht et.al. 2502.17987 null
2025-02-25 Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification Rong Liu et.al. 2502.17879 null
2025-02-24 Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? Sushmita Sarker et.al. 2502.17727 null
2025-02-24 A Priori Generalizability Estimate for a CNN Cito Balsells et.al. 2502.17622 null
2025-02-24 Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models Andrew DiGiugno et.al. 2502.17206 null
2025-02-24 Disentangling Visual Transformers: Patch-level Interpretability for Image Classification Guillaume Jeanneret et.al. 2502.17196 null
2025-02-24 Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Chenghao Fan et.al. 2502.16894 null
2025-02-24 Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data Yejian Zhang et.al. 2502.16892 null
2025-02-24 A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition Dewan Tauhid Rahman et.al. 2502.16762 null
2025-02-23 AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction Rui Liu et.al. 2502.16736 null
2025-02-22 MOB-GCN: A Novel Multiscale Object-Based Graph Neural Network for Hyperspectral Image Classification Tuan-Anh Yang et.al. 2502.16289 link
2025-02-22 A Multi-Scale Isolation Forest Approach for Real-Time Detection and Filtering of FGSM Adversarial Attacks in Video Streams of Autonomous Vehicles Richard Abhulimhen et.al. 2502.16044 null
2025-02-21 MMRAG: Multi-Mode Retrieval-Augmented Generation with Large Language Models for Biomedical In-Context Learning Zaifu Zhan et.al. 2502.15954 null
2025-02-21 Directional Gradient Projection for Robust Fine-Tuning of Foundation Models Chengyue Huang et.al. 2502.15895 null
2025-02-21 MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models Suraj Racha et.al. 2502.15418 null
2025-02-21 A Novel Riemannian Sparse Representation Learning Network for Polarimetric SAR Image Classification Junfei Shi et.al. 2502.15302 null
2025-02-21 Quantum autoencoders for image classification Hinako Asaoka et.al. 2502.15254 null
2025-02-21 Steganographic Embeddings as an Effective Data Augmentation Nicholas DiSalvo et.al. 2502.15245 null
2025-02-21 Learning to Collaborate: A Capability Vectors-based Architecture for Adaptive Human-AI Decision Making Renlong Jie et.al. 2502.15196 null
2025-02-21 TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Xiuwei Chen et.al. 2502.15130 null
2025-02-20 Fundamental Survey on Neuromorphic Based Audio Classification Amlan Basu et.al. 2502.15056 null
2025-02-20 Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications Maha Ezzelarab et.al. 2502.14995 null
2025-02-20 Sparse Activations as Conformal Predictors Margarida M. Campos et.al. 2502.14773 link
2025-02-20 An Enhancement of Jiang, Z., et al.s Compression-Based Classification Algorithm Applied to News Article Categorization Sean Lester C. Benavides et.al. 2502.14444 null
2025-02-20 Stochastic Resonance Improves the Detection of Low Contrast Images in Deep Learning Models Siegfried Ludwig et.al. 2502.14442 null
2025-02-20 Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models Artem Vazhentsev et.al. 2502.14427 null
2025-02-20 Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2502.14416 null
2025-02-20 QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification Hajar Sakai et.al. 2502.14189 null
2025-02-19 Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification Xuansheng Wu et.al. 2502.14133 null
2025-02-19 Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention Omid Nejati Manzari et.al. 2502.13693 link
2025-02-18 Language Models Can Predict Their Own Behavior Dhananjay Ashok et.al. 2502.13329 null
2025-02-18 Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models Sirisha Velampalli et.al. 2502.13278 null
2025-02-18 Private Text Generation by Seeding Large Language Model Prompts Supriya Nagesh et.al. 2502.13193 null
2025-02-18 RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals Jaemu Heo et.al. 2502.13181 null
2025-02-18 Benchmarking MedMNIST dataset on real quantum hardware Gurinder Singh et.al. 2502.13056 null
2025-02-18 Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts Sunay Joshi et.al. 2502.13030 null
2025-02-18 A Survey of Text Classification Under Class Distribution Shift Adriana Valentina Costache et.al. 2502.12965 null
2025-02-18 Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text Andrei Jarca et.al. 2502.12953 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 null
2025-02-18 When Segmentation Meets Hyperspectral Image: New Paradigm for Hyperspectral Image Classification Weilian Zhou et.al. 2502.12541 null
2025-02-17 Achieving Upper Bound Accuracy of Joint Training in Continual Learning Saleh Momeni et.al. 2502.12388 null
2025-02-17 OCT Data is All You Need: How Vision Transformers with and without Pre-training Benefit Imaging Zihao Han et.al. 2502.12379 null
2025-02-17 AdaSplash: Adaptive Sparse Flash Attention Nuno Gonçalves et.al. 2502.12082 null
2025-02-17 Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning Aurian Quelennec et.al. 2502.12031 null
2025-02-17 Text Classification in the LLM Era - Where do we stand? Sowmya Vajjala et.al. 2502.11830 null
2025-02-17 Variable-frame CNNLSTM for Breast Nodule Classification using Ultrasound Videos Xiangxiang Cui et.al. 2502.11481 null
2025-02-16 Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification Thanushon Sivakaran et.al. 2502.11258 null
2025-02-16 UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation Arka Mukherjee et.al. 2502.11132 null
2025-02-16 Towards Achieving Concept Completeness for Unsupervised Textual Concept Bottleneck Models Milan Bhan et.al. 2502.11100 null
2025-02-16 Leveraging Large Language Models for Cybersecurity: Enhancing SMS Spam Detection with Robust and Context-Aware Text Classification Mohsen Ahmadi et.al. 2502.11014 null
2025-02-15 Simulations of Common Unsupervised Domain Adaptation Algorithms for Image Classification Ahmad Chaddad et.al. 2502.10694 null
2025-02-15 REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation Ran Li et.al. 2502.10663 null
2025-02-14 Simplifying DINO via Coding Rate Regularization Ziyang Wu et.al. 2502.10385 null
2025-02-14 Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network Arun Kunwar et.al. 2502.10334 null
2025-02-14 SeWA: Selective Weight Average via Probabilistic Masking Peng Wang et.al. 2502.10119 null
2025-02-14 On Space Folds of ReLU Neural Networks Michal Lewandowski et.al. 2502.09954 null
2025-02-13 A CNN Approach to Automated Detection and Classification of Brain Tumors Md. Zahid Hasan et.al. 2502.09731 null
2025-02-13 GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis Angelos Zavras et.al. 2502.09598 link
2025-02-14 Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering Mark Beliaev et.al. 2502.09573 null
2025-02-13 Feature-based Graph Attention Networks Improve Online Continual Learning Adjovi Sim et.al. 2502.09143 null
2025-02-13 A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning Jia Gao et.al. 2502.09086 null
2025-02-13 Hierarchical Vision Transformer with Prototypes for Interpretable Medical Image Classification Luisa Gallée et.al. 2502.08997 null
2025-02-13 Quantum Approaches for Dysphonia Assessment in Small Speech Datasets Ha Tran et.al. 2502.08968 null
2025-02-12 Measuring Diversity in Synthetic Datasets Yuchang Zhu et.al. 2502.08512 null
2025-02-12 ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification Jiangbo Shi et.al. 2502.08391 null
2025-02-12 Keep your distance: learning dispersed embeddings on $\mathbb{S}_d$ Evgeniia Tokarchuk et.al. 2502.08231 null
2025-02-12 Riemannian Complex Hermit Positive Definite Convolution Network for Polarimetric SAR Image Classification Junfei Shi et.al. 2502.08137 null
2025-02-12 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 null
2025-02-12 Can Machine Learning Support the Selection of Studies for Systematic Literature Review Updates? Marcelo Costalonga et.al. 2502.08050 null
2025-02-11 ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans Ashkan Shahbazi et.al. 2502.07962 null
2025-02-11 Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers Zhaodong Bing et.al. 2502.07436 null
2025-02-11 MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks Lotfi Abdelkrim Mecharbat et.al. 2502.07422 null
2025-02-11 MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification Anh-Tien Nguyen et.al. 2502.07409 null
2025-02-11 Don't Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification Peipei Wei et.al. 2502.07165 null
2025-02-10 From Image to Video: An Empirical Study of Diffusion Representations Pedro Vélez et.al. 2502.07001 null
2025-02-10 Krum Federated Chain (KFC): Using blockchain to defend against adversarial attacks in Federated Learning Mario García-Márquez et.al. 2502.06917 null
2025-02-10 Enhancing Performance of Explainable AI Models with Constrained Concept Refinement Geyu Liang et.al. 2502.06775 null
2025-02-10 Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments Marc Felix Brinner et.al. 2502.06551 null
2025-02-10 Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2502.06427 null
2025-02-10 Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead Won-Jun Jang et.al. 2502.06349 null
2025-02-10 From Pixels to Components: Eigenvector Masking for Visual Representation Learning Alice Bizeul et.al. 2502.06314 null
2025-02-10 Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation Lingkun Luo et.al. 2502.06272 null
2025-02-10 Multi-Scale Transformer Architecture for Accurate Medical Image Classification Jiacheng Hu et.al. 2502.06243 null
2025-02-10 Low Tensor-Rank Adaptation of Kolmogorov--Arnold Networks Yihang Gao et.al. 2502.06153 null
2025-02-09 Benchmarking Prompt Sensitivity in Large Language Models Amirhossein Razavi et.al. 2502.06065 null
2025-02-09 ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification Yashwanth M. et.al. 2502.05923 null
2025-02-07 Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights Ondřej Týbl et.al. 2502.04975 null
2025-02-07 Enhancing Disinformation Detection with Explainable AI and Named Entity Replacement Santiago González-Silot et.al. 2502.04863 null
2025-02-07 AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers Runqing Jiang et.al. 2502.04628 null
2025-02-06 Augmented Conditioning Is Enough For Effective Training Image Generation Jiahui Chen et.al. 2502.04475 null
2025-02-06 How does a Multilingual LM Handle Multiple Languages? Santhosh Kakarla et.al. 2502.04269 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 null
2025-02-06 Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis Juming Xiong et.al. 2502.04199 null
2025-02-06 Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis Lin Yuan et.al. 2502.03843 null
2025-02-06 Self-Supervised Learning for Solar Radio Spectrum Classification Siqi Li et.al. 2502.03778 null
2025-02-06 Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free Gian Mario Favero et.al. 2502.03687 null
2025-02-05 A Study in Dataset Distillation for Image Super-Resolution Tobias Dietz et.al. 2502.03656 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 null
2025-02-05 Clinically-Inspired Hierarchical Multi-Label Classification of Chest X-rays with a Penalty-Based Loss Function Mehrdad Asadi et.al. 2502.03591 link
2025-02-05 Optimal Task Order for Continual Learning of Multiple Tasks Ziyan Li et.al. 2502.03350 null
2025-02-05 Out-of-Distribution Detection using Synthetic Data Generation Momin Abbas et.al. 2502.03323 null
2025-02-05 Long-tailed Medical Diagnosis with Relation-aware Representation Learning and Iterative Classifier Calibration Li Pan et.al. 2502.03238 null
2025-02-05 Adversarial Dependence Minimization Pierre-François De Plaen et.al. 2502.03227 null
2025-02-05 Disentangling CLIP Features for Enhanced Localized Understanding Samyak Rawelekar et.al. 2502.02977 null
2025-02-05 Slowing Learning by Erasing Simple Features Lucia Quirke et.al. 2502.02820 null
2025-02-04 The Skin Game: Revolutionizing Standards for AI Dermatology Model Comparison Łukasz Miętkiewicz et.al. 2502.02500 null
2025-02-04 BRIDLE: Generalized Self-supervised Learning with Quantization Hoang M. Nguyen et.al. 2502.02118 null
2025-02-04 DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification Weijia Cao et.al. 2502.01986 null
2025-02-04 Generative Data Mining with Longtail-Guided Diffusion David S. Hayden et.al. 2502.01980 null
2025-02-03 A Multi-Scale Feature Fusion Framework Integrating Frequency Domain and Cross-View Attention for Dual-View X-ray Security Inspections Shilong Hong et.al. 2502.01710 null
2025-02-03 Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss Sangyeon Park et.al. 2502.01342 null
2025-02-03 A Framework for Double-Blind Federated Adaptation of Foundation Models Nurbek Tastan et.al. 2502.01289 null
2025-02-02 Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications Yixin Wu et.al. 2502.00808 null
2025-02-02 Enhanced Convolutional Neural Networks for Improved Image Classification Xiaoran Yang et.al. 2502.00663 null
2025-02-01 Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing Saarthak Kapse et.al. 2502.00594 null
2025-01-31 Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach Yingdan Shi et.al. 2501.19403 null
2025-01-31 An All-digital 65-nm Tsetlin Machine Image Classification Accelerator with 8.6 nJ per MNIST Frame at 60.3k Frames per Second Svein Anders Tunheim et.al. 2501.19347 null
2025-01-31 Through the Looking Glass: LLM-Based Analysis of AR/VR Android Applications Privacy Policies Abdulaziz Alghamdi et.al. 2501.19223 null
2025-01-31 Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification Xiangyu Sun et.al. 2501.19086 null
2025-01-31 Memory-Efficient Fine-Tuning of Transformers via Token Selection Antoine Simoulin et.al. 2501.18824 null
2025-01-30 OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization Kelvin Kan et.al. 2501.18793 null
2025-01-29 Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis Kunrong Li et.al. 2501.17598 null
2025-01-28 Extending Information Bottleneck Attribution to Video Sequences Veronika Solopova et.al. 2501.16889 link
2025-01-28 Misspellings in Natural Language Processing: A survey Gianluca Sperduti et.al. 2501.16836 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-28 Toward Relative Positional Encoding in Spiking Transformers Changze Lv et.al. 2501.16745 null
2025-01-28 Improving Interpretability and Accuracy in Neuro-Symbolic Rule Extraction Using Class-Specific Sparse Filters Parth Padalkar et.al. 2501.16677 null
2025-01-27 Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM Payal Kamboj et.al. 2501.16481 link
2025-01-28 SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP Li Pang et.al. 2501.16222 link
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data Judith Sáinz-Pardo Díaz et.al. 2501.15949 null
2025-01-26 Quantum-Enhanced Attention Mechanism in NLP: A Hybrid Classical-Quantum Approach S. M. Yousuf Iqbal Tomal et.al. 2501.15630 null
2025-01-26 Building Efficient Lightweight CNN Models Nathan Isong et.al. 2501.15547 null
2025-01-26 Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition Ying Zheng et.al. 2501.15519 null
2025-01-26 Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer Hu Hu et.al. 2501.15496 null
2025-01-25 Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning Yu Qiao et.al. 2501.15257 null
2025-01-24 Feasible Learning Juan Ramirez et.al. 2501.14912 link
2025-01-24 Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST Fuping Wu et.al. 2501.14685 null
2025-01-24 Geometric Mean Improves Loss For Few-Shot Learning Tong Wu et.al. 2501.14593 null
2025-01-24 Idiom Detection in Sorani Kurdish Texts Skala Kamaran Omer et.al. 2501.14528 null
2025-01-24 $SpikePack$ : Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility Guobin Shen et.al. 2501.14484 null
2025-01-24 Impact of Batch Normalization on Convolutional Network Representations Hermanus L. Potgieter et.al. 2501.14441 null
2025-01-24 Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation Tasnim Ahmed et.al. 2501.14412 null
2025-01-24 Correlation-Based Band Selection for Hyperspectral Image Classification Dibyabha Deb et.al. 2501.14338 link
2025-01-24 Relative Layer-Wise Relevance Propagation: a more Robust Neural Networks eXplaination Eric Nyiri et.al. 2501.14322 null
2025-01-24 A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques Lifu Gao et.al. 2501.14288 null
2025-01-24 TLXML: Task-Level Explanation of Meta-Learning via Influence Functions Yoshihiro Mitsuka et.al. 2501.14271 null
2025-01-23 A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference Duc Hau Nguyen et.al. 2501.13735 null
2025-01-23 A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification Younes Yousef et.al. 2501.13598 link
2025-01-23 Multi-Level Attention and Contrastive Learning for Enhanced Text Classification with an Optimized Transformer Jia Gao et.al. 2501.13467 null
2025-01-23 Atmospheric Noise-Resilient Image Classification in a Real-World Scenario: Using Hybrid CNN and Pin-GTSVM Shlok Mehendale et.al. 2501.13422 null
2025-01-23 AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning Arpit Garg et.al. 2501.13389 null
2025-01-23 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 null
2025-01-22 Revisiting Data Augmentation for Ultrasound Images Adam Tupper et.al. 2501.13193 link
2025-01-22 Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation Duc Hau Nguyen et.al. 2501.12775 link
2025-01-22 Estimating the Conformal Prediction Threshold from Noisy Labels Coby Penso et.al. 2501.12749 link
2025-01-22 Adapting OpenAI's CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples Fadel M. Megahed et.al. 2501.12596 null
2025-01-21 Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor Jiaqi Guo et.al. 2501.12524 null
2025-01-21 CCESAR: Coastline Classification-Extraction From SAR Images Using CNN-U-Net Combination Vidhu Arora et.al. 2501.12384 null
2025-01-21 CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification Cristiano Patrício et.al. 2501.12266 null
2025-01-21 Early Detection and Classification of Breast Cancer Using Deep Learning Techniques Mst. Mumtahina Labonno et.al. 2501.12217 null
2025-01-21 UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model Branislava Jankovic et.al. 2501.12087 null
2025-01-20 Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification Jonas Klotz et.al. 2501.11493 null
2025-01-22 QGAIC: Quantum Inspired Genetic Algorithm for Image Classification Akhilesh Kumar Singh et.al. 2501.11477 null
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jiaxiang Liu et.al. 2501.11231 link
2025-01-19 CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning William Doherty et.al. 2501.11140 link
2025-01-19 Leveraging counterfactual concepts for debugging and improving CNN model performance Syed Ali Tariq et.al. 2501.11087 null
2025-01-17 A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features Enes Karanfil et.al. 2501.10144 null
2025-01-17 Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification Michael Schulze et.al. 2501.10089 null
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Wei Lu et.al. 2501.10040 link
2025-01-16 Empirical Evaluation of Embedding Models in the Context of Text Classification in Document Review in Construction Delay Disputes Fusheng Wei et.al. 2501.09859 null
2025-01-16 SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification Yuexi Du et.al. 2501.09753 link
2025-01-16 Practical Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2501.09705 link
2025-01-16 Multimodal Marvels of Deep Learning in Medical Diagnosis: A Comprehensive Review of COVID-19 Detection Md Shofiqul Islama et.al. 2501.09506 link
2025-01-16 HydraMix: Multi-Image Feature Mixing for Small Data Image Classification Christoph Reinders et.al. 2501.09504 null
2025-01-16 Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments Minh K. Quan et.al. 2501.09394 null
2025-01-16 Shape-Based Single Object Classification Using Ensemble Method Classifiers Nur Shazwani Kamarudin et.al. 2501.09311 null
2025-01-16 Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning Harrison Fuller et.al. 2501.09294 null
2025-01-16 A Simple Graph Contrastive Learning Framework for Short Text Classification Yonghao Liu et.al. 2501.09219 link
2025-01-16 Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning Yonghao Liu et.al. 2501.09214 link
2025-01-15 Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment Conrad Borchers et.al. 2501.09126 null
2025-01-15 IDEA: Image Description Enhanced CLIP-Adapter Zhipeng Ye et.al. 2501.08816 null
2025-01-15 MIAFEx: An Attention-based Feature Extraction Method for Medical Image Classification Oscar Ramos-Soto et.al. 2501.08562 null
2025-01-14 Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Mihai Masala et.al. 2501.08460 null
2025-01-14 Large Language Models For Text Classification: Case Study And Comprehensive Review Arina Kostina et.al. 2501.08457 null
2025-01-14 READ: Reinforcement-based Adversarial Learning for Text Classification with Limited Labeled Data Rohit Sharma et.al. 2501.08035 null
2025-01-14 Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins Ilker Oguz et.al. 2501.07991 null
2025-01-14 deepTerra -- AI Land Classification Made Easy Andrew Keith Wilkinson et.al. 2501.07859 null
2025-01-14 A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition Mingke Xiao et.al. 2501.07808 null
2025-01-14 Balance Divergence for Knowledge Distillation Yafei Qi et.al. 2501.07804 null
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-13 Universal Training of Neural Networks to Achieve Bayes Optimal Classification Accuracy Mohammadreza Tavasoli Naeini et.al. 2501.07754 null
2025-01-13 Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction Paul Melki et.al. 2501.07185 null
2025-01-13 Adaptive Noise-Tolerant Network for Image Segmentation Weizhi Li et.al. 2501.07163 null
2025-01-12 LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier Haojun Yu et.al. 2501.06862 link
2025-01-12 Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures Samia Mehnaz et.al. 2501.06740 null
2025-01-12 Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution Ashitha Mudraje et.al. 2501.06720 null
2025-01-11 Synthetic Feature Augmentation Improves Generalization Performance of Language Models Ashok Choudhary et.al. 2501.06434 null
2025-01-10 Kolmogorov-Arnold networks for metal surface defect classification Maciej Krzywda et.al. 2501.06389 null
2025-01-10 Merging Feed-Forward Sublayers for Compressed Transformers Neha Verma et.al. 2501.06126 link
2025-01-10 Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problems Steffen Dereich et.al. 2501.06081 link
2025-01-10 Constrained Over-the-Air Model Updating for Wireless Online Federated Learning with Delayed Information Juncheng Wang et.al. 2501.05637 null
2025-01-10 The Impact of Model Scaling on Seen and Unseen Language Performance Rhitabrat Pokharel et.al. 2501.05629 null
2025-01-09 Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding Mohammed Elhenawy et.al. 2501.05566 null
2025-01-09 Spatial Information Integration in Small Language Models for Document Layout Generation and Classification Pablo Melendez et.al. 2501.05497 null
2025-01-09 An Empirical Study of Autoregressive Pre-training from Videos Jathushan Rajasegaran et.al. 2501.05453 null
2025-01-09 A 1Mb mixed-precision quantized encoder for image classification and patch-based compression Van Thien Nguyen et.al. 2501.05097 null
2025-01-09 A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field Ziyang Gao et.al. 2501.04996 null
2025-01-09 MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification Yapeng Li et.al. 2501.04944 null
2025-01-09 A New Perspective on Privacy Protection in Federated Learning with Granular-Ball Computing Guannan Lai et.al. 2501.04940 link
2025-01-09 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries Keke Huang et.al. 2501.04901 null
2025-01-09 Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks Seyed Amir Bidaki et.al. 2501.04897 link
2025-01-08 Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures Ziyuan Huang et.al. 2501.04700 null
2025-01-08 Discrete Wavelet Transform-Based Capsule Network for Hyperspectral Image Classification Zhiqiang Gao et.al. 2501.04643 null
2025-01-08 Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images Yuze Wang et.al. 2501.04283 null
2025-01-08 Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection Jimi Togni et.al. 2501.04196 null
2025-01-07 Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification Satchel French et.al. 2501.03967 link
2025-01-07 Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback Jiakang Yuan et.al. 2501.03916 null
2025-01-07 MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention Aadya Arora et.al. 2501.03839 null
2025-01-07 LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging Shubhr Singh et.al. 2501.03464 null
2025-01-06 FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification Keyvan RahimiZadeh et.al. 2501.03349 link
2025-01-06 CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Tanay Agrawal et.al. 2501.03332 null
2025-01-06 Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective Md. Jalal Uddin Chowdhury et.al. 2501.03305 null
2025-01-06 Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning Muyun Li et.al. 2501.03162 null
2025-01-06 Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification Yubo Wang et.al. 2501.02844 null
2025-01-06 TARDiS : Text Augmentation for Refining Diversity and Separability Kyungmin Kim et.al. 2501.02739 null
2025-01-05 FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models Hui Lin et.al. 2501.02461 null
2025-01-04 Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 Umesh Yadav et.al. 2501.02147 null
2025-01-03 A Separable Self-attention Inspired by the State Space Model for Computer Vision Juntao Zhang et.al. 2501.02040 link
2025-01-03 Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model Haixu Liu et.al. 2501.01611 null
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 A Multi-task Supervised Compression Model for Split Computing Yoshitomo Matsubara et.al. 2501.01420 link
2025-01-02 Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers Bohang Sun et.al. 2501.01311 null
2025-01-02 FAST: Fast Audio Spectrogram Transformer Anugunj Naman et.al. 2501.01104 null
2025-01-01 A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia Hirthik Mathesh GV et.al. 2501.00876 null
2025-01-01 Ensuring superior learning outcomes and data security for authorized learner Jeongho Bang et.al. 2501.00754 null
2024-12-31 TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification Nishit Anand et.al. 2501.00398 null
2024-12-31 Exploring Variability in Fine-Tuned Models for Text Classification with DistilBERT Giuliano Lorenzoni et.al. 2501.00241 null
2024-12-30 The Text Classification Pipeline: Starting Shallow going Deeper Marco Siino et.al. 2501.00174 null
2024-12-30 Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models Christos Petridis et.al. 2412.21022 null
2024-12-30 FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI Zhengdong Li et.al. 2412.20974 null
2024-12-30 Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes Yang Chen et.al. 2412.20918 null
2024-12-30 UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language Models Yujie Li et.al. 2412.20742 null
2024-12-30 Improving Acoustic Scene Classification in Low-Resource Conditions Zhi Chen et.al. 2412.20722 null
2024-12-29 Hilbert Curve Based Molecular Sequence Analysis Sarwan Ali et.al. 2412.20616 null
2024-12-29 A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier Amit Sarkar et.al. 2412.20393 null
2024-12-29 HindiLLM: Large Language Model for Hindi Sanjay Chouhan et.al. 2412.20357 null
2024-12-29 Deep Learning in Image Classification: Evaluating VGG19's Performance on Complex Visual Data Weijie He et.al. 2412.20345 null
2024-12-28 Few-shot Algorithm Assurance Dang Nguyen et.al. 2412.20275 null
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 null
2024-12-27 Enhancing Fine-grained Image Classification through Attentive Batch Training Duy M. Le et.al. 2412.19606 null
2024-12-27 A Comparative Study of Machine Unlearning Techniques for Image and Text Classification Models Omar M. Safa et.al. 2412.19583 null
2024-12-27 Multi-label Classification using Deep Multi-order Context-aware Kernel Networks Mingyuan Jiu et.al. 2412.19491 null
2024-12-27 Residual Feature-Reutilization Inception Network for Image Classification Yuanpeng He et.al. 2412.19433 null
2024-12-27 An In-Depth Analysis of Adversarial Discriminative Domain Adaptation for Digit Classification Eugene Choi et.al. 2412.19391 link
2024-12-26 Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components Tengxue Zhang et.al. 2412.19085 null
2024-12-26 Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability Ruixi Lin et.al. 2412.19018 null
2024-12-25 Injecting Bias into Text Classification Models using Backdoor Attacks A. Dilara Yavuz et.al. 2412.18975 null
2024-12-25 Research Experiment on Multi-Model Comparison for Chinese Text Classification Tasks JiaCheng Li et.al. 2412.18908 null
2024-12-24 VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis Shicheng Yin et.al. 2412.18178 link
2024-12-24 Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering Francois Chaubard et.al. 2412.18052 null
2024-12-23 Explainability in Neural Networks for Natural Language Processing Tasks Melkamu Mersha et.al. 2412.18036 null
2024-12-23 COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning Arnav M. Das et.al. 2412.17684 null
2024-12-23 Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing Prakash Aryan et.al. 2412.17548 link
2024-12-23 Domain-Incremental Learning for Audio Classification Manjunath Mulimani et.al. 2412.17424 null
2024-12-23 An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification Andre Rusli et.al. 2412.17361 link
2024-12-23 DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification Muhammad Ahmad et.al. 2412.17350 link
2024-12-22 Survey on Abstractive Text Summarization: Dataset, Models, and Metrics Gospel Ozioma Nnadi et.al. 2412.17165 link
2024-12-22 LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning Fanshuang Kong et.al. 2412.16963 link
2024-12-22 Predicting the Reliability of an Image Classifier under Image Distortion Dang Nguyen et.al. 2412.16881 null
2024-12-21 Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification Changchang Sun et.al. 2412.16780 null
2024-12-21 UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning Long Zhou et.al. 2412.16739 link
2024-12-20 Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks Enis Baty et.al. 2412.16146 null
2024-12-20 Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG Hasan Md Tusfiqur Alam et.al. 2412.16086 link
2024-12-20 A Thorough Investigation into the Application of Deep CNN for Enhancing Natural Language Processing Capabilities Chang Weng et.al. 2412.15900 null
2024-12-20 Continual Learning Using a Kernel-Based Method Over Foundation Models Saleh Momeni et.al. 2412.15571 link
2024-12-19 Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models Tianchen Zhang et.al. 2412.15431 null
2024-12-19 Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers Zhu Liao et.al. 2412.15077 null
2024-12-18 Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models Anna Scius-Bertrand et.al. 2412.13859 null
2024-12-18 Modelling Multi-modal Cross-interaction for ML-FSIC Based on Local Feature Selection Kun Yan et.al. 2412.13732 null
2024-12-18 MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency Fatemeh Froughirad et.al. 2412.13703 null
2024-12-17 Identifying Bias in Deep Neural Networks Using Image Transforms Sai Teja Erukude et.al. 2412.13079 link
2024-12-17 Token-Level Graphs for Short Text Classification Gregor Donabauer et.al. 2412.12754 link
2024-12-17 Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification Nan Li et.al. 2412.12744 link
2024-12-17 ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries Wangyu Xue et.al. 2412.12675 null
2024-12-17 Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation Dongyue Wu et.al. 2412.12672 link
2024-12-19 RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification Guangwenjie Zou et.al. 2412.12603 link
2024-12-17 Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling Iman Khazrak et.al. 2412.12532 link
2024-12-16 Gramian Multimodal Representation Learning and Alignment Giordano Cicchetti et.al. 2412.11959 null
2024-12-16 The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification Ahmad Hassanpour et.al. 2412.11951 null
2024-12-16 Does VLM Classification Benefit from LLM Description Semantics? Pingchuan Ma et.al. 2412.11917 link
2024-12-16 Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning RunLin Yu et.al. 2412.11715 null
2024-12-16 LMM-Regularized CLIP Embeddings for Image Classification Maria Tzelepi et.al. 2412.11663 null
2024-12-16 Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning Dipanwita Thakur et.al. 2412.11660 null
2024-12-16 CNNtention: Can CNNs do better with Attention? Julian Glattki et.al. 2412.11657 null
2024-12-16 Explicit and Implicit Graduated Optimization in Deep Neural Networks Naoki Sato et.al. 2412.11501 link
2024-12-16 Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models Zaifu Zhan et.al. 2412.11455 null
2024-12-16 Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep Neural Networks Naoki Sato et.al. 2412.11400 null
2024-12-13 Robust image classification with multi-modal large language models Francesco Villani et.al. 2412.10353 null
2024-12-13 MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization Shuaiting Li et.al. 2412.10261 null
2024-12-13 Label-template based Few-Shot Text Classification with Contrastive Learning Guanghua Hou et.al. 2412.10110 null
2024-12-13 Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification Zi Yang et.al. 2412.10091 link
2024-12-13 Low-Resource Fast Text Classification Based on Intra-Class and Inter-Class Distance Calculation Yanxu Mao et.al. 2412.09922 null
2024-12-12 DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations Wenhao Hu et.al. 2412.09687 null
2024-12-12 Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis Raj Hansini Khoiwal et.al. 2412.09445 null
2024-12-12 Learned Compression for Compressed Learning Dan Jacobellis et.al. 2412.09405 link
2024-12-12 Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation Davor Vukadin et.al. 2412.09311 link
2024-12-13 An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques Chunxiao Li et.al. 2412.09063 null
2024-12-12 STEAM: Squeeze and Transform Enhanced Attention Module Rishabh Sabharwal et.al. 2412.09023 null
2024-12-12 Stochastic Learning of Non-Conjugate Variational Posterior for Image Classification Kart-Leong Lim et.al. 2412.08951 null
2024-12-11 BDA: Bangla Text Data Augmentation Framework Md. Tariquzzaman et.al. 2412.08753 null
2024-12-11 Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning Hang Zhao et.al. 2412.08587 null
2024-12-11 ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts Sinan Du et.al. 2412.08341 null
2024-12-11 Online training and pruning of photonic neural networks Jiawei Zhang et.al. 2412.08184 null
2024-12-11 Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation Jiaming Lv et.al. 2412.08139 null
2024-12-11 Concept Bottleneck Large Language Models Chung-En Sun et.al. 2412.07992 link
2024-12-10 FastDDS-Based Middleware System for Remote X-Ray Image Classification Using Raspberry Pi Omar H. Khater et.al. 2412.07818 null
2024-12-10 Leveraging Content and Context Cues for Low-Light Image Enhancement Igor Morawski et.al. 2412.07693 link
2024-12-10 Post-Training Non-Uniform Quantization for Convolutional Neural Networks Ahmed Luqman et.al. 2412.07391 null
2024-12-10 Image Classification Using Singular Value Decomposition and Optimization Isabela M. Yepes et.al. 2412.07288 link
2024-12-10 An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications Kayne Uriel K. Rodrigo et.al. 2412.07182 null
2024-12-09 Convolution goes higher-order: a biologically inspired mechanism empowers image classification Simone Azeglio et.al. 2412.06740 null
2024-12-09 Impact of Privacy Parameters on Deep Learning Models for Image Classification Basanta Chaulagain et.al. 2412.06689 null
2024-12-10 Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy Min Zeng et.al. 2412.06575 null
2024-12-09 How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning Yuanyuan Wang et.al. 2412.06451 null
2024-12-09 Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models Zhen Qi et.al. 2412.06249 null
2024-12-08 Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis Yuemei Ren et.al. 2412.06075 null
2024-12-08 Vision Transformer-based Semantic Communications With Importance-Aware Quantization Joohyuk Park et.al. 2412.06038 null
2024-12-06 Sparse autoencoders reveal selective remapping of visual concepts during adaptation Hyesu Lim et.al. 2412.05276 link
2024-12-06 MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents Avaneesh Devkota et.al. 2412.04847 null
2024-12-05 Grounding Descriptions in Images informs Zero-Shot Visual Recognition Shaunak Halbe et.al. 2412.04429 link
2024-12-05 FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning Pranab Sahoo et.al. 2412.04416 link
2024-12-05 Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation Ilán Carretero et.al. 2412.04260 null
2024-12-05 Demonstration Selection for In-Context Learning via Reinforcement Learning Xubin Wang et.al. 2412.03966 null
2024-12-05 Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task Alireza Maleki et.al. 2412.03915 null
2024-12-05 Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification Zhu Han et.al. 2412.03897 null
2024-12-05 Dual-Branch Subpixel-Guided Network for Hyperspectral Image Classification Zhu Han et.al. 2412.03893 link
2024-12-04 Language Model Meets Prototypes: Towards Interpretable Text Classification Models through Prototypical Networks Ximing Wen et.al. 2412.03761 null
2024-12-05 Continual Low-Rank Scaled Dot-product Attention Ginés Carreto Picón et.al. 2412.03214 null
2024-12-04 Multi-Level Correlation Network For Few-Shot Image Classification Yunkai Dang et.al. 2412.03159 link
2024-12-04 Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection Prabhat Kc et.al. 2412.02920 null
2024-12-04 Higher Order Transformers: Efficient Attention Mechanism for Tensor Structured Data Soroush Omranpour et.al. 2412.02919 null
2024-12-03 Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing Nanyang Ye et.al. 2412.02779 null
2024-12-03 Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning Zhaozhi Wang et.al. 2412.02759 null
2024-12-03 Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks Jinjin Cai et.al. 2412.02531 null
2024-12-04 GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing Khawar Islam et.al. 2412.02366 null
2024-12-03 Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model Xi Cao et.al. 2412.02343 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310 link
2024-12-03 A Classic-Quantum Hybrid Network Framework: CQH-Net Ao Liu et.al. 2412.02059 null
2024-12-02 PROFIT: A PROximal FIne Tuning Optimizer for Multi-Task Learning Anirudh S Chakravarthy et.al. 2412.01930 null
2024-12-02 Concept Based Continuous Prompts for Interpretable Text Classification Qian Chen et.al. 2412.01644 link
2024-12-02 NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers Angel Yahir Loredo Lopez et.al. 2412.01621 null
2024-12-02 Explaining the Unexplained: Revealing Hidden Correlations for Better Interpretability Wen-Dong Jiang et.al. 2412.01365 null
2024-12-02 Class Distance Weighted Cross Entropy Loss for Classification of Disease Severity Gorkem Polat et.al. 2412.01246 null
2024-11-29 LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification Taja Kuzman et.al. 2411.19638 link
2024-11-29 FairDD: Fair Dataset Distillation via Synchronized Matching Qihang Zhou et.al. 2411.19623 null
2024-11-29 Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency Akshaya Rajesh et.al. 2411.19611 null
2024-11-29 Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising Md. Touhidul Islam et.al. 2411.19549 link
2024-11-28 CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections Mohamed Fazli Imam et.al. 2411.19346 link
2024-11-28 Quantum Neural Networks in Practice: A Comparative Study with Classical Models from Standard Data Sets to Industrial Images Daniel Basilewitsch et.al. 2411.19276 null
2024-11-28 Controlling Participation in Federated Learning with Feedback Michael Cummins et.al. 2411.19242 null
2024-11-28 Introducing Three New Benchmark Datasets for Hierarchical Text Classification Jaco du Toit et.al. 2411.19119 null
2024-11-28 MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers Jongseong Bae et.al. 2411.18995 null
2024-11-27 Fall Leaf Adversarial Attack on Traffic Sign Classification Anthony Etim et.al. 2411.18776 null
2024-11-27 Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data Aoran Shen et.al. 2411.18622 null
2024-11-27 Pruning Deep Convolutional Neural Network Using Conditional Mutual Information Tien Vu-Van et.al. 2411.18578 null
2024-11-27 Mixture of Experts in Image Classification: What's the Sweet Spot? Mathurin Videau et.al. 2411.18322 null
2024-11-27 KANs for Computer Vision: An Experimental Study Karthik Mohan et.al. 2411.18224 null
2024-11-27 Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification Muhammad Ahmad et.al. 2411.18115 link
2024-11-27 Vision Mamba Distillation for Low-resolution Fine-grained Image Classification Yao Chen et.al. 2411.17980 link
2024-11-27 Optimized Tradeoffs for Private Prediction with Majority Ensembling Shuli Jiang et.al. 2411.17965 null
2024-11-26 What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics Jordan J. Bird et.al. 2411.17593 null
2024-11-26 TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Xiaowen Ma et.al. 2411.17473 link
2024-11-26 SpikeAtConv: An Integrated Spiking-Convolutional Attention Architecture for Energy-Efficient Neuromorphic Vision Processing Wangdan Liao et.al. 2411.17439 null
2024-11-26 CoA: Chain-of-Action for Generative Semantic Labels Meng Wei et.al. 2411.17406 link
2024-11-26 BadScan: An Architectural Backdoor Attack on Visual State Space Models Om Suhas Deshmukh et.al. 2411.17283 null
2024-11-26 An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models Yunzhe Hu et.al. 2411.17182 null
2024-11-25 Contrastive Multi-graph Learning with Neighbor Hierarchical Sifting for Semi-supervised Text Classification Wei Ai et.al. 2411.16787 null
2024-11-25 A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports Gabriel Okasa et.al. 2411.16662 link
2024-11-25 Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models Donggeun Ko et.al. 2411.16079 null
2024-11-24 Context-Aware Detection of Mixed Critical Events using Video Classification Filza Akhlaq et.al. 2411.15773 null
2024-11-23 MUNBa: Machine Unlearning via Nash Bargaining Jing Wu et.al. 2411.15537 null
2024-11-23 Twin Trigger Generative Networks for Backdoor Attacks against Object Detection Zhiying Li et.al. 2411.15439 null
2024-11-22 MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs Chaoyou Fu et.al. 2411.15296 null
2024-11-21 CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning Marco Paul E. Apolinario et.al. 2411.15235 null
2024-11-21 BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models Taha Koleilat et.al. 2411.15232 null
2024-11-22 FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification Zhengrui Guo et.al. 2411.14743 link
2024-11-21 Adaptable Embeddings Network (AEN) Stan Loosmore et.al. 2411.13786 null
2024-11-20 Hierarchical Text Classification (HTC) vs. eXtreme Multilabel Classification (XML): Two Sides of the Same Medal Nerijus Bertalis et.al. 2411.13687 link
2024-11-20 Combining Autoregressive and Autoencoder Language Models for Text Classification João Gonçalves et.al. 2411.13282 link
2024-11-20 MEGL: Multimodal Explanation-Guided Learning Yifei Zhang et.al. 2411.13053 null
2024-11-19 Problem-dependent convergence bounds for randomized linear gradient compression Thomas Flynn et.al. 2411.12898 null
2024-11-19 Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs Ahmed Akib Jawad Karim et.al. 2411.12712 null
2024-11-22 STREAM: A Universal State-Space Model for Sparse Geometric Data Mark Schöne et.al. 2411.12603 null
2024-11-19 AdaCM $^2$ : On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Yuanbin Man et.al. 2411.12593 null
2024-11-19 Zero-Shot Crate Digging: DJ Tool Retrieval Using Speech Activity, Music Structure And CLAP Embeddings Iroro Orife et.al. 2411.12209 link
2024-11-19 Invariant Shape Representation Learning For Image Classification Tonmoy Hossain et.al. 2411.12201 link
2024-11-19 Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification Yuyang Xiao et.al. 2411.12151 null
2024-11-18 Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning Arundhati S. Shanbhag et.al. 2411.12073 link
2024-11-18 Vision Language Models Are Few-Shot Audio Spectrogram Classifiers Satvik Dixit et.al. 2411.12058 null
2024-11-18 Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging Milad Masroor et.al. 2411.11939 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-16 MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map Yuhong Chou et.al. 2411.10741 null
2024-11-16 Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image Jiawen Li et.al. 2411.10709 null
2024-11-16 Multi-perspective Contrastive Logit Distillation Qi Wang et.al. 2411.10693 null
2024-11-15 Vision Eagle Attention: A New Lens for Advancing Image Classification Mahmudul Hasan et.al. 2411.10564 link
2024-11-15 On the Cost of Model-Serving Frameworks: An Experimental Evaluation Pasquale De Rosa et.al. 2411.10337 null
2024-11-15 Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin Youngjoon Lee et.al. 2411.10212 link
2024-11-15 Outliers resistant image classification by anomaly detection Anton Sergeev et.al. 2411.10150 null
2024-11-15 Adapting the Biological SSVEP Response to Artificial Neural Networks Emirhan Böge et.al. 2411.10084 null
2024-11-15 Evidential Federated Learning for Skin Lesion Image Classification Rutger Hendrix et.al. 2411.10071 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 ResidualDroppath: Enhancing Feature Reuse over Residual Connections Sejik Park et.al. 2411.09475 null
2024-11-14 SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers Shravan Venkatraman et.al. 2411.09420 null
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 Computed tomography using meta-optics Maksym Zhelyeznuyakov et.al. 2411.08995 null
2024-11-13 CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt Mohammad Mahdi Mohajeri et.al. 2411.08979 null
2024-11-13 ScaleNet: Scale Invariance Learning in Directed Graphs Qin Jiang et.al. 2411.08758 link
2024-11-13 Efficient Whole Slide Image Classification through Fisher Vector Representation Ravi Kant Gupta et.al. 2411.08530 null
2024-11-12 HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification Cheng Jin et.al. 2411.07660 null
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-11 The Inherent Adversarial Robustness of Analog In-Memory Computing Corey Lammie et.al. 2411.07023 null
2024-11-11 ScaleKD: Strong Vision Transformers Could Be Excellent Teachers Jiawei Fan et.al. 2411.06786 link
2024-11-11 A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts Liu Zhuoxian et.al. 2411.06772 null
2024-11-11 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Deep Active Learning in the Open World Tian Xie et.al. 2411.06353 null
2024-11-09 Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs Shan Zhong et.al. 2411.06175 null
2024-11-09 AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems Zhiyu Zhu et.al. 2411.06146 null
2024-11-09 Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures Evgeny Sedov et.al. 2411.06124 null
2024-11-09 Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning Yuanxiu Wang et.al. 2411.06100 null
2024-11-08 GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification Priya Mishra et.al. 2411.05991 link
2024-11-08 FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information Shreen Gul et.al. 2411.05752 link
2024-11-08 Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification Antonio De Santis et.al. 2411.05698 null
2024-11-08 Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan et.al. 2411.05603 null
2024-11-08 Training objective drives the consistency of representational similarity across datasets Laure Ciernik et.al. 2411.05561 link
2024-11-08 Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach Gideon Yoffe et.al. 2411.04950 null
2024-11-07 Attention Masks Help Adversarial Attacks to Bypass Safety Detectors Yunfan Shi et.al. 2411.04772 link
2024-11-07 Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks Sanja Karilanova et.al. 2411.04760 null
2024-11-07 Is network fragmentation a useful complexity measure? Coenraad Mouton et.al. 2411.04695 null
2024-11-07 DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models Zijian Zhang et.al. 2411.04649 null
2024-11-07 Neural Fingerprints for Adversarial Attack Detection Haim Fisher et.al. 2411.04533 link
2024-11-06 Multimodal Structure-Aware Quantum Data Processing Hala Hawashin et.al. 2411.04242 null
2024-11-06 RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models Maya Varma et.al. 2411.04097 link
2024-11-06 Overcoming label shift in targeted federated learning Edvin Listo Zec et.al. 2411.03799 null
2024-11-06 Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization Yuhao He et.al. 2411.03752 null
2024-11-05 Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification Zhang Qixiang et.al. 2411.03041 null
2024-11-06 Confidence Calibration of Classifiers with Many Classes Adrien LeCoz et.al. 2411.02988 link
2024-11-05 Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization Pengkun Jiao et.al. 2411.02920 null
2024-11-05 ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate Shohei Taniguchi et.al. 2411.02853 link
2024-11-05 Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion Yaowen Hu et.al. 2411.02734 null
2024-11-06 Wave Network: An Ultra-Small Language Model Xin Zhang et.al. 2411.02674 null
2024-11-04 FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification Bidisha Chakraborty et.al. 2411.02637 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-04 A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification Sorouralsadat Fatemi et.al. 2411.02476 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-03 Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification Vaneeta Ahlawat et.al. 2411.01652 null
2024-11-03 ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis Xinyu Geng et.al. 2411.01564 null
2024-11-03 Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision Xiangzhong Luo et.al. 2411.01431 null
2024-11-02 Combining Financial Data and News Articles for Stock Price Movement Prediction Using Large Language Models Ali Elahi et.al. 2411.01368 null
2024-11-02 Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks Aarjav Kavathia et.al. 2411.01348 null
2024-11-02 MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN Nafiz Fahad et.al. 2411.01163 null
2024-11-02 Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement Bryan Bo Cao et.al. 2411.01099 link
2024-11-01 Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning Yuqing Zhou et.al. 2411.01045 null
2024-11-01 FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection Simon Gutwein et.al. 2411.01025 link
2024-10-31 Video Token Merging for Long-form Video Understanding Seon-Ho Lee et.al. 2410.23782 null
2024-10-31 Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 Weijie Ke et.al. 2410.23776 null
2024-10-31 QUEST-A: Untrained Filtering with Trained Focusing led to Enhanced Quantum Architectures Lian-Hui Yu et.al. 2410.23560 link
2024-11-01 Large Language Models for Patient Comments Multi-Label Classification Hajar Sakai et.al. 2410.23528 null
2024-10-30 Multilingual Vision-Language Pre-training for the Remote Sensing Domain João Daniel Silva et.al. 2410.23370 null
2024-10-30 Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks Axel Klawonn et.al. 2410.23359 null
2024-10-30 CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP Tianyu Yang et.al. 2410.23330 null
2024-10-30 Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification Debjyoti Saharoy et.al. 2410.23066 null
2024-10-30 Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Lam Nguyen Tung et.al. 2410.22663 null
2024-10-29 Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm Zaniar Sharifi et.al. 2410.22487 null
2024-10-29 EfficientNet with Hybrid Attention Mechanisms for Enhanced Breast Histopathology Classification: A Comprehensive Approach Naren Sengodan et.al. 2410.22392 null
2024-10-29 DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers Rakesh R. Menon et.al. 2410.22239 null
2024-10-29 Class-Aware Contrastive Optimization for Imbalanced Text Classification Grigorii Khvatskii et.al. 2410.22197 null
2024-10-29 Active Learning for Vision-Language Models Bardia Safaei et.al. 2410.22187 null
2024-10-29 Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets Adrian Iordache et.al. 2410.22184 link
2024-10-29 Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review Muhammad Bilal et.al. 2410.22180 null
2024-10-29 FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection Dat Nguyen et.al. 2410.21964 null
2024-10-29 Bayesian Optimization for Hyperparameters Tuning in Neural Networks Gabriele Onorato et.al. 2410.21886 null
2024-10-29 Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning Yinyi Lai et.al. 2410.21872 null
2024-10-28 Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks Noel Elias et.al. 2410.21561 null
2024-10-30 A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth Noel Elias et.al. 2410.21557 null
2024-10-28 Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models Piotr Przybyła et.al. 2410.20940 null
2024-10-28 Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning Bing Han et.al. 2410.20775 null
2024-10-28 Interpretable Image Classification with Adaptive Prototype-based Vision Transformers Chiyu Ma et.al. 2410.20722 null
2024-10-27 Graph Neural Networks on Discriminative Graphs of Words Yassine Abbahaddou et.al. 2410.20469 null
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-27 Sequential Large Language Model-Based Hyper-Parameter Optimization Kanan Mahammadli et.al. 2410.20302 link
2024-10-26 Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search Akhilbaran Ghosh et.al. 2410.20234 null
2024-10-26 Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits Adit Jain et.al. 2410.20041 null
2024-10-26 Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions Poojitha Thota et.al. 2410.20019 null
2024-10-26 Vulnerability of LLMs to Vertically Aligned Text Manipulations Zhecheng Li et.al. 2410.20016 null
2024-10-25 Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective Ethan Harvey et.al. 2410.19675 null
2024-10-24 Noise Adaption Network for Morse Code Image Classification Xiaxia Wang et.al. 2410.19180 link
2024-10-24 Hybrid Quantum-Classical Feature Extraction approach for Image Classification using Autoencoders and Quantum SVMs Donovan Slabbert et.al. 2410.18814 null
2024-10-24 Spatial-Temporal Search for Spiking Neural Networks Kaiwei Che et.al. 2410.18580 null
2024-10-25 Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks Lehan Wang et.al. 2410.18387 null
2024-10-23 Using Cartesian slice plots of a cosmological simulation as input of a convolutional neural network Guillermo Arreaga-Garcia et.al. 2410.18320 null
2024-10-25 Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing Dongliang Guo et.al. 2410.18267 null
2024-10-23 Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction Nicholas Walker et.al. 2410.18160 null
2024-10-23 Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers Edoardo Legnaro et.al. 2410.17816 null
2024-10-23 New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture Ach. Khozaimi et.al. 2410.17735 null
2024-10-24 Advancing Interpretability in Text Classification through Prototype Learning Bowen Wei et.al. 2410.17546 null
2024-10-23 Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning Jun-En Ding et.al. 2410.17494 null
2024-10-22 Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection Mahesh Vaijainthymala Krishnamoorthy et.al. 2410.17459 null
2024-10-22 Altogether: Image Captioning via Re-aligning Alt-text Hu Xu et.al. 2410.17251 null
2024-10-22 KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements Md Meftahul Ferdaus et.al. 2410.17172 link
2024-10-22 Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification Ganga Prasad Basyal et.al. 2410.16711 null
2024-10-21 Efficient Neural Network Training via Subset Pretraining Jan Spörer et.al. 2410.16523 null
2024-10-21 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification Ram Mohan Rao Kadiyala et.al. 2410.15998 null
2024-10-21 Visual Representation Learning Guided By Multi-modal Prior Knowledge Hongkuan Zhou et.al. 2410.15981 null
2024-10-21 AutoTrain: No-code training for state-of-the-art models Abhishek Thakur et.al. 2410.15735 link
2024-10-21 ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts Xumeng Han et.al. 2410.15732 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-20 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya et.al. 2410.15315 link
2024-10-19 Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion Chaodong Xiao et.al. 2410.15091 link
2024-10-19 PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification Ashish Seth et.al. 2410.15062 null
2024-10-19 Weakly-supervised diagnosis identification from Italian discharge letters Vittorio Torri et.al. 2410.15051 null
2024-10-19 Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation Seulbi Lee et.al. 2410.14975 null
2024-10-18 A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification Maksuda Akter et.al. 2410.14536 null
2024-10-18 Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation Shuai Zhao et.al. 2410.14425 link
2024-10-18 A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference Bin Zhang et.al. 2410.14236 null
2024-10-18 Comparative Evaluation of Clustered Federated Learning Method Michael Ben Ali et.al. 2410.14212 link
2024-10-17 Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" Luan Fletcher et.al. 2410.13989 link
2024-10-17 LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Yiming Shi et.al. 2410.13618 link
2024-10-17 Augmentation Policy Generation for Image Classification Using Large Language Models Ant Duru et.al. 2410.13453 null
2024-10-17 Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification Guangming Huang et.al. 2410.13439 null
2024-10-16 Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge Fawaz Sammani et.al. 2410.13016 link
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals Orchid Chetia Phukan et.al. 2410.12645 null
2024-10-17 From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs Lukas Birkenmaier et.al. 2410.12622 null
2024-10-16 Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look Yong Zhang et.al. 2410.12396 null
2024-10-15 Clustering doc2vec output for topic-dimensionality reduction: A MITRE ATT&CK calibration Nathan Monnet et.al. 2410.11573 null
2024-10-15 LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models Hossein Abdi et.al. 2410.11551 null
2024-10-15 Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning Minoo Jafarlou et.al. 2410.11355 null
2024-10-14 Towards a More Complete Theory of Function Preserving Transforms Michael Painter et.al. 2410.11038 null
2024-10-14 Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning Etai Littwin et.al. 2410.10773 null
2024-10-15 Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation Yosuke Yamagishi et.al. 2410.10710 link
2024-10-14 Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification Jiaxiang Gou et.al. 2410.10573 null
2024-10-14 Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC Units Maedeh Ghaderi et.al. 2410.10545 null
2024-10-14 Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks Xinyue Liu et.al. 2410.10454 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-14 A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets Nikolaos Mylonas et.al. 2410.10290 null
2024-10-14 big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo et.al. 2410.10267 null
2024-10-14 SkillAggregation: Reference-free LLM-Dependent Aggregation Guangzhi Sun et.al. 2410.10215 null
2024-10-14 Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? Zeliang Zhang et.al. 2410.10160 null
2024-10-11 Efficient Hyperparameter Importance Assessment for CNNs Ruinan Wang et.al. 2410.08920 null
2024-10-11 Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning Nusrat Jahan Prottasha et.al. 2410.08598 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-11 Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks Yiyue Chen et.al. 2410.08508 null
2024-10-11 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-10 Bilinear MLPs enable weight-based mechanistic interpretability Michael T. Pearce et.al. 2410.08417 null
2024-10-10 What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias Aida Mohammadshahi et.al. 2410.08407 null
2024-10-10 Time Traveling to Defend Against Adversarial Example Attacks in Image Classification Anthony Etim et.al. 2410.08338 null
2024-10-10 More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing Sagi Shaier et.al. 2410.08003 null
2024-10-10 When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections Keryan Chelouche et.al. 2410.07689 null
2024-10-10 Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks Minxing Zhang et.al. 2410.07670 null
2024-10-10 StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models Minchan Kwon et.al. 2410.07652 null
2024-10-10 Explainability of Deep Neural Networks for Brain Tumor Detection S. Park et.al. 2410.07613 link
2024-10-10 CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features Po-han Li et.al. 2410.07610 null
2024-10-09 One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Fabian Paischer et.al. 2410.07170 link
2024-10-09 JPEG Inspired Deep Learning Ahmed H. Salamah et.al. 2410.07081 link
2024-10-09 Optimizing Estimators of Squared Calibration Errors in Classification Sebastian G. Gruber et.al. 2410.07014 null
2024-10-09 Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks Friedrich Wolf-Monheim et.al. 2410.06927 null
2024-10-09 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 null
2024-10-09 Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization Prateek Varshney et.al. 2410.06567 null
2024-10-08 A Comparative Study of Hybrid Models in Health Misinformation Text Classification Mkululi Sikosana et.al. 2410.06311 null
2024-10-08 Conformal Structured Prediction Botong Zhang et.al. 2410.06296 link
2024-10-08 TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data Jeremy Andrew Irvin et.al. 2410.06234 null
2024-10-08 Manual Verbalizer Enrichment for Few-Shot Text Classification Quang Anh Nguyen et.al. 2410.06173 null
2024-10-07 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-07 Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Senorita Deb et.al. 2410.05189 null
2024-10-07 IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification Yan He et.al. 2410.05100 null
2024-10-07 Explanation sensitivity to the randomness of large language models: the case of journalistic text classification Jeremie Bogaert et.al. 2410.05085 null
2024-10-07 Control-oriented Clustering of Visual Latent Representation Han Qi et.al. 2410.05063 null
2024-10-07 SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification Benjamin Feuer et.al. 2410.05057 link
2024-10-07 Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks Sandro Boccuzzo et.al. 2410.04866 null
2024-10-06 MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network Doanh C. Bui et.al. 2410.04507 null
2024-10-06 Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification Zhaorui Tan et.al. 2410.04492 link
2024-10-05 IT $^3$ : Idempotent Test-Time Training Nikita Durasov et.al. 2410.04201 null
2024-10-04 Classification-Denoising Networks Louis Thiry et.al. 2410.03505 null
2024-10-04 A Multimodal Framework for Deepfake Detection Kashish Gandhi et.al. 2410.03487 null
2024-10-04 On Uncertainty In Natural Language Processing Dennis Ulmer et.al. 2410.03446 link
2024-10-04 Comparing zero-shot self-explanations with human rationales in multilingual text classification Stephanie Brandl et.al. 2410.03296 null
2024-10-04 Sm: enhanced localization in Multiple Instance Learning for medical imaging classification Francisco M. Castro-Macías et.al. 2410.03276 null
2024-10-04 Selective Transformer for Hyperspectral Image Classification Yichu Xu et.al. 2410.03171 null
2024-10-03 CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification Jinghao Shi et.al. 2410.03038 null
2024-10-03 On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions Huy Nguyen et.al. 2410.02935 null
2024-10-03 Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups Zakhar Shumaylov et.al. 2410.02698 null
2024-10-03 LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Duy M. H. Nguyen et.al. 2410.02615 null
2024-10-03 Personalized Quantum Federated Learning for Privacy Image Classification Jinjing Shi et.al. 2410.02547 null
2024-10-03 BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning Gustav Wagner Zakarias et.al. 2410.02387 null
2024-10-03 CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration Thomas Buddenkotte et.al. 2410.02316 link
2024-10-03 Hard Negative Sample Mining for Whole Slide Image Classification Wentao Huang et.al. 2410.02212 link
2024-10-02 Kolmogorov-Arnold Network Autoencoders Mohammadamin Moradi et.al. 2410.02077 link
2024-10-02 Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Sreyan Ghosh et.al. 2410.02056 null
2024-10-02 FLAG: Financial Long Document Classification via AMR-based GNN Bolun et.al. 2410.02024 link
2024-10-02 MONICA: Benchmarking on Long-tailed Medical Image Classification Lie Ju et.al. 2410.02010 null
2024-10-02 Revisiting Hierarchical Text Classification: Inference and Metrics Roman Plaud et.al. 2410.01305 link
2024-10-02 Automatic deductive coding in discourse analysis: an application of large language models in learning analytics Lishan Zhang et.al. 2410.01240 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading Mostafa Hajighasemloua et.al. 2410.00779 null
2024-10-01 NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models Chi-Sheng Chen et.al. 2410.00712 null
2024-10-01 TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids Mazen Balat et.al. 2410.00403 null
2024-09-30 KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA Sachin Karmani et.al. 2410.00267 null
2024-09-30 A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification Marina Ribeiro et.al. 2410.00250 null
2024-09-30 Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques Tin Yuet Chung et.al. 2410.00207 null
2024-10-02 Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification Kush Dubey et.al. 2410.00179 link
2024-09-30 POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator Eugenio Lomurno et.al. 2409.20447 null
2024-09-30 Satellite image classification with neural quantum kernels Pablo Rodriguez-Grasa et.al. 2409.20356 null
2024-09-30 All-optical autoencoder machine learning framework using diffractive processors Peijie Feng et.al. 2409.20346 null
2024-09-30 Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients Youssef Allouah et.al. 2409.20329 null
2024-09-30 Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies Shalini Sarode et.al. 2409.20237 null
2024-09-30 Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language Vincent Beliveau et.al. 2409.20147 null
2024-09-30 SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers Nick Nikzad et.al. 2409.19850 null
2024-09-29 Adversarial Examples for DNA Classification Hyunwoo Yoo et.al. 2409.19788 null
2024-09-29 FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification Kexue Fu et.al. 2409.19720 null
2024-09-29 Vision-Language Models are Strong Noisy Label Detectors Tong Wei et.al. 2409.19696 link
2024-09-27 Unconditional stability of a recurrent neural circuit implementing divisive normalization Shivang Rawat et.al. 2409.18946 null
2024-09-27 Subspace Preserving Quantum Convolutional Neural Network Architectures Léo Monbroussou et.al. 2409.18918 null
2024-09-27 Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation Md. Farhadul Islam et.al. 2409.18506 null
2024-09-26 Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective Yu Wang et.al. 2409.18316 null
2024-09-26 Realistic Evaluation of Model Merging for Compositional Generalization Derek Tam et.al. 2409.18314 null
2024-09-26 DARE: Diverse Visual Question Answering with Robustness Evaluation Hannah Sterz et.al. 2409.18023 null
2024-09-26 The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification Andreas Waldis et.al. 2409.17929 null
2024-09-26 Cascade Prompt Learning for Vision-Language Model Adaptation Ge Wu et.al. 2409.17805 null
2024-09-26 Byzantine-Robust Aggregation for Securing Decentralized Federated Learning Diego Cajaraville-Aboy et.al. 2409.17754 null
2024-09-26 Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components Peiyong Wang et.al. 2409.17583 link
2024-09-26 Leveraging Annotator Disagreement for Text Classification Jin Xu et.al. 2409.17577 null
2024-09-26 Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE Xun Zhu et.al. 2409.17508 null
2024-09-26 Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification Guanyi Mou et.al. 2409.17474 null
2024-09-26 Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models Yuqing Zhou et.al. 2409.17455 null
2024-09-25 Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting Jay Zoellin et.al. 2409.17332 null
2024-09-25 BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices Yongqi Xu et.al. 2409.17093 link
2024-09-25 Accumulator-Aware Post-Training Quantization Ian Colbert et.al. 2409.17092 null
2024-09-26 HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space Jacob Fein-Ashley et.al. 2409.16897 link
2024-09-25 Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification Aram Khaksar et.al. 2409.16884 null
2024-09-25 Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness Lucas Piper et.al. 2409.16838 link
2024-09-24 Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification Leire Benito-Del-Valle et.al. 2409.16002 link
2024-09-24 An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images Dibyasree Guha et.al. 2409.15958 null
2024-09-24 iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification Yuanzhe Jin et.al. 2409.15848 link
2024-09-23 Optimizing News Text Classification with Bi-LSTM and Attention Mechanism for Efficient Data Processing Bingyao Liu et.al. 2409.15576 null
2024-09-23 Critic Loss for Image Classification Brendan Hogan Rappazzo et.al. 2409.15565 null
2024-09-23 VLMine: Long-Tail Data Mining with Vision Language Models Mao Ye et.al. 2409.15486 null
2024-09-23 HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning Naghmeh Shafiee Roudbari et.al. 2409.15213 null
2024-09-23 Benchmarking Edge AI Platforms for High-Performance ML Inference Rakshith Jayanth et.al. 2409.14803 null
2024-09-23 Less yet robust: crucial region selection for scene recognition Jianqi Zhang et.al. 2409.14741 null
2024-09-22 Low-Light Enhancement Effect on Classification and Detection: An Empirical Study Xu Wu et.al. 2409.14461 null
2024-09-18 Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes Nikita Kiselev et.al. 2409.11995 link
2024-09-18 Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction Jin Jie Sean Yeo et.al. 2409.11964 null
2024-09-18 Agglomerative Token Clustering Joakim Bruslund Haurum et.al. 2409.11923 null
2024-09-18 Distillation-free Scaling of Large SSMs for Images and Videos Hamid Suleman et.al. 2409.11867 null
2024-09-18 Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks Amirhossein Dezhboro et.al. 2409.11665 null
2024-09-18 Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images A. A. G. Yogi Pramana et.al. 2409.11644 null
2024-09-18 Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network Yang Liu et.al. 2409.11619 null
2024-09-17 Multi-Cohort Framework with Cohort-Aware Attention and Adversarial Mutual-Information Minimization for Whole Slide Image Classification Sharon Peled et.al. 2409.11119 null
2024-09-17 Anti-ESIA: Analyzing and Mitigating Impacts of Electromagnetic Signal Injection Attacks Denglin Kang et.al. 2409.10922 null
2024-09-16 Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? Kaleb Kassaw et.al. 2409.10775 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 InfoDisent: Explainability of Image Classification Models by Information Disentanglement Łukasz Struski et.al. 2409.10329 null
2024-09-16 Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation Neil De La Fuente et.al. 2409.10286 null
2024-09-15 Finetuning CLIP to Reason about Pairwise Differences Dylan Sam et.al. 2409.09721 null
2024-09-15 Compositional Audio Representation Learning Sripathi Sridhar et.al. 2409.09619 null
2024-09-14 One missing piece in Vision and Language: A Survey on Comics Understanding Emanuele Vivoli et.al. 2409.09502 link
2024-09-14 Real-world Adversarial Defense against Patch Attacks based on Diffusion Model Xingxing Wei et.al. 2409.09406 null
2024-09-14 Turbo your multi-modal classification with contrastive learning Zhiyu Zhang et.al. 2409.09282 null
2024-09-14 Leveraging Foundation Models for Efficient Federated Learning in Resource-restricted Edge Networks S. Kawa Atapour et.al. 2409.09273 null
2024-09-13 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds Sreyan Ghosh et.al. 2409.09213 link
2024-09-13 Pushing the boundaries of event subsampling in event-based video classification using CNNs Hesam Araghi et.al. 2409.08953 link
2024-09-13 Pushing Joint Image Denoising and Classification to the Edge Thomas C Markhorst et.al. 2409.08943 null
2024-09-13 Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering Changxin Liu et.al. 2409.08640 null
2024-09-13 Anytime Continual Learning for Open Vocabulary Classification Zhen Zhu et.al. 2409.08518 link
2024-09-12 Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms Fatemeh Askari et.al. 2409.07989 link
2024-09-12 Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters Shun Zou et.al. 2409.07896 link
2024-09-12 Classifying Images with CoLaNET Spiking Neural Network -- the MNIST Example Mikhail Kiselev et.al. 2409.07833 null
2024-09-12 Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption Zhizheng Lai et.al. 2409.07751 null
2024-09-12 DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning Kangyang Luo et.al. 2409.07734 null
2024-09-12 Cooperative Inference with Interleaved Operator Partitioning for CNNs Zhibang Liu et.al. 2409.07693 null
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 null
2024-09-11 Minimizing Embedding Distortion for Robust Out-of-Distribution Performance Tom Shaked et.al. 2409.07582 null
2024-09-11 A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks Erik B. Terres-Escudero et.al. 2409.07387 null
2024-09-11 Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding Ronald Katende et.al. 2409.07310 null
2024-09-11 LLM-based feature generation from text for interpretable machine learning Vojtěch Balek et.al. 2409.07132 null
2024-09-11 Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator Kangyang Luo et.al. 2409.06955 null
2024-09-10 Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm Jinwei Zhao et.al. 2409.06542 null
2024-09-10 Seam Carving as Feature Pooling in CNN Mohammad Imrul Jubair et.al. 2409.06311 null
2024-09-10 EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification Suorong Yang et.al. 2409.06290 link
2024-09-09 A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets Mariana Yukari Noguti et.al. 2409.05972 null
2024-09-09 SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values Chengwei Sun et.al. 2409.05926 null
2024-09-09 Adversarial Attacks on Data Attribution Xinhe Wang et.al. 2409.05657 null
2024-09-09 Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition Shiming Ge et.al. 2409.05384 null
2024-09-09 RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU Chengyuan Liu et.al. 2409.05275 null
2024-09-09 Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space Junho Lee et.al. 2409.05260 null
2024-09-08 PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels Aayushman et.al. 2409.04975 link
2024-09-07 Activation Function Optimization Scheme for Image Classification Abdur Rahman et.al. 2409.04915 null
2024-09-07 LoCa: Logit Calibration for Knowledge Distillation Runming Yang et.al. 2409.04778 null
2024-09-07 Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis Preetu Mehta et.al. 2409.04734 null
2024-09-06 Connectivity-Inspired Network for Context-Aware Recognition Gianluca Carloni et.al. 2409.04360 null
2024-09-06 An optically accelerated extreme learning machine using hot atomic vapors Pierre Azam et.al. 2409.04312 null
2024-09-06 PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation Tianqi Wei et.al. 2409.04038 null
2024-09-05 Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning Isaac Ray et.al. 2409.03938 null
2024-09-05 WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking Carl De Sousa Trias et.al. 2409.03902 null
2024-09-05 On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models Thanh-Dung Le et.al. 2409.03901 null
2024-09-05 Have Large Vision-Language Models Mastered Art History? Ombretta Strafforello et.al. 2409.03521 null
2024-09-05 Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks Akshay Jain et.al. 2409.03458 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning Bowen Tian et.al. 2409.03192 null
2024-09-05 The AdEMAMix Optimizer: Better, Faster, Older Matteo Pagliardini et.al. 2409.03137 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-03 MedUnA: Language guided Unsupervised Adaptation of Vision-Language Models for Medical Image Classification Umaima Rahman et.al. 2409.02729 null
2024-09-05 OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation Włodzimierz Lewoniewski et.al. 2409.02649 null
2024-09-04 Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization Cho-Ying Wu et.al. 2409.02486 null
2024-09-03 Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems Sanjita Prajapati et.al. 2409.02278 null
2024-09-05 Robust Clustering on High-Dimensional Data with Stochastic Quantization Anton Kozyriev et.al. 2409.02066 link
2024-09-03 Compressed learning based onboard semantic compression for remote sensing platforms Protim Bhattacharjee et.al. 2409.01988 null
2024-09-03 State-of-the-art Advances of Deep-learning Linguistic Steganalysis Research Yihao Wang et.al. 2409.01780 null
2024-09-03 Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization Avraham Chapman et.al. 2409.01672 null
2024-09-03 ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition Shiting Xiao et.al. 2409.01564 null
2024-08-30 Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain Francesca Grasso et.al. 2408.17362 link
2024-08-30 Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification Zhiwei Zhang et.al. 2408.17197 null
2024-08-30 Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study Shubham Agarwal et.al. 2408.17181 null
2024-09-02 Instant Adversarial Purification with Adversarial Consistency Distillation Chun Tong Lei et.al. 2408.17064 null
2024-08-30 Generative Modeling Perspective for Control and Reasoning in Robotics Takuma Yoneda et.al. 2408.17041 null
2024-08-29 Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector Deepak Dagar et.al. 2408.16892 null
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 Android Malware Detection Based on RGB Images and Multi-feature Fusion Zhiqiang Wang et.al. 2408.16555 null
2024-08-29 SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models Guangxi Li et.al. 2408.16273 link
2024-08-29 Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation Yanghao Wang et.al. 2408.16266 null
2024-08-29 Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification Yu Liang et.al. 2408.16265 null
2024-08-28 EMP: Enhance Memory in Data Pruning Jinying Xiao et.al. 2408.16031 null
2024-08-28 Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning Bingchen Yan et.al. 2408.15924 null
2024-08-28 ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation Tiantian Feng et.al. 2408.15803 null
2024-08-28 Visual Prompt Engineering for Medical Vision Language Models in Radiology Stefan Denner et.al. 2408.15802 null
2024-08-28 Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings Lingyu Gao et.al. 2408.15650 null
2024-08-27 DCT-CryptoNets: Scaling Private Inference in the Frequency Domain Arjun Roy et.al. 2408.15231 null
2024-08-27 A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships Gracile Astlin Pereira et.al. 2408.15178 null
2024-08-28 AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection Mykhailo Koshil et.al. 2408.15113 null
2024-08-27 Data downlink prioritization using image classification on-board a 6U CubeSat Keenan A. A. Chatar et.al. 2408.14865 null
2024-08-27 Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification Yiqiang Cai et.al. 2408.14862 null
2024-08-27 Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification Sirui Li et.al. 2408.14770 null
2024-08-26 On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise M. Reza Eslami et.al. 2408.14680 null
2024-08-26 Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan et.al. 2408.14441 null
2024-08-26 Uncertainties of Latent Representations in Computer Vision Michael Kirchhof et.al. 2408.14281 null
2024-08-26 MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification Feng Gao et.al. 2408.14255 null
2024-08-26 Feature Aligning Few shot Learning Method Using Local Descriptors Weighted Rules Bingchen Yan et.al. 2408.14192 null
2024-08-26 GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets Sven Oehri et.al. 2408.14131 null
2024-08-25 Few-Shot Histopathology Image Classification: Evaluating State-of-the-Art Methods and Unveiling Performance Insights Ardhendu Sekhar et.al. 2408.13816 null
2024-08-25 On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective Tal Alter et.al. 2408.13809 null
2024-08-25 Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion Xu Zhang et.al. 2408.13744 link
2024-08-25 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification Haizhao Jing et.al. 2408.13728 null
2024-08-24 Enhanced Astronomical Source Classification with Integration of Attention Mechanisms and Vision Transformers Srinadh Reddy Bhavanam et.al. 2408.13634 null
2024-08-23 Domain-specific long text classification from sparse relevant information Célia D'Cruz et.al. 2408.13253 null
2024-08-23 EAViT: External Attention Vision Transformer for Audio Classification Aquib Iqbal et.al. 2408.13201 null
2024-08-23 A gradient system based on anisotropic monochrome image processing with orientation auto-adjustment Harbir Antil et.al. 2408.12847 null
2024-08-23 Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence Purushothaman Natarajan et.al. 2408.12837 null
2024-08-23 VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models Purushothaman Natarajan et.al. 2408.12808 null
2024-08-23 BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models Yige Li et.al. 2408.12798 null
2024-08-23 Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling Zongyao Lyu et.al. 2408.12774 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561 link
2024-08-22 The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design Artem Snegirev et.al. 2408.12503 null
2024-08-22 Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification Sudi Murindanyi et.al. 2408.12426 null
2024-08-22 AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network Donghwa Kang et.al. 2408.12293 null
2024-08-22 Whole Slide Image Classification of Salivary Gland Tumours John Charlton et.al. 2408.12275 null
2024-08-22 Query-Efficient Video Adversarial Attack with Stylized Logo Duoxun Tang et.al. 2408.12099 null
2024-08-21 Approaching Deep Learning through the Spectral Dynamics of Weights David Yunis et.al. 2408.11804 link
2024-08-21 SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance Zhiqiang Wu et.al. 2408.11760 null
2024-08-21 Improving Calibration by Relating Focal Loss, Temperature Scaling, and Properness Viacheslav Komisarenko et.al. 2408.11598 link
2024-08-21 MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning Minghao Han et.al. 2408.11505 null
2024-08-21 Enabling Small Models for Zero-Shot Classification through Model Label Learning Jia Zhang et.al. 2408.11449 null
2024-08-21 Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond Minghao Liu et.al. 2408.11338 null
2024-08-21 Towards Evaluating Large Language Models on Sarcasm Understanding Yazhou Zhang et.al. 2408.11319 null
2024-08-20 Privacy-preserving Universal Adversarial Defense for Black-box Models Qiao Li et.al. 2408.10647 null
2024-08-20 A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping Kevin Kam Fung Yuen et.al. 2408.10572 null
2024-08-20 NoMatterXAI: Generating "No Matter What" Alterfactual Examples for Explaining Black-Box Text Classification Models Tuc Nguyen et.al. 2408.10528 null
2024-08-20 Cervical Cancer Detection Using Multi-Branch Deep Learning Model Tatsuhiro Baba et.al. 2408.10498 null
2024-08-19 HaSPeR: An Image Repository for Hand Shadow Puppet Recognition Syed Rifat Raiyan et.al. 2408.10360 link
2024-08-19 Leveraging Superfluous Information in Contrastive Representation Learning Xuechu Yu et.al. 2408.10292 null
2024-08-19 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Anke Tang et.al. 2408.10174 link
2024-08-19 Towards Robust Federated Image Classification: An Empirical Study of Weight Selection Strategies in Manufacturing Vinit Hegiste et.al. 2408.10024 null
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning David Hanny et.al. 2408.09914 null
2024-08-19 Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions Sebastian Heineking et.al. 2408.09831 null
2024-08-19 AutoML-guided Fusion of Entity and LLM-based representations Boshko Koloski et.al. 2408.09794 null
2024-08-19 Dataset Distillation for Histopathology Image Classification Cong Cong et.al. 2408.09709 null
2024-08-19 A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification Claudio M. V. de Andrade et.al. 2408.09629 null
2024-08-18 Attention Is Not What You Need: Revisiting Multi-Instance Learning for Whole Slide Image Classification Xin Liu et.al. 2408.09449 null
2024-08-17 Narrowing the Focus: Learned Optimizers for Pretrained Models Gus Kristiansen et.al. 2408.09310 null
2024-08-16 DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models Eman Ali et.al. 2408.08855 null
2024-08-16 LEVIS: Large Exact Verifiable Input Spaces for Neural Networks Mohamad Fares El Hajj Chehade et.al. 2408.08824 null
2024-08-16 Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification Abdullah Al Imran et.al. 2408.08803 null
2024-08-16 Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers Zihang Song et.al. 2408.08794 null
2024-08-16 Quantum convolutional neural networks for jet images classification Hala Elhag et.al. 2408.08701 null
2024-08-16 MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation Zunjie Xiao et.al. 2408.08600 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-16 Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness Hefei Mei et.al. 2408.08502 link
2024-08-15 Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention Zohaib Khan et.al. 2408.08454 null
2024-08-15 Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts Abdur R. Fayjie et.al. 2408.08432 null
2024-08-15 SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Gengwei Zhang et.al. 2408.08295 link
2024-08-15 Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices Tess Watt et.al. 2408.08215 null
2024-08-15 Towards flexible perception with visual memory Robert Geirhos et.al. 2408.08172 null
2024-08-15 Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification Jiexuan Yan et.al. 2408.08125 link
2024-08-15 HAIR: Hypernetworks-based All-in-One Image Restoration Jin Cao et.al. 2408.08091 link
2024-08-14 Large Language Models Prompting With Episodic Memory Dai Do et.al. 2408.07465 null
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-13 Efficient Search for Customized Activation Functions with Gradient Descent Lukas Strack et.al. 2408.06820 link
2024-08-13 Do Vision-Language Foundational models show Robust Visual Perception? Shivam Chandhok et.al. 2408.06781 link
2024-08-13 Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model Yongcheng Li et.al. 2408.06716 link
2024-08-13 Coherence Awareness in Diffractive Neural Networks Matan Kleiner et.al. 2408.06681 null
2024-08-12 Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks Lucas Félix et.al. 2408.06341 null
2024-08-12 Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance Manuel Milling et.al. 2408.06264 null
2024-08-12 Deep Learning System Boundary Testing through Latent Space Style Mixing Amr Abdellatif et.al. 2408.06258 null
2024-08-12 Global-to-Local Support Spectrums for Language Model Explainability Lucas Agussurja et.al. 2408.05976 null
2024-08-12 A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class Qian Qiao et.al. 2408.05953 null
2024-08-12 Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information Mingkun Zhang et.al. 2408.05900 null
2024-08-11 HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning Zhijian Chen et.al. 2408.05786 null
2024-08-11 PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings Vaibhav Ganatra et.al. 2408.05754 null
2024-08-11 Disposable-key-based image encryption for collaborative learning of Vision Transformer Rei Aso et.al. 2408.05737 null
2024-08-11 A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation Koushik Biswas et.al. 2408.05692 null
2024-08-09 A conformalized learning of a prediction set with applications to medical imaging classification Roy Hirsch et.al. 2408.05037 null
2024-08-09 Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks Verna Dankers et.al. 2408.04965 null
2024-08-09 LiD-FL: Towards List-Decodable Federated Learning Hong Liu et.al. 2408.04963 null
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-08 Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes Bhushan Atote et.al. 2408.04606 null
2024-08-08 SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals Haoran Zheng et.al. 2408.04575 null
2024-08-08 An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification Ben Crulis et.al. 2408.04460 null
2024-08-08 Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction Yuchen Wang et.al. 2408.04294 null
2024-08-07 FMiFood: Multi-modal Contrastive Learning for Food Image Classification Xinyue Pan et.al. 2408.03922 null
2024-08-07 Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning Simret Araya Gebreegziabher et.al. 2408.03819 null
2024-08-07 Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification Georgia Sovatzidi et.al. 2408.03745 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks Jaewook Lee et.al. 2408.03663 null
2024-08-07 Making Robust Generalizers Less Rigid with Soft Ascent-Descent Matthew J. Holland et.al. 2408.03619 null
2024-08-06 AI Foundation Models in Remote Sensing: A Survey Siqi Lu et.al. 2408.03464 null
2024-08-06 Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust et.al. 2408.03274 null
2024-08-06 A Debiased Nearest Neighbors Framework for Multi-Label Text Classification Zifeng Cheng et.al. 2408.03202 null
2024-08-06 Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi Pranita Deshmukh et.al. 2408.03172 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 null
2024-08-06 L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization Elvys Linhares Pontes et.al. 2408.03033 null
2024-08-06 Adversarial Robustness of Open-source Text Classification Models and Fine-Tuning Chains Hao Qin et.al. 2408.02963 null
2024-08-06 Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration Xiaoqing Zhang et.al. 2408.02906 null
2024-08-05 Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space Eduardo Sanchez-Karhunen et.al. 2408.02838 null
2024-08-05 Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services Shaopeng Fu et.al. 2408.02814 null
2024-08-05 FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification Yijin Huang et.al. 2408.02426 null
2024-08-05 On the Robustness of Malware Detectors to Adversarial Samples Muhammad Salman et.al. 2408.02310 null
2024-08-05 Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution Hojung Lee et.al. 2408.02307 null
2024-08-05 Network Fission Ensembles for Low-Cost Self-Ensembles Hojung Lee et.al. 2408.02301 null
2024-08-04 VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces Somnath Sendhil Kumar et.al. 2408.02140 null
2024-08-04 DeMansia: Mamba Never Forgets Any Tokens Ricky Fang et.al. 2408.01986 null
2024-08-06 A Survey and Evaluation of Adversarial Attacks for Object Detection Khoi Nguyen Tiet Nguyen et.al. 2408.01934 null
2024-08-03 Safe Semi-Supervised Contrastive Learning Using In-Distribution Data as Positive Examples Min Gu Kwak et.al. 2408.01872 null
2024-08-03 LAM3D: Leveraging Attention for Monocular 3D Object Detection Diana-Alexandra Sas et.al. 2408.01739 null
2024-08-02 Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder Matan Atad et.al. 2408.01571 null
2024-08-02 Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01372 null
2024-08-02 WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01231 null
2024-08-02 Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01224 null
2024-08-02 Rethinking Pre-trained Feature Extractor Selection in Multiple Instance Learning for Whole Slide Image Classification Bryan Wong et.al. 2408.01167 null
2024-08-01 CERT-ED: Certifiably Robust Text Classification for Edit Distance Zhuoqun Huang et.al. 2408.00728 null
2024-08-01 Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images Xiaoyi Liu et.al. 2408.00636 null
2024-08-01 DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation Rakshith Subramanyam et.al. 2408.00331 null
2024-07-31 Vera Verto: Multimodal Hijacking Attack Minxing Zhang et.al. 2408.00129 null
2024-07-31 Learning Video Context as Interleaved Multimodal Sequences Kevin Qinghong Lin et.al. 2407.21757 null
2024-07-30 Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation Marcelo Matheus Gauy et.al. 2407.20989 null
2024-07-30 Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach Adam Wojciechowski et.al. 2407.20899 null
2024-08-01 DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention Wei Wang et.al. 2407.20843 null
2024-08-01 The Susceptibility of Example-Based Explainability Methods to Class Outliers Ikhtiyor Nematov et.al. 2407.20678 null
2024-07-30 Knowledge Fused Recognition: Fusing Hierarchical Knowledge for Image Recognition through Quantitative Relativity Modeling and Deep Metric Learning Yunfeng Zhao et.al. 2407.20600 null
2024-07-30 Exploring Liquid Neural Networks on Loihi-2 Wiktoria Agata Pawlak et.al. 2407.20590 null
2024-07-29 Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation Ashirbad Mishra et.al. 2407.20462 null
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171 null
2024-07-29 Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism Tianhang Nan et.al. 2407.19821 null
2024-07-28 Competition-based Adaptive ReLU for Deep Neural Networks Junjia Chen et.al. 2407.19441 null
2024-07-28 Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets Tianxiao Zhang et.al. 2407.19394 link
2024-07-27 Inference-Time Selective Debiasing Gleb Kuzmin et.al. 2407.19345 null
2024-07-27 Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes Chinedu Eleh et.al. 2407.19297 null
2024-07-27 Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation Riyansha Singh et.al. 2407.19265 null
2024-07-27 A Survey of Malware Detection Using Deep Learning Ahmed Bensaoud et.al. 2407.19153 null
2024-07-26 UniForensics: Face Forgery Detection via General Facial Representation Ziyuan Fang et.al. 2407.19079 null
2024-07-26 A Scalable Quantum Non-local Neural Network for Image Classification Sparsh Gupta et.al. 2407.18906 link
2024-07-26 Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment Yuze Zheng et.al. 2407.18854 null
2024-07-26 Local Binary Pattern(LBP) Optimization for Feature Extraction Zeinab Sedaghatjoo et.al. 2407.18665 null
2024-07-26 Topology Optimization of Random Memristors for Input-Aware Dynamic SNN Bo Wang et.al. 2407.18625 null
2024-07-26 Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification Huiyan Bai et.al. 2407.18593 null
2024-07-26 VSSD: Vision Mamba with Non-Casual State Space Duality Yuheng Shi et.al. 2407.18559 link
2024-07-25 Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images Roberto Di Via et.al. 2407.18125 null
2024-07-25 Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network Sukwon Yun et.al. 2407.17857 link
2024-07-25 SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification Heng Fang et.al. 2407.17689 link
2024-07-26 Unsqueeze [CLS] Bottleneck to Learn Rich Representations Qing Su et.al. 2407.17671 link
2024-07-24 Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference Catherine Huang et.al. 2407.17663 null
2024-07-23 S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks Neha A S et.al. 2407.17587 null
2024-07-24 A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks Fabiano Belém et.al. 2407.17284 null
2024-07-24 Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? Johannes Kiechle et.al. 2407.17219 link
2024-07-24 Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks Alessandro Sebastianelli et.al. 2407.17108 null
2024-07-24 An Adaptive Gradient Regularization Method Huixiu Jiang et.al. 2407.16944 null
2024-07-23 Lawma: The Power of Specialization for Legal Tasks Ricardo Dominguez-Olmedo et.al. 2407.16615 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 null
2024-07-23 Designing robust diffractive neural networks with improved transverse shift tolerance Daniil V. Soshnikov et.al. 2407.16456 null
2024-07-23 Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks Ayan Igali et.al. 2407.16268 null
2024-07-23 HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification Shuyi Ouyang et.al. 2407.16244 null
2024-07-23 Improved Few-Shot Image Classification Through Multiple-Choice Questions Dipika Khullar et.al. 2407.16145 null
2024-07-22 Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network Zhen Wang et.al. 2407.16021 null
2024-07-22 AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations Ikhtiyor Nematov et.al. 2407.16010 null
2024-07-22 Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models Aayush Saxena et.al. 2407.15904 null
2024-07-22 Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning Josiah Couch et.al. 2407.15724 null
2024-07-22 Retinomorphic Feature Detection and Machine Vision in a Network Laser Wai Kit Ng et.al. 2407.15558 null
2024-07-22 Learning deep illumination-robust features from multispectral filter array images Anis Amziane et.al. 2407.15472 null
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 null
2024-07-22 FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification Weiping Ding et.al. 2407.15312 null
2024-07-21 Assessing Sample Quality via the Latent Space of Generative Models Jingyi Xu et.al. 2407.15171 null
2024-07-21 A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts Gokcen Gokceoglu et.al. 2407.15136 null
2024-07-20 Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns Christos Kyrkou et.al. 2407.14831 link
2024-07-20 Subgraph Clustering and Atom Learning for Improved Image Classification Aryan Singh et.al. 2407.14772 null
2024-07-20 A Comprehensive Review of Few-shot Action Recognition Yuyang Wanyan et.al. 2407.14744 null
2024-07-19 DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks Sarah Jabbour et.al. 2407.14509 null
2024-07-19 Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models Xuenan Xu et.al. 2407.14355 null
2024-07-19 EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition Youssef Doulfoukar et.al. 2407.14314 null
2024-07-18 CoAPT: Context Attribute words for Prompt Tuning Gun Lee et.al. 2407.13808 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 Addressing Imbalance for Class Incremental Learning in Medical Image Classification Xuze Hao et.al. 2407.13768 null
2024-07-18 Differential Privacy Mechanisms in Neural Tangent Kernel Regression Jiuxiang Gu et.al. 2407.13621 null
2024-07-18 CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data Aristotelis Ballas et.al. 2407.13421 link
2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753 null
2024-07-17 Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients Dohyung Kim et.al. 2407.12637 null
2024-07-17 Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? Aman Sinha et.al. 2407.12626 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-17 Non-parametric regularization for class imbalance federated medical image classification Jeffry Wicaksana et.al. 2407.12446 link
2024-07-17 FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification Markus Weißflog et.al. 2407.12375 null
2024-07-17 Adaptive Cascading Network for Continual Test-Time Adaptation Kien X. Nguyen et.al. 2407.12240 null
2024-07-16 Generalized Coverage for More Robust Low-Budget Active Learning Wonho Bae et.al. 2407.12212 null
2024-07-18 A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification Markus Marks et.al. 2407.12210 null
2024-07-16 Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces Shumei Liu et.al. 2407.11701 null
2024-07-16 Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification Naif Alkhunaizi et.al. 2407.11573 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-16 PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer Pierre-David Letourneau et.al. 2407.11306 null
2024-07-15 Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Philipp Allgeuer et.al. 2407.11211 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910 link
2024-07-15 Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification Linhao Qu et.al. 2407.10814 null
2024-07-15 Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain Paweł Zyblewski et.al. 2407.10807 null
2024-07-15 Anticipating Future Object Compositions without Forgetting Youssef Zahran et.al. 2407.10723 null
2024-07-15 GeoMix: Towards Geometry-Aware Data Augmentation Wentao Zhao et.al. 2407.10681 link
2024-07-15 Learning Natural Consistency Representation for Face Forgery Video Detection Daichi Zhang et.al. 2407.10550 null
2024-07-15 Improving Hyperbolic Representations via Gromov-Wasserstein Regularization Yifei Yang et.al. 2407.10495 null
2024-07-15 Backdoor Attacks against Image-to-Image Networks Wenbo Jiang et.al. 2407.10445 null
2024-07-14 Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia Dimitris Papaioannou et.al. 2407.10251 null
2024-07-14 Advancing Continual Learning for Robust Deepfake Audio Classification Feiyi Dong et.al. 2407.10108 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Open Vocabulary Multi-Label Video Classification Rohit Gupta et.al. 2407.09073 null
2024-07-12 GPC: Generative and General Pathology Image Classifier Anh Tien Nguyen et.al. 2407.09035 null
2024-07-12 CAMP: Continuous and Adaptive Learning Model in Pathology Anh Tien Nguyen et.al. 2407.09030 null
2024-07-12 SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification Tong Shu et.al. 2407.08968 null
2024-07-12 Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification Ke Ji et.al. 2407.08959 null
2024-07-11 Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique Jackson Hamel et.al. 2407.08800 null
2024-07-11 Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification Wenshuo Peng et.al. 2407.08787 null
2024-07-11 ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions Jiu Feng et.al. 2407.08691 link
2024-07-11 Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks Andrey Ignatov et.al. 2407.08625 link
2024-07-11 BiasPruner: Debiased Continual Learning for Medical Image Classification Nourhan Bayasi et.al. 2407.08609 link
2024-07-11 GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification Aitao Yang et.al. 2407.08255 link
2024-07-11 Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis Gerard Christopher Yeo et.al. 2407.08182 null
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others Daniel Sikar et.al. 2407.07818 null
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 null
2024-07-10 HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification Omar S. EL-Assiouti et.al. 2407.07516 null
2024-07-10 Towards a text-based quantitative and explainable histopathology image analysis Anh Tien Nguyen et.al. 2407.07360 null
2024-07-11 FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification Doanh C. Bui et.al. 2407.07340 link
2024-07-10 Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken Peifu Liu et.al. 2407.07307 link
2024-07-09 Exploring Camera Encoder Designs for Autonomous Driving Perception Barath Lakshmanan et.al. 2407.07276 null
2024-07-09 CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion Hosam S. EL-Assiouti et.al. 2407.06673 null
2024-07-09 NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification Hongfei Huang et.al. 2407.06579 null
2024-07-08 Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches Y. Cordero et.al. 2407.06416 null
2024-07-08 GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images Jon Crall et.al. 2407.06337 null
2024-07-08 Multi-Label Plant Species Classification with Self-Supervised Vision Transformers Murilo Gustineli et.al. 2407.06298 link
2024-07-08 Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise Bidur Khanal et.al. 2407.05973 null
2024-07-08 Wavelet Convolutions for Large Receptive Fields Shahaf E. Finder et.al. 2407.05848 link
2024-07-08 Evaluating the Fairness of Neural Collapse in Medical Image Classification Kaouther Mouheb et.al. 2407.05843 null
2024-07-08 Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification Jiaying Shi et.al. 2407.05647 null
2024-07-08 New Directions in Text Classification Research: Maximizing The Performance of Sentiment Classification from Limited Data Surya Agustian et.al. 2407.05627 null
2024-07-08 Momentum Auxiliary Network for Supervised Local Learning Junhao Su et.al. 2407.05623 link
2024-07-08 Open-world Multi-label Text Classification with Extremely Weak Supervision Xintong Li et.al. 2407.05609 link
2024-07-08 FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance Jiedong Zhuang et.al. 2407.05578 null
2024-07-08 An accurate detection is not all you need to combat label noise in web-noisy datasets Paul Albert et.al. 2407.05528 null
2024-07-07 Leveraging Topological Guidance for Improved Knowledge Distillation Eun Som Jeon et.al. 2407.05316 link
2024-07-05 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Yuhan Zhu et.al. 2407.04603 null
2024-07-05 AMD: Automatic Multi-step Distillation of Large-scale Vision Models Cheng Han et.al. 2407.04208 null
2024-07-04 LeDNet: Localization-enabled Deep Neural Network for Multi-Label Radiography Image Classification Lalit Pant et.al. 2407.03931 null
2024-07-04 DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification Saifullah Saifullah et.al. 2407.03830 null
2024-07-04 reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis Kai Norman Clasen et.al. 2407.03653 link
2024-07-04 Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes Yusuke Hirota et.al. 2407.03623 null
2024-07-04 Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification Xuerong Zhang et.al. 2407.03596 null
2024-07-04 DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification Wenhui Zhu et.al. 2407.03575 link
2024-07-03 A multicategory jet image classification framework using deep neural network Jairo Orozco Sandoval et.al. 2407.03524 null
2024-07-03 Model Guidance via Explanations Turns Image Classifiers into Segmentation Models Xiaoyan Yu et.al. 2407.03009 null
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Fine-Grained Scene Image Classification with Modality-Agnostic Adapter Yiqun Wang et.al. 2407.02769 link
2024-07-03 ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers Yanfeng Jiang et.al. 2407.02763 null
2024-07-02 Spectral Graph Reasoning Network for Hyperspectral Image Classification Huiling Wang et.al. 2407.02647 null
2024-07-01 CGRclust: Chaos Game Representation for Twin Contrastive Clustering of Unlabelled DNA Sequences Fatemeh Alipour et.al. 2407.02538 link
2024-07-02 Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts Chunlan Ma et.al. 2407.02320 null
2024-07-03 Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis Sufen Ren et.al. 2407.02261 null
2024-07-02 Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification Shulei Qiu et.al. 2407.02123 null
2024-07-01 Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies Sebastian A. Cruz Romero et.al. 2407.01705 null
2024-07-02 xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart Tianrun Chen et.al. 2407.01530 link
2024-07-01 Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision Balaji VS et.al. 2407.01435 null
2024-07-01 Semantic Compositions Enhance Vision-Language Contrastive Learning Maxwell Aladago et.al. 2407.01408 null
2024-07-01 GalLoP: Learning Global and Local Prompts for Vision-Language Models Marc Lafon et.al. 2407.01400 null
2024-07-01 Protecting Privacy in Classifiers by Token Manipulation Re'em Harel et.al. 2407.01334 null
2024-07-01 Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks Roberto Alcover-Couso et.al. 2407.01327 null
2024-06-28 Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes Dmitry Demidov et.al. 2406.19814 link
2024-06-27 Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads Ali Khaleghi Rahimian et.al. 2406.19391 link
2024-06-27 Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation Yushun Tang et.al. 2406.19341 null
2024-06-27 Spiking Convolutional Neural Networks for Text Classification Changze Lv et.al. 2406.19230 link
2024-06-27 Adaptive Stochastic Weight Averaging Caglar Demir et.al. 2406.19092 link
2024-06-27 FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity Zhaobin Sun et.al. 2406.18995 link
2024-06-26 Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated Jiazhou Ji et.al. 2406.18259 null
2024-06-26 ViT-1.58b: Mobile Vision Transformers in the 1-bit Era Zhengqing Yuan et.al. 2406.18051 null
2024-06-25 Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation Tushar Prasanna Swaminathan et.al. 2406.17749 link
2024-06-25 Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning Arijit Sehanobish et.al. 2406.17740 null
2024-06-25 BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging Zeinab Sherkatghanad et.al. 2406.17640 link
2024-06-26 Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Sedigheh Eslami et.al. 2406.17639 null
2024-06-25 Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels Nicholas Pangakis et.al. 2406.17633 null
2024-06-25 Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification Huiyao Chen et.al. 2406.17534 link
2024-06-25 TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification Joshua Niemeijer et.al. 2406.17473 null
2024-06-25 Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning Jintao Yan et.al. 2406.17470 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 null
2024-06-25 Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection Peng Huang et.al. 2406.17338 null
2024-06-24 Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings Andrea Posada et.al. 2406.16611 link
2024-06-24 Improving robustness to corruptions with multiplicative weight perturbations Trung Trinh et.al. 2406.16540 null
2024-06-24 UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification Alvaro Lopez Pellicer et.al. 2406.16501 null
2024-06-24 Improving Quaternion Neural Networks with Quaternionic Activation Functions Johannes Pöppelbaum et.al. 2406.16481 null
2024-06-24 Learning in Wilson-Cowan model for metapopulation Raffaele Marino et.al. 2406.16453 link
2024-06-24 Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model Sai Ganesh et.al. 2406.16383 null
2024-06-24 Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels Zixia Jia et.al. 2406.16293 null
2024-06-23 Jacobian Descent for Multi-Objective Optimization Pierre Quinton et.al. 2406.16232 null
2024-06-23 Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction Yangdi Lu et.al. 2406.15982 null
2024-06-22 PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection Alvaro Lopez Pellcier et.al. 2406.15921 null
2024-06-21 Retrieval Augmented Zero-Shot Text Classification Tassallah Abdullahi et.al. 2406.15241 null
2024-06-21 DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation Yingying Fang et.al. 2406.15182 null
2024-06-21 This actually looks like that: Proto-BagNets for local and global interpretability-by-design Kerol Djoumessi et.al. 2406.15168 link
2024-06-21 Hierarchical thematic classification of major conference proceedings Arsentii Kuzmin et.al. 2406.14983 null
2024-06-21 Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks Minjong Cheon et.al. 2406.14916 link
2024-06-21 MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning Jiali Cheng et.al. 2406.14796 null
2024-06-20 Depth $F_1$ : Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability Parker Seegmiller et.al. 2406.14695 null
2024-06-20 Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning Niccolò Marini et.al. 2406.14351 null
2024-06-20 Self-supervised Interpretable Concept-based Models for Text Classification Francesco De Santis et.al. 2406.14335 null
2024-06-20 Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization Tanapat Ratchatorn et.al. 2406.14329 null
2024-06-20 Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach Mohamed Fadhlallah Guerri et.al. 2406.14120 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-21 CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification Faxu Guo et.al. 2406.14080 null
2024-06-20 Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods Tim Tsz-Kit Lau et.al. 2406.13936 null
2024-06-19 WATT: Weight Average Test-Time Adaption of CLIP David Osowiechi et.al. 2406.13875 link
2024-06-19 CNN Based Flank Predictor for Quadruped Animal Species Vanessa Suessle et.al. 2406.13588 null
2024-06-19 Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations Manjunath Mulimani et.al. 2406.13386 null
2024-06-18 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837 link
2024-06-18 Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation Nikolas Koutsoubis et.al. 2406.12815 link
2024-06-18 Online Anchor-based Training for Image Classification Tasks Maria Tzelepi et.al. 2406.12662 null
2024-06-18 Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation Branislav Pecher et.al. 2406.12471 null
2024-06-18 GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory Haoze Wu et.al. 2406.12375 null
2024-06-18 What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering Federico Errica et.al. 2406.12334 null
2024-06-18 Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification Zehui Liao et.al. 2406.12293 null
2024-06-18 Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics Hyojin Kim et.al. 2406.12258 null
2024-06-19 MiSuRe is all you need to explain your image segmentation Syed Nouman Hasany et.al. 2406.12173 null
2024-06-17 Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation Hamidreza Rouzegar et.al. 2406.12114 link
2024-06-17 Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Lei Zhu et.al. 2406.11837 link
2024-06-17 PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification Magdalena Trędowicz et.al. 2406.11443 null
2024-06-17 Cross-domain Open-world Discovery Shuo Wen et.al. 2406.11422 link
2024-06-17 BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models Xuefeng Hu et.al. 2406.11309 null
2024-06-17 An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers Ashim Gupta et.al. 2406.11307 null
2024-06-17 Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification Letian Peng et.al. 2406.11115 null
2024-06-16 Fine-grained Classes and How to Find Them Matej Grcić et.al. 2406.11070 link
2024-06-16 Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality Liwei Che et.al. 2406.11048 null
2024-06-16 Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval Rohan Chavan et.al. 2406.11029 link
2024-06-16 Universal Cross-Lingual Text Classification Riya Savant et.al. 2406.11028 null
2024-06-14 UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner Dongchao Yang et.al. 2406.10056 null
2024-06-14 Comparison of fine-tuning strategies for transfer learning in medical image classification Ana Davila et.al. 2406.10050 null
2024-06-14 Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last Guy Hacohen et.al. 2406.09935 null
2024-06-13 MirrorCheck: Efficient Adversarial Defense for Vision-Language Models Samar Fares et.al. 2406.09250 null
2024-06-13 Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models Christopher Schröder et.al. 2406.09206 null
2024-06-13 Large-Scale Evaluation of Open-Set Image Classification Techniques Halil Bisgin et.al. 2406.09112 link
2024-06-13 LaCoOT: Layer Collapse through Optimal Transport Victor Quétu et.al. 2406.08933 null
2024-06-13 The Penalized Inverse Probability Measure for Conformal Classification Paul Melki et.al. 2406.08884 null
2024-06-13 Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency Maor Dikter et.al. 2406.08840 link
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773 null
2024-06-12 Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification Martin Juan José Bucher et.al. 2406.08660 null
2024-06-12 Intelligent Multi-View Test Time Augmentation Efe Ozturk et.al. 2406.08593 null
2024-06-12 Transformation-Dependent Adversarial Attacks Yaoteng Tan et.al. 2406.08443 null
2024-06-12 AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer Yitao Xu et.al. 2406.08298 null
2024-06-12 DistilDoc: Knowledge Distillation for Visually-Rich Document Applications Jordy Van Landeghem et.al. 2406.08226 null
2024-06-12 Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor Yongjie Si et.al. 2406.08122 null
2024-06-12 Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network Yanxiong Li et.al. 2406.08119 null
2024-06-12 A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 Adversarial Evasion Attack Efficiency against Large Language Models João Vitorino et.al. 2406.08050 null
2024-06-12 Accurate Explanation Model for Image Classifiers using Class Association Embedding Ruitao Xie et.al. 2406.07961 link
2024-06-12 Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection Jie Feng et.al. 2406.07949 null
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456 link
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332 null
2024-06-11 Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment Takuto Igarashi et.al. 2406.07280 null
2024-06-11 EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels Shuqi Zhu et.al. 2406.07151 link
2024-06-11 RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents Wenjia Xu et.al. 2406.07089 null
2024-06-11 DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification Jiamu Sheng et.al. 2406.07050 null
2024-06-11 Fairness-Aware Meta-Learning via Nash Bargaining Yi Zeng et.al. 2406.07029 null
2024-06-11 Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models Zhenyi Lu et.al. 2406.07001 link
2024-06-11 Scaling up masked audio encoder learning for general audio classification Heinrich Dinkel et.al. 2406.06992 null
2024-06-10 Multi-Objective Neural Architecture Search for In-Memory Computing Md Hasibul Amin et.al. 2406.06746 null
2024-06-10 Robust Latent Representation Tuning for Image-text Classification Hao Sun et.al. 2406.06048 null
2024-06-09 Contrastive Learning from Synthetic Audio Doppelgangers Manuel Cherep et.al. 2406.05923 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification Yuxin Hong et.al. 2406.05677 null
2024-06-09 Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision Pranav Jeevan et.al. 2406.05612 link
2024-06-08 Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification Yunhe Gao et.al. 2406.05596 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184 link
2024-06-07 A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification Christian Giannetti et.al. 2406.05096 null
2024-06-07 Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations Benjamin Fresz et.al. 2406.05068 link
2024-06-07 REP: Resource-Efficient Prompting for On-device Continual Learning Sungho Jeon et.al. 2406.04772 null
2024-06-07 AICoderEval: Improving AI Domain Code Generation of Large Language Models Yinghui Xia et.al. 2406.04712 null
2024-06-07 Cooperative Meta-Learning with Gradient Augmentation Jongyun Shin et.al. 2406.04639 link
2024-06-06 OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference Dujian Ding et.al. 2406.04508 null
2024-06-06 Can Language Models Use Forecasting Strategies? Sarah Pratt et.al. 2406.04446 null
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330 link
2024-06-07 BEADs: Bias Evaluation Across Domains Shaina Raza et.al. 2406.04220 null
2024-06-06 What Do Language Models Learn in Context? The Structured Task Hypothesis Jiaoda Li et.al. 2406.04216 null
2024-06-06 Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness Lars Hillebrand et.al. 2406.04156 link
2024-06-07 ReDistill: Residual Encoded Distillation for Peak Memory Reduction Fang Chen et.al. 2406.03744 null
2024-06-06 LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification Chun Liu et.al. 2406.03725 link
2024-06-05 Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review Sonia Bbouzidi et.al. 2406.03478 null
2024-06-05 IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models David Ifeoluwa Adelani et.al. 2406.03368 null
2024-06-05 Audio Mamba: Bidirectional State Space Model for Audio Representation Learning Mehmet Hamza Erol et.al. 2406.03344 link
2024-06-05 FusionBench: A Comprehensive Benchmark of Deep Model Fusion Anke Tang et.al. 2406.03280 null
2024-06-05 VWise: A novel benchmark for evaluating scene classification for vehicular applications Pedro Azevedo et.al. 2406.03273 null
2024-06-05 Tiny models from tiny data: Textual and null-text inversion for few-shot distillation Erik Landolsi et.al. 2406.03146 link
2024-06-05 Exploiting LMM-based knowledge for image classification tasks Maria Tzelepi et.al. 2406.03071 null
2024-06-04 Randomized Geometric Algebra Methods for Convex Neural Networks Yifei Wang et.al. 2406.02806 null
2024-06-04 DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark Chi-Jui Chang et.al. 2406.02468 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-04 Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification Kwokho Ng et.al. 2406.02229 null
2024-06-03 Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) Zane Durante et.al. 2406.01662 link
2024-06-03 CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations Franz Motzkus et.al. 2406.01649 null
2024-06-03 Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients Yuncong Zuo et.al. 2406.01439 null
2024-06-03 Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization Firas Khader et.al. 2406.01314 null
2024-06-03 Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE Jiaxu Liu et.al. 2406.01282 null
2024-06-04 MultiMax: Sparse and Multi-Modal Attention Learning Yuxuan Zhou et.al. 2406.01189 link
2024-06-03 Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling Wrick Talukdar et.al. 2406.01096 null
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022 null
2024-05-31 Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study Pallavi Mitra et.al. 2405.20876 null
2024-05-31 Improving Generalization and Convergence by Enhancing Implicit Regularization Mingze Wang et.al. 2405.20763 null
2024-05-31 Robust Stable Spiking Neural Networks Jianhao Ding et.al. 2405.20694 null
2024-05-31 Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space Yukai Zhang et.al. 2405.20685 null
2024-05-31 GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification Hansang Lee et.al. 2405.20650 null
2024-05-31 ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos Krishanu Maity et.al. 2405.20628 null
2024-05-30 Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation Louis L. Chen et.al. 2405.20531 null
2024-05-30 DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Haoxing Chen et.al. 2405.19707 link
2024-05-30 A Novel Approach for Automated Design Information Mining from Issue Logs Jiuang Zhao et.al. 2405.19623 null
2024-05-29 I Bet You Did Not Mean That: Testing Semantic Importance via Betting Jacopo Teneggi et.al. 2405.19146 link
2024-05-29 Verifiably Robust Conformal Prediction Linus Jeary et.al. 2405.18942 null
2024-05-29 Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks Futa Waseda et.al. 2405.18770 null
2024-05-29 GIST: Greedy Independent Set Thresholding for Diverse Data Summarization Matthew Fahrbach et.al. 2405.18754 null
2024-05-29 LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification Renyi Qu et.al. 2405.18672 null
2024-05-28 Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap Abrar Fahim et.al. 2405.18570 null
2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang et.al. 2405.18415 link
2024-05-28 MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution Wenzhuo Liu et.al. 2405.18240 null
2024-05-28 Confidence-aware multi-modality learning for eye disease screening Ke Zou et.al. 2405.18167 link
2024-05-28 4-bit Shampoo for Memory-Efficient Network Training Sike Wang et.al. 2405.18144 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 null
2024-05-27 WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average Louis Fournier et.al. 2405.17517 null
2024-05-27 Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators Yunian Pan et.al. 2405.17370 null
2024-05-27 On the Noise Robustness of In-Context Learning for Text Generation Hongfu Gao et.al. 2405.17264 null
2024-05-27 Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification Shujun Yang et.al. 2405.17110 link
2024-05-26 Demystify Mamba in Vision: A Linear Attention Perspective Dongchen Han et.al. 2405.16605 null
2024-05-26 AdaFisher: Adaptive Second Order Optimization via Fisher Information Damien Martins Gomes et.al. 2405.16397 null
2024-05-25 ModelLock: Locking Your Model With a Spell Yifeng Gao et.al. 2405.16285 null
2024-05-25 Accelerating Transformers with Spectrum-Preserving Token Merging Hoai-Chau Tran et.al. 2405.16148 null
2024-05-25 Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack Mingli Zhu et.al. 2405.16134 null
2024-05-24 Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images Yiran Luo et.al. 2405.15961 null
2024-05-24 A Neurosymbolic Framework for Bias Correction in CNNs Parth Padalkar et.al. 2405.15886 null
2024-05-24 What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models Abdelrahman Abdelhamed et.al. 2405.15668 null
2024-05-24 Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning Wenhan Chang et.al. 2405.15662 null
2024-05-24 Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables James Hinns et.al. 2405.15661 null
2024-05-24 Harnessing Increased Client Participation with Cohort-Parallel Federated Learning Akash Dhasade et.al. 2405.15644 null
2024-05-24 Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification Barış Büyüktaş et.al. 2405.15405 null
2024-05-24 CLIP model is an Efficient Online Lifelong Learner Leyuan Wang et.al. 2405.15155 null
2024-05-24 OptLLM: Optimal Assignment of Queries to Large Language Models Yueyue Liu et.al. 2405.15130 null
2024-05-23 A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-time Adaptation for Vision-Language Models Mario Döbler et.al. 2405.14977 link
2024-05-23 Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron Can Cui1 et.al. 2405.14851 null
2024-05-23 Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property Yuya Yoshikawa et.al. 2405.14522 null
2024-05-23 SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification Zuoyong Li et.al. 2405.14506 null
2024-05-23 Scalable Visual State Space Model with Fractal Scanning Lv Tang et.al. 2405.14480 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-23 Boosting Robustness by Clipping Gradients in Distributed Learning Youssef Allouah et.al. 2405.14432 null
2024-05-23 Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators Changze Lv et.al. 2405.14362 null
2024-05-23 Simple Hamiltonian dynamics is a powerful quantum processing resource Akitada Sakurai et.al. 2405.14245 null
2024-05-23 ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks T. Y. S. S Santosh et.al. 2405.14211 null
2024-05-22 Just rotate it! Uncertainty estimation in closed-source models via multiple queries Konstantinos Pitas et.al. 2405.13864 null
2024-05-21 Decentralized Federated Learning Over Imperfect Communication Channels Weicai Li et.al. 2405.12894 null
2024-05-21 Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting Omar Hamed et.al. 2405.12705 null
2024-05-21 Exploration of Masked and Causal Language Modelling for Text Generation Nicolo Micheletti et.al. 2405.12630 null
2024-05-21 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification Yan He et.al. 2405.12487 null
2024-05-20 Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models Nida Nasir et.al. 2405.12126 null
2024-05-20 Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification Weilian Zhou et.al. 2405.12003 link
2024-05-20 A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers Tom Roth et.al. 2405.11904 null
2024-05-21 A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus Eduard Poesina et.al. 2405.11877 link
2024-05-20 SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model Siavash Shams et.al. 2405.11831 link
2024-05-20 Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques Siva Rajesh Kasa et.al. 2405.11775 null
2024-05-19 SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Jialong Guo et.al. 2405.11582 link
2024-05-19 Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification Manan Shah et.al. 2405.11574 link
2024-05-19 An Invisible Backdoor Attack Based On Semantic Feature Yangming Chen et.al. 2405.11551 null
2024-05-19 Verification technology for finger vein biometric George Kumi Kyeremeh et.al. 2405.11540 null
2024-05-17 Reduced storage direct tensor ring decomposition for convolutional neural networks compression Mateusz Gabor et.al. 2405.10802 link
2024-05-17 Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Jie Zhu et.al. 2405.10542 link
2024-05-17 Smart Expert System: Large Language Models as Text Classifiers Zhiqiang Wang et.al. 2405.10523 link
2024-05-16 Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge Florian Schmid et.al. 2405.10018 null
2024-05-16 ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset Johannes Rückert et.al. 2405.10004 link
2024-05-15 Improving Label Error Detection and Elimination with Uncertainty Quantification Johannes Jakubik et.al. 2405.09602 null
2024-05-15 Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck Hongru Li et.al. 2405.09514 null
2024-05-15 Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy Feng Wang et.al. 2405.09014 link
2024-05-14 The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks Ziquan Liu et.al. 2405.08886 link
2024-05-14 Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling Gregory Holste et.al. 2405.08780 null
2024-05-14 FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings Nancy Hada et.al. 2405.08776 null
2024-05-14 The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks Carmela Calabrese et.al. 2405.08695 null
2024-05-14 Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis Qingpeng Kong et.al. 2405.08681 link
2024-05-14 Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning Alain Riou et.al. 2405.08679 null
2024-05-14 Dual-Branch Network for Portrait Image Quality Assessment Wei Sun et.al. 2405.08555 null
2024-05-13 Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp Rachel Hong et.al. 2405.08209 link
2024-05-14 MambaOut: Do We Really Need Mamba for Vision? Weihao Yu et.al. 2405.07992 link
2024-05-13 Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics Haoyang Zheng et.al. 2405.07839 link
2024-05-13 Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent Michael Kohler et.al. 2405.07619 null
2024-05-13 On-device Online Learning and Semantic Management of TinyML Systems Haoyu Ren et.al. 2405.07601 null
2024-05-13 GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation Andrey V. Galichin et.al. 2405.07562 null
2024-05-13 Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents Juri Grosjean et.al. 2405.07513 null
2024-05-13 MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks Haijiang Tian et.al. 2405.07411 null
2024-05-12 Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images Fatema Tuj Johora Faria et.al. 2405.07338 null
2024-05-12 Differentiable Model Scaling using Differentiable Topk Kai Liu et.al. 2405.07194 null
2024-05-11 A framework of text-dependent speaker verification for chinese numerical string corpus Litong Zheng et.al. 2405.07029 null
2024-05-10 Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification Yaoqin Ye et.al. 2405.06468 null
2024-05-10 Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data Rongyu Zhang et.al. 2405.06413 null
2024-05-10 SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora Faisal Qarah et.al. 2405.06239 null
2024-05-09 Deep Multi-Task Learning for Malware Image Classification Ahmed Bensaoud et.al. 2405.05906 null
2024-05-09 Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing Matthew Squires et.al. 2405.05795 null
2024-05-09 CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks Nick et.al. 2405.05755 null
2024-05-09 How Quality Affects Deep Neural Networks in Fine-Grained Image Classification Joseph Smith et.al. 2405.05742 null
2024-05-09 End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base Shuling Li et.al. 2405.05738 null
2024-05-09 Using Machine Translation to Augment Multilingual Classification Adam King et.al. 2405.05478 null
2024-05-08 AFEN: Respiratory Disease Classification using Ensemble Learning Rahul Nadkarni et.al. 2405.05467 null
2024-05-08 XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples Peiqin Lin et.al. 2405.05116 link
2024-05-08 Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution Shuo Shao et.al. 2405.04825 null
2024-05-07 Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification Mukaffi Bin Moin et.al. 2405.04610 link
2024-05-07 Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs Antonio Bikić et.al. 2405.04386 null
2024-05-07 Semi-Supervised Disease Classification based on Limited Medical Image Data Yan Zhang et.al. 2405.04295 null
2024-05-07 DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects Da Fu et.al. 2405.04093 null
2024-05-07 Feature Map Convergence Evaluation for Functional Module Ludan Zhang et.al. 2405.04041 null
2024-05-07 VMambaCC: A Visual State Space Model for Crowd Counting Hao-Yuan Ma et.al. 2405.03978 null
2024-05-06 On Adversarial Examples for Text Classification by Perturbing Latent Representations Korn Sooksatra et.al. 2405.03789 null
2024-05-06 CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification Sankalp Sinha et.al. 2405.03660 null
2024-05-06 Deep Space Separable Distillation for Lightweight Acoustic Scene Classification ShuQi Ye et.al. 2405.03567 null
2024-05-06 Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing Han Liu et.al. 2405.03565 null
2024-05-06 A Lightweight Neural Architecture Search Model for Medical Image Classification Lunchen Xie et.al. 2405.03462 null
2024-05-06 Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification Matteo Bianchi et.al. 2405.03301 null
2024-05-06 TED: Accelerate Model Training by Internal Generalization Jinying Xiao et.al. 2405.03228 null
2024-05-06 Advancing Multimodal Medical Capabilities of Gemini Lin Yang et.al. 2405.03162 null
2024-05-05 A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) Lingyao Li et.al. 2405.03066 null
2024-05-05 Parameter-Efficient Fine-Tuning with Discrete Fourier Transform Ziqi Gao et.al. 2405.03003 null
2024-05-04 MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning Vishal Nedungadi et.al. 2405.02771 null
2024-05-03 Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification Siqi Yin et.al. 2405.02155 null
2024-05-03 The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification Minh Duc Bui et.al. 2405.02010 null
2024-05-03 Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts Felicia Riethmüller et.al. 2405.01904 null
2024-05-02 PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions Xun Jiao et.al. 2405.01741 null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 link
2024-05-02 SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients Tushar Verma et.al. 2405.01699 null
2024-05-02 Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey Rokas Gipiškis et.al. 2405.01636 null
2024-05-02 Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models Nishad Singhi et.al. 2405.01531 null
2024-05-03 Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks Mikkel Jordahn et.al. 2405.01196 null
2024-05-02 Uncertainty-aware self-training with expectation maximization basis transformation Zijia Wang et.al. 2405.01175 null
2024-05-02 Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification Muhammad Ahmad et.al. 2405.01095 null
2024-05-02 Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation Tianyi Chen et.al. 2405.01041 null
2024-05-02 Benchmarking Representations for Speech, Music, and Acoustic Events Moreno La Quatra et.al. 2405.00934 link
2024-05-01 Digital-analog quantum convolutional neural networks for image classification Anton Simen et.al. 2405.00548 null
2024-05-03 BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine Mingchen Li et.al. 2405.00465 null
2024-05-01 Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol Konstantinos Apostolidis et.al. 2405.00384 null
2024-05-01 Data Augmentation Policy Search for Long-Term Forecasting Liran Nochumsohn et.al. 2405.00319 null
2024-04-30 Let's Focus: Focused Backdoor Attack against Federated Transfer Learning Marco Arazzi et.al. 2404.19420 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair Jeonghoon Park et.al. 2404.19250 null
2024-04-29 Spectral-Spatial Mamba for Hyperspectral Image Classification Lingbo Huang et.al. 2404.18401 null
2024-04-28 TextGram: Towards a better domain-adaptive pretraining Sharayu Hiwarkhedkar et.al. 2404.18228 null
2024-04-28 L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi Saloni Mittal et.al. 2404.18216 link
2024-04-28 S $^2$ Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification Guanchun Wang et.al. 2404.18213 null
2024-04-27 Implicit Generative Prior for Bayesian Neural Networks Yijia Liu et.al. 2404.18008 link
2024-04-27 Towards Privacy-Preserving Audio Classification Systems Bhawana Chhaglani et.al. 2404.18002 null
2024-04-27 A Method of Moments Embedding Constraint and its Application to Semi-Supervised Learning Michael Majurski et.al. 2404.17978 null
2024-04-27 Spatial, Temporal, and Geometric Fusion for Remote Sensing Images Hessah Albanwan et.al. 2404.17851 null
2024-04-27 Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification Chao Yi et.al. 2404.17753 link
2024-04-26 SPLICE -- Streamlining Digital Pathology Image Processing Areej Alsaafin et.al. 2404.17704 null
2024-04-26 SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes Georgia Baltsou et.al. 2404.17255 null
2024-04-25 Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer Jianyu Zheng et.al. 2404.16627 link
2024-04-25 IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks Zitong Huang et.al. 2404.16331 null
2024-04-25 Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis Akshatha Mohan et.al. 2404.16268 link
2024-04-24 MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models Grace Guo et.al. 2404.16174 null
2024-04-24 MoDE: CLIP Data Experts via Clustering Jiawei Ma et.al. 2404.16030 link
2024-04-26 A Survey on Visual Mamba Hanwei Zhang et.al. 2404.15956 null
2024-04-24 Vision Transformer-based Adversarial Domain Adaptation Yahan Li et.al. 2404.15817 link
2024-04-24 Rethinking Model Prototyping through the MedMNIST+ Dataset Collection Sebastian Doerrich et.al. 2404.15786 null
2024-04-24 Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning Zuheng Kang et.al. 2404.15704 null
2024-04-24 Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification Liang Qu et.al. 2404.15585 null
2024-04-23 An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan et.al. 2404.15518 null
2024-04-23 Deep multi-prototype capsule networks Saeid Abbassi et.al. 2404.15445 null
2024-04-23 A review of deep learning-based information fusion techniques for multimodal medical image classification Yihao Li et.al. 2404.15022 null
2024-04-23 Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case Muhammad Asif Auyb et.al. 2404.14977 null
2024-04-23 Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14955 link
2024-04-23 Pyramid Hierarchical Transformer for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14945 link
2024-04-23 Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14944 link
2024-04-23 CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models Teodor Chiaburu et.al. 2404.14830 link
2024-04-22 WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models Ronald Xie et.al. 2404.14567 null
2024-04-22 CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective Wencheng Zhu et.al. 2404.14109 null
2024-04-21 EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder Hasanul Mahmud et.al. 2404.13770 null
2024-04-21 PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure Feiqi Cao et.al. 2404.13645 link
2024-04-21 I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning Songlin Dong et.al. 2404.13576 null
2024-04-21 IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models Tao Feng et.al. 2404.13504 null
2024-04-20 Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing Yuang Liu et.al. 2404.13434 null
2024-04-20 Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge Khuyagbaatar Batsuren et.al. 2404.13292 link
2024-04-20 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification Shyam Varahagiri et.al. 2404.13252 link
2024-04-19 On-board classification of underwater images using hybrid classical-quantum CNN based method Sreeraj Rajan Warrier et.al. 2404.13130 null
2024-04-19 Next Generation Loss Function for Image Classification Shakhnaz Akhmedova et.al. 2404.12948 null
2024-04-19 A Hybrid Generative and Discriminative PointNet on Unordered Point Sets Yang Ye et.al. 2404.12925 null
2024-04-19 Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment Danqing Ma et.al. 2404.12634 null
2024-04-18 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Asaf Yehudai et.al. 2404.12365 null
2024-04-18 Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training Jin Gao et.al. 2404.12210 link
2024-04-18 Concept Induction using LLMs: a user experiment for assessment Adrita Barua et.al. 2404.11875 null
2024-04-17 Pretraining Billion-scale Geospatial Foundational Models on Frontier Aristeidis Tsaris et.al. 2404.11706 null
2024-04-17 AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts Meng Jiang et.al. 2404.11449 null
2024-04-17 Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured Hanlin Mo et.al. 2404.11309 null
2024-04-17 A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene Wenbo Zhang et.al. 2404.11249 null
2024-04-17 A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation Bin Zhang et.al. 2404.11132 null
2024-04-17 Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification Pierre Lepagnol et.al. 2404.11122 null
2024-04-18 Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification Mohammad Shiri et.al. 2404.11052 null
2024-04-17 InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification Qi Han et.al. 2404.11003 link
2024-04-16 Incubating Text Classifiers Following User Instruction with Nothing but LLM Letian Peng et.al. 2404.10877 null
2024-04-16 Vocabulary-free Image Classification and Semantic Segmentation Alessandro Conti et.al. 2404.10864 link
2024-04-16 Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification Tasks Mohsen Hami et.al. 2404.10664 null
2024-04-16 Tree Bandits for Generative Bayes Sean O'Hagan et.al. 2404.10436 null
2024-04-16 AudioProtoPNet: An interpretable deep learning model for bird sound classification René Heinrich et.al. 2404.10420 null
2024-04-16 Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport Eduardo Fernandes Montesuma et.al. 2404.10261 null
2024-04-15 Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection Lisang Zhou et.al. 2404.10026 null
2024-04-15 Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models Hyeonggeun Yun et.al. 2404.09828 null
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737 null
2024-04-15 Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model Masahito Toba et.al. 2404.09585 null
2024-04-14 Breast Cancer Image Classification Method Based on Deep Transfer Learning Weimin Wang et.al. 2404.09226 null
2024-04-14 Coreset Selection for Object Detection Hojun Lee et.al. 2404.09161 null
2024-04-13 Exploring Explainability in Video Action Recognition Avinab Saha et.al. 2404.09067 null
2024-04-13 Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification Denis Huseljic et.al. 2404.08981 link
2024-04-13 PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification Zhenwei Wang et.al. 2404.08915 null
2024-04-12 VertAttack: Taking advantage of Text Classifiers' horizontal vision Jonathan Rusert et.al. 2404.08538 null
2024-04-12 SpectralMamba: Efficient Mamba for Hyperspectral Image Classification Jing Yao et.al. 2404.08489 null
2024-04-12 OTTER: Improving Zero-Shot Classification via Optimal Transport Changho Shin et.al. 2404.08461 null
2024-04-12 A Survey of Neural Network Robustness Assessment in Image Recognition Jie Wang et.al. 2404.08285 null
2024-04-12 Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example MingXuan Xiao et.al. 2404.08279 null
2024-04-11 HGRN2: Gated Linear RNNs with State Expansion Zhen Qin et.al. 2404.07904 link
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 null
2024-04-11 Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification Lucas Dedieu et.al. 2404.07605 link
2024-04-11 Learning to Classify New Foods Incrementally Via Compressed Exemplars Justin Yang et.al. 2404.07507 null
2024-04-11 Interactive Prompt Debugging with Sequence Salience Ian Tenney et.al. 2404.07498 null
2024-04-11 Privacy preserving layer partitioning for Deep Neural Network models Kishore Rajasekar et.al. 2404.07437 null
2024-04-11 CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models Sheng Wang et.al. 2404.07424 null
2024-04-11 Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling Sourajit Saha et.al. 2404.07410 null
2024-04-10 Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations Ofir Shifman et.al. 2404.07153 null
2024-04-10 Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization Michael Kohler et.al. 2404.07128 null
2024-04-10 Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach Anam Hashmi et.al. 2404.06941 null
2024-04-10 Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark Marina Ceccon et.al. 2404.06859 null
2024-04-10 Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution Brandon Morgan et.al. 2404.06679 null
2024-04-09 Variational Stochastic Gradient Descent for Deep Neural Networks Haotian Chen et.al. 2404.06549 link
2024-04-09 On adversarial training and the 1 Nearest Neighbor classifier Amir Hagai et.al. 2404.06313 link
2024-04-09 Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models David Kurzendörfer et.al. 2404.06309 link
2024-04-09 Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training Ming-Kun Xie et.al. 2404.06287 null
2024-04-09 Quantum Circuit $C^*$ -algebra Net Yuka Hashimoto et.al. 2404.06218 null
2024-04-09 VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection Li-Ming Zhan et.al. 2404.06217 link
2024-04-09 Symmetry-guided gradient descent for quantum neural networks Kaiming Bian et.al. 2404.06108 null
2024-04-10 Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures Ching-Kai Lin et.al. 2404.06080 null
2024-04-08 Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images Michael Deutges et.al. 2404.05584 null
2024-04-08 On the Convergence of Continual Learning with Adaptive Methods Seungyub Han et.al. 2404.05555 null
2024-04-08 Multi-Task Learning for Features Extraction in Financial Annual Reports Syrielle Montariol et.al. 2404.05281 link
2024-04-08 Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy Giang Nguyen et.al. 2404.05238 null
2024-04-08 iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection Nan Zhou et.al. 2404.05207 null
2024-04-08 Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods Roopkatha Dey et.al. 2404.05159 null
2024-04-07 PairAug: What Can Augmented Image-Text Pairs Do for Radiology? Yutong Xie et.al. 2404.04960 link
2024-04-07 GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets Dongjing Shan et.al. 2404.04924 null
2024-04-06 Focused Active Learning for Histopathological Image Classification Arne Schmidt et.al. 2404.04663 null
2024-04-06 Trustless Audits without Revealing Data or Models Suppakit Waiwitlikhit et.al. 2404.04500 null
2024-04-05 Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism Trilokesh Ranjan Sarkar et.al. 2404.04245 null
2024-04-05 Noisy Label Processing for Classification: A Survey Mengting Li et.al. 2404.04159 null
2024-04-05 Learning Correlation Structures for Vision Transformers Manjin Kim et.al. 2404.03924 null
2024-04-05 LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification Judy X Yang et.al. 2404.03883 null
2024-04-04 Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning Spyridon Chavlis et.al. 2404.03708 null
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493 null
2024-04-04 Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks Lei Zhang et.al. 2404.03340 null
2024-04-04 Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning Andrei Semenov et.al. 2404.03323 link
2024-04-04 FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification Xu Wang et.al. 2404.03225 null
2024-04-03 Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas E. Resck et.al. 2404.03098 link
2024-04-03 Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds Kamalika Chaudhuri et.al. 2404.02866 link
2024-04-03 FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang et.al. 2404.02772 link
2024-04-03 Adversarial Attacks and Dimensionality in Text Classifiers Nandish Chattopadhyay et.al. 2404.02660 null
2024-04-04 Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging Keqiang Fan et.al. 2404.02656 null
2024-04-03 Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations Emilio Villa-Cueva et.al. 2404.02452 link
2024-04-03 A Novel Approach to Breast Cancer Histopathological Image Classification Using Cross-Colour Space Feature Fusion and Quantum-Classical Stack Ensemble Method Sambit Mallick et.al. 2404.02447 null
2024-04-03 Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data Parth Patwa et.al. 2404.02422 null
2024-04-02 Smooth Deep Saliency Rudolf Herdt et.al. 2404.02282 null
2024-04-02 Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models Matthew Kowal et.al. 2404.02233 null
2024-04-02 ImageNot: A contrast with ImageNet preserves model rankings Olawale Salaudeen et.al. 2404.02112 null
2024-04-02 Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows Grace Guo et.al. 2404.02081 null
2024-04-02 Ukrainian Texts Classification: Exploration of Cross-lingual Knowledge Transfer Approaches Daryna Dementieva et.al. 2404.02043 null
2024-04-02 CAM-Based Methods Can See through Walls Magamed Taimeskhanov et.al. 2404.01964 link
2024-04-02 Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss Jaeha Kim et.al. 2404.01692 null
2024-04-02 A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification Quanwei Liu et.al. 2404.01673 null
2024-04-01 Can Biases in ImageNet Models Explain Generalization? Paul Gavrikov et.al. 2404.01509 link
2024-04-01 Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification Zuyu Xu et.al. 2404.01359 null
2024-04-01 Bridging Remote Sensors with Multisensor Geospatial Foundation Models Boran Han et.al. 2404.01260 link
2024-04-01 Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models Amir Faghihi et.al. 2404.01160 null
2024-03-29 Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Jaisidh Singh et.al. 2403.20312 link
2024-03-29 MCNet: A crowd denstity estimation network based on integrating multiscale attention module Qiang Guo et.al. 2403.20173 null
2024-03-29 Segmentation, Classification and Interpretation of Breast Cancer Medical Images using Human-in-the-Loop Machine Learning David Vázquez-Lema et.al. 2403.20112 null
2024-03-29 Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion Juhwan Choi et.al. 2403.20015 null
2024-03-29 Diverse Feature Learning by Self-distillation and Reset Sejik Park et.al. 2403.19941 null
2024-03-29 Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification Jianfeng Cai et.al. 2403.19902 link
2024-03-28 X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization Anna Kukleva et.al. 2403.19811 link
2024-03-28 RSMamba: Remote Sensing Image Classification with State Space Model Keyan Chen et.al. 2403.19654 link
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation Ozgu Goksu et.al. 2403.19579 null
2024-03-28 Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach Wei Dong et.al. 2403.19067 link
2024-03-27 Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data Yuting Guo et.al. 2403.19031 null
2024-03-27 Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning Soumyendu Sarkar et.al. 2403.18985 null
2024-03-27 The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision Andreas Müller et.al. 2403.18587 link
2024-03-27 Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks Tian Ye et.al. 2403.18318 null
2024-03-27 Multi-scale Unified Network for Image Classification Wenzhuo Liu et.al. 2403.18294 null
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921 link
2024-03-26 Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation Carlos Gomes et.al. 2403.17886 null
2024-03-26 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang et.al. 2403.17695 link
2024-03-26 Language Models for Text Classification: Is In-Context Learning Enough? Aleksandra Edwards et.al. 2403.17661 null
2024-03-26 Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image Classification Eva Pachetti et.al. 2403.17530 null
2024-03-26 HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification He Zhu et.al. 2403.17307 link
2024-03-25 Histogram Layers for Neural Engineered Features Joshua Peeples et.al. 2403.17176 link
2024-03-25 Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships Rangel Daroya et.al. 2403.17173 link
2024-03-25 CipherFormer: Efficient Transformer Private Inference with Low Round Complexity Weize Wang et.al. 2403.16860 null
2024-03-25 Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer Dominik Müller et.al. 2403.16695 null
2024-03-25 DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks Dominik Müller et.al. 2403.16678 link
2024-03-25 LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification Liu Junhua et.al. 2403.16504 null
2024-03-24 On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition Igor Sokolov et.al. 2403.16230 null
2024-03-24 Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis Shaojie Li et.al. 2403.16212 null
2024-03-24 Multi-Task Learning with Multi-Task Optimization Lu Bai et.al. 2403.16162 null
2024-03-24 CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data Shreya Sharma et.al. 2403.15974 link
2024-03-23 A Deep Learning Architectures for Kidney Disease Classification Muhammad Shoaib Farooq et.al. 2403.15895 null
2024-03-23 VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding Phong Nguyen-Thuan Do et.al. 2403.15882 null
2024-03-23 VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification Lanfeng Zhong et.al. 2403.15836 null
2024-03-22 Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion Sofia Casarin et.al. 2403.15194 null
2024-03-22 Image Classification with Rotation-Invariant Variational Quantum Circuits Paul San Sebastian et.al. 2403.15031 null
2024-03-22 Extracting Human Attention through Crowdsourced Patch Labeling Minsuk Chang et.al. 2403.15013 null
2024-03-22 Clean-image Backdoor Attacks Dazhong Rong et.al. 2403.15010 null
2024-03-22 ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding Novendra Setyawan et.al. 2403.15004 null
2024-03-22 MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection Sadiya Sayara Chowdhury Puspo et.al. 2403.14989 null
2024-03-21 Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention Ethan N. Evans et.al. 2403.14753 null
2024-03-21 Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images Tom Burgert et.al. 2403.14547 null
2024-03-21 Multi-Level Explanations for Generative Language Models Lucas Monteiro Paes et.al. 2403.14459 null
2024-03-21 Tensor network compressibility of convolutional models Sukhbinder Singh et.al. 2403.14379 null
2024-03-21 LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding Masato Fujitake et.al. 2403.14252 null
2024-03-21 Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations Xun Lin et.al. 2403.14250 null
2024-03-21 Improving Image Classification Accuracy through Complementary Intra-Class and Inter-Class Mixup Ye Xu et.al. 2403.14137 link
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797 null
2024-03-20 Leveraging feature communication in federated learning for remote sensing image classification Anh-Kiet Duong et.al. 2403.13575 null
2024-03-20 MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Di Wang et.al. 2403.13430 link
2024-03-20 Building Optimal Neural Architectures using Interpretable Knowledge Keith G. Mills et.al. 2403.13293 link
2024-03-19 LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images Jing Zhang et.al. 2403.13171 null
2024-03-19 Improved EATFormer: A Vision Transformer for Medical Image Classification Yulong Shisu et.al. 2403.13167 null
2024-03-19 SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification Yuexi Du et.al. 2403.13148 link
2024-03-19 Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs Raphael Norman-Tenazas et.al. 2403.13105 null
2024-03-19 Investigating Text Shortening Strategy in BERT: Truncation vs Summarization Mirza Alim Mutasodirin et.al. 2403.12799 link
2024-03-18 Posterior Uncertainty Quantification in Neural Networks using Data Augmentation Luhuan Wu et.al. 2403.12729 null
2024-03-19 SEVEN: Pruning Transformer Model by Reserving Sentinels Jinying Xiao et.al. 2403.12688 link
2024-03-19 Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service Mirza Alim Mutasodirin et.al. 2403.12563 null
2024-03-19 Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification Yi Lin et.al. 2403.12537 null
2024-03-19 CrossTune: Black-Box Few-Shot Classification with Label Enhancement Danqing Luo et.al. 2403.12468 null
2024-03-18 Generalizing deep learning models for medical image classification Matta Sarah et.al. 2403.12167 null
2024-03-19 Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks K. P. Santoso et.al. 2403.12009 null
2024-03-18 High-energy physics image classification: A Survey of Jet Applications Hamza Kheddar et.al. 2403.11934 null
2024-03-18 Better (pseudo-)labels for semi-supervised instance segmentation François Porcher et.al. 2403.11675 null
2024-03-18 Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2403.11530 link
2024-03-18 Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting Mingkui Tan et.al. 2403.11491 null
2024-03-17 Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability Haiyang Shi et.al. 2403.11331 null
2024-03-17 A Modified Word Saliency-Based Adversarial Attack on Text Classification Models Hetvi Waghela et.al. 2403.11297 null
2024-03-17 Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation Silvia Corbara et.al. 2403.11265 null
2024-03-17 Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification Shahabedin Nabavi et.al. 2403.11226 null
2024-03-16 Forward Learning of Graph Neural Networks Namyong Park et.al. 2403.11004 null
2024-03-16 Understanding Robustness of Visual State Space Models for Image Classification Chengbin Du et.al. 2403.10935 null
2024-03-16 Automatic location detection based on deep learning Anjali Karangiya et.al. 2403.10912 null
2024-03-14 Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models Akhil Kedia et.al. 2403.09635 link
2024-03-14 XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization Yequan Bie et.al. 2403.09410 null
2024-03-14 ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization Aleksandr Matsun et.al. 2403.09400 null
2024-03-14 A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification Sheng-Yao Wu et.al. 2403.09318 null
2024-03-14 CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification Yiming Ma et.al. 2403.09281 null
2024-03-14 Are Vision Language Models Texture or Shape Biased and Can We Steer Them? Paul Gavrikov et.al. 2403.09193 null
2024-03-14 Randomized Principal Component Analysis for Hyperspectral Image Classification Mustafa Ustuner et.al. 2403.09117 null
2024-03-14 CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification Hyunkyung Han et.al. 2403.09108 link
2024-03-14 The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? Qinyu Zhao et.al. 2403.09037 link
2024-03-13 PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning Qifeng Zhou et.al. 2403.08967 null
2024-03-13 DAM: Dynamic Adapter Merging for Continual Video QA Learning Feng Cheng et.al. 2403.08755 link
2024-03-13 Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification Yuxing Han et.al. 2403.08580 null
2024-03-13 HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers Francesco Dibitonto et.al. 2403.08536 link
2024-03-13 Pig aggression classification using CNN, Transformers and Recurrent Networks Junior Silva Souza et.al. 2403.08528 null
2024-03-13 Reduced Jeffries-Matusita distance: A Novel Loss Function to Improve Generalization Performance of Deep Classification Models Mohammad Lashkari et.al. 2403.08408 null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407 null
2024-03-13 Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks Khondoker Murad Hossain et.al. 2403.08208 null
2024-03-13 Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks Fuzhi Wu et.al. 2403.08157 link
2024-03-12 Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection Tharindu Kumarage et.al. 2403.08035 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721 link
2024-03-12 FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification Yijin Huang et.al. 2403.07576 null
2024-03-12 Backdoor Attack with Mode Mixture Latent Modification Hongwei Zhang et.al. 2403.07463 null
2024-03-12 In-context learning enables multimodal large language models to classify cancer pathology images Dyke Ferber et.al. 2403.07407 null
2024-03-12 Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning Mark D. McDonnell et.al. 2403.07356 null
2024-03-12 How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance Hongkang Li et.al. 2403.07310 null
2024-03-12 A Bayesian Approach to OOD Robustness in Image Classification Prakhar Kaushik et.al. 2403.07277 null
2024-03-11 LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations Mohammad Alkhalefi et.al. 2403.06813 null
2024-03-11 Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification Shuai Li et.al. 2403.06798 null
2024-03-11 Leveraging Internal Representations of Model for Magnetic Image Classification Adarsh N L et.al. 2403.06797 null
2024-03-11 Shortcut Learning in Medical Image Segmentation Manxi Lin et.al. 2403.06748 null
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517 null
2024-03-11 Evolving Knowledge Distillation with Large Language Models and Active Learning Chengyuan Liu et.al. 2403.06414 null
2024-03-11 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification Manish Chandra et.al. 2403.06402 null
2024-03-10 Probing Image Compression For Class-Incremental Learning Justin Yang et.al. 2403.06288 null
2024-03-10 Bayesian Random Semantic Data Augmentation for Medical Image Classification Yaoyao Zhu et.al. 2403.06138 link
2024-03-10 Universal Debiased Editing for Fair Medical Image Classification Ruinan Jin et.al. 2403.06104 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532 null
2024-03-08 Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation Yu Han et.al. 2403.05388 null
2024-03-08 The Impact of Quantization on the Robustness of Transformer-based Text Classifiers Seyed Parsa Neshaei et.al. 2403.05365 null
2024-03-08 Multiple Instance Learning with random sampling for Whole Slide Image Classification H. Keshvarikhojasteh et.al. 2403.05351 null
2024-03-08 Learning Expressive And Generalizable Motion Features For Face Forgery Detection Jingyi Zhang et.al. 2403.05172 null
2024-03-08 Defending Against Unforeseen Failure Modes with Latent Adversarial Training Stephen Casper et.al. 2403.05030 link
2024-03-07 Fooling Neural Networks for Motion Forecasting via Adversarial Attacks Edgar Medina et.al. 2403.04954 null
2024-03-07 T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers Mariano V. Ntrougkas et.al. 2403.04523 null
2024-03-07 Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging Dovile Juodelyte et.al. 2403.04484 link
2024-03-07 Advancing Biomedical Text Mining with Community Challenges Hui Zong et.al. 2403.04261 null
2024-03-07 Scalable On-Chip Optical Linear Processing Unit Using a Single Thin-Film Lithium Niobate Ring Modulator Zhaoang Deng et.al. 2403.04216 null
2024-03-07 Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models Evelyn Mannix et.al. 2403.04125 null
2024-03-07 Privacy-preserving Fine-tuning of Large Language Models through Flatness Tiejin Chen et.al. 2403.04124 null
2024-03-06 MedMamba: Vision Mamba for Medical Image Classification Yubiao Yue et.al. 2403.03849 link
2024-03-06 On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder Tingxu Han et.al. 2403.03846 link
2024-03-06 RADIA -- Radio Advertisement Detection with Intelligent Analytics Jorge Álvarez et.al. 2403.03538 null
2024-03-06 Inverse-Free Fast Natural Gradient Descent Method for Deep Learning Xinwei Ou et.al. 2403.03473 null
2024-03-06 Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN Biswadeep Chakraborty et.al. 2403.03409 null
2024-03-05 RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules Miaomiao Li et.al. 2403.02932 link
2024-03-05 Demonstrating Mutual Reinforcement Effect through Information Flow Chengguang Gan et.al. 2403.02902 null
2024-03-05 Quantum Mixed-State Self-Attention Network Fu Chen et.al. 2403.02871 null
2024-03-05 SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix Gayathri C et.al. 2403.02833 null
2024-03-05 SGD with Partial Hessian for Deep Neural Networks Optimization Ying Sun et.al. 2403.02681 link
2024-03-05 G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth Juan Zou et.al. 2403.02667 null
2024-03-05 Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad Sayantan Choudhury et.al. 2403.02648 link
2024-03-05 Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use Imad Eddine Toubal et.al. 2403.02626 null
2024-03-04 When do Convolutional Neural Networks Stop Learning? Sahan Ahmad et.al. 2403.02473 link
2024-03-04 NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function Abdullah Nazhat Abdullah et.al. 2403.02411 link
2024-03-02 Can a Confident Prior Replace a Cold Posterior? Martin Marek et.al. 2403.01272 link
2024-03-02 Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery Pedro H. V. Valois et.al. 2403.01183 null
2024-03-02 Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation Lian Xu et.al. 2403.01156 null
2024-03-02 ELA: Efficient Local Attention for Deep Convolutional Neural Networks Wei Xu et.al. 2403.01123 null
2024-03-01 Margin Discrepancy-based Adversarial Training for Multi-Domain Text Classification Yuan Wu et.al. 2403.00888 null
2024-03-01 Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment Margherita Martorana et.al. 2403.00884 null
2024-03-01 SURE: SUrvey REcipes for building reliable and robust deep networks Yuting Li et.al. 2403.00543 link
2024-03-01 Invariant Test-Time Adaptation for Vision-Language Model Generalization Huan Ma et.al. 2403.00376 null
2024-02-29 TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision Yunyi Zhang et.al. 2403.00165 null
2024-02-29 Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance Huakun Shen et.al. 2402.19401 null
2024-02-29 Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification Delfina Sol Martinez Pandiani et.al. 2402.19339 null
2024-02-29 Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction Hao Li et.al. 2402.19326 null
2024-02-29 Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation Fahimeh Hosseini Noohdani et.al. 2402.18919 null
2024-02-29 Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification Zihan Wang et.al. 2402.18825 link
2024-02-28 Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance Indu Panigrahi et.al. 2402.18742 link
2024-02-28 Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains Hafiz Tiomoko Ali et.al. 2402.18614 null
2024-02-28 Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling Mahdi Karami et.al. 2402.18508 null
2024-02-28 Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization Deng Li et.al. 2402.18447 null
2024-02-29 A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation Francesco Barbato et.al. 2402.18402 null
2024-02-28 A Multimodal Handover Failure Detection Dataset and Baselines Santosh Thoduka et.al. 2402.18319 null
2024-02-28 Classes Are Not Equal: An Empirical Study on Image Recognition Fairness Jiequan Cui et.al. 2402.18133 null
2024-02-27 Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers Yiwei Lu et.al. 2402.17710 null
2024-02-27 SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification Mohammed Q. Alkhatib et.al. 2402.17672 link
2024-02-27 Predict the Next Word: Evgenia Ilia et.al. 2402.17527 null
2024-02-27 Scaling Supervised Local Learning with Augmented Auxiliary Networks Chenxiang Ma et.al. 2402.17318 link
2024-02-26 Offline Writer Identification Using Convolutional Neural Network Activation Features Vincent Christlein et.al. 2402.17029 null

(back to top)

Object Detection

Publish Date Title Authors PDF Code
2025-06-17 YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework Dahang Wan et.al. 2506.14696 null
2025-06-17 VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning Md. Adnanul Islam et.al. 2506.14629 null
2025-06-17 GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments Farha Abdul Wasay et.al. 2506.14513 null
2025-06-17 Comparison of Two Methods for Stationary Incident Detection Based on Background Image Deepak Ghimire et.al. 2506.14256 null
2025-06-16 A Point Cloud Completion Approach for the Grasping of Partially Occluded Objects and Its Applications in Robotic Strawberry Harvesting Ali Abouzeid et.al. 2506.14066 null
2025-06-16 FindMeIfYouCan: Bringing Open Set metrics to $\textit{near} $, $ \textit{far} $ and $\textit{farther}$ Out-of-Distribution Object Detection Daniel Montoya et.al. 2506.14008 null
2025-06-16 How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection Kaiyuan Tan et.al. 2506.13722 null
2025-06-17 Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos Dipayan Biswas et.al. 2506.13657 link
2025-06-16 UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data Vasiliki Balaska et.al. 2506.13505 null
2025-06-16 Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection Shenqi Wang et.al. 2506.13440 null
2025-06-16 Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots Jaehong Oh et.al. 2506.13149 null
2025-06-15 MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection Yuxiang Wang et.al. 2506.12697 null
2025-06-14 UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers Yuantao Wang et.al. 2506.12324 null
2025-06-14 MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction Worasit Sangjan et.al. 2506.12295 link
2025-06-13 Vision-based Lifting of 2D Object Detections for Automated Driving Hendrik Königshof et.al. 2506.11839 null
2025-06-13 Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds Filippo Bragato et.al. 2506.11804 null
2025-06-13 GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers Guang Liang et.al. 2506.11784 null
2025-06-13 On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving Pedram MohajerAnsari et.al. 2506.11472 null
2025-06-12 Teaching in adverse scenes: a statistically feedback-driven threshold and mask adjustment teacher-student framework for object detection in UAV images under adverse scenes Hongyu Chen et.al. 2506.11175 null
2025-06-12 Discrete Lorenz Attractors in 3D Sinusoidal Maps Sishu Shankar Muni et.al. 2506.10788 null
2025-06-12 Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement Yuqi Shen et.al. 2506.10712 null
2025-06-12 Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection Xinyuan Liu et.al. 2506.10601 link
2025-06-12 Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun Wang et.al. 2506.10573 null
2025-06-12 FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion Tianpei Zhang et.al. 2506.10366 link
2025-06-11 DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos Rajeev Yasarla et.al. 2506.10242 null
2025-06-11 CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects Tao Liu et.al. 2506.09897 null
2025-06-11 3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection Yi Zhang et.al. 2506.09541 null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 Efficient Edge Deployment of Quantized YOLOv4-Tiny for Aerial Emergency Object Detection on Raspberry Pi 5 Sindhu Boddu et.al. 2506.09300 null
2025-06-10 Lightweight Object Detection Using Quantized YOLOv4-Tiny for Emergency Response in Aerial Imagery Sindhu Boddu et.al. 2506.09299 null
2025-06-10 WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras Yangjie Cui et.al. 2506.09098 null
2025-06-11 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations Amirreza Rouhi et.al. 2506.08968 null
2025-06-10 Data Augmentation For Small Object using Fast AutoAugment DaeEun Yoon et.al. 2506.08956 null
2025-06-11 Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting Keyi Liu et.al. 2506.08777 null
2025-06-09 CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing Zubin Bhuyan et.al. 2506.07885 null
2025-06-09 SAM2Auto: Auto Annotation Using FLASH Arash Rocky et.al. 2506.07850 null
2025-06-09 Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods Beining Xu et.al. 2506.07779 null
2025-06-09 SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding Xuemei Chen et.al. 2506.07737 null
2025-06-09 Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study Xiaomeng Zhu et.al. 2506.07539 null
2025-06-09 SpatialLM: Training Large Language Models for Structured Indoor Modeling Yongsen Mao et.al. 2506.07491 null
2025-06-09 Happiness Finder: Exploring the Role of AI in Enhancing Well-Being During Four-Leaf Clover Searches Anna Yokokubo et.al. 2506.07393 null
2025-06-09 Multiple Object Stitching for Unsupervised Representation Learning Chengchao Shen et.al. 2506.07364 link
2025-06-09 CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms Satvik Praveen et.al. 2506.07357 null
2025-06-08 UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning Weiqi Yan et.al. 2506.07087 null
2025-06-06 Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection Yu Li et.al. 2506.05872 null
2025-06-06 Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration Fanhu Zeng et.al. 2506.05709 null
2025-06-06 Integer Binary-Range Alignment Neuron for Spiking Neural Networks Binghao Ye et.al. 2506.05679 null
2025-06-05 CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media Tianyi Huang et.al. 2506.05107 null
2025-06-05 Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training Aneesh Deogan et.al. 2506.05092 null
2025-06-06 Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets Mikhail Kennerley et.al. 2506.04737 null
2025-06-05 Gen-n-Val: Agentic Image Data Generation and Validation Jing-En Huang et.al. 2506.04676 null
2025-06-05 VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection Wuyang Li et.al. 2506.04623 null
2025-06-04 FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices Shizhong Han et.al. 2506.04499 null
2025-06-04 Neural Object Detection for 4D STEM: High-Throughput Sub-Pixel Electron Diffraction Pattern Recognition Arda Genc et.al. 2506.04477 null
2025-06-04 Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector Boyong He et.al. 2506.04211 link
2025-06-04 FSHNet: Fully Sparse Hybrid Network for 3D Object Detection Shuai Liu et.al. 2506.03714 null
2025-06-04 How PARTs assemble into wholes: Learning the relative composition of images Melika Ayoughi et.al. 2506.03682 null
2025-06-05 MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection Xiaochun Lei et.al. 2506.03654 null
2025-06-04 DiagNet: Detecting Objects using Diagonal Constraints on Adjacency Matrix of Graph Neural Network Chong Hyun Lee et.al. 2506.03571 null
2025-06-03 SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports Dheeraj Khanna et.al. 2506.03335 null
2025-06-03 Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding Weiqing Xiao et.al. 2506.03134 null
2025-06-03 HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring Zhixiong Su et.al. 2506.02959 null
2025-06-03 Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection Yechi Ma et.al. 2506.02914 null
2025-06-03 A Dynamic Transformer Network for Vehicle Detection Chunwei Tian et.al. 2506.02765 null
2025-06-03 Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning Negin Baghbanzadeh et.al. 2506.02738 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Sight Guide: A Wearable Assistive Perception and Navigation System for the Vision Assistance Race in the Cybathlon 2024 Patrick Pfreundschuh et.al. 2506.02676 null
2025-06-03 Probabilistic Online Event Downsampling Andreu Girbau-Xalabarder et.al. 2506.02547 null
2025-06-03 Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning Kunyu Wang et.al. 2506.02462 null
2025-06-03 Auto-Labeling Data for Object Detection Brent A. Griffin et.al. 2506.02359 null
2025-05-30 Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Andrea Pedrotti et.al. 2505.24523 null
2025-05-30 Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing Anasse Boutayeb et.al. 2505.24489 null
2025-05-30 Leadership Assessment in Pediatric Intensive Care Unit Team Training Liangyang Ouyang et.al. 2505.24389 null
2025-05-30 D2AF: A Dual-Driven Annotation and Filtering Framework for Visual Grounding Yichi Zhang et.al. 2505.24372 null
2025-05-29 Conformal Object Detection by Sequential Risk Control Léo Andéol et.al. 2505.24038 null
2025-05-29 Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow et.al. 2505.23756 null
2025-05-29 Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need Qiang Wang et.al. 2505.23744 null
2025-05-29 FMG-Det: Foundation Model Guided Robust Object Detection Darryl Hannan et.al. 2505.23726 null
2025-05-29 CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection Woojin Shin et.al. 2505.23317 null
2025-05-30 WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver Assistance Systems Hao Wu et.al. 2505.23201 null
2025-05-29 Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images Sungjune Park et.al. 2505.23193 null
2025-05-29 DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes Sungjune Park et.al. 2505.23179 null
2025-05-29 The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector Aixuan Li et.al. 2505.22499 null
2025-05-28 VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond Noora Al-Emadi et.al. 2505.22353 link
2025-05-28 Task-Driven Implicit Representations for Automated Design of LiDAR Systems Nikhil Behari et.al. 2505.22344 null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 null
2025-05-28 Learning A Robust RGB-Thermal Detector for Extreme Modality Imbalance Chao Tian et.al. 2505.22154 null
2025-05-28 Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming Menghui Zhang et.al. 2505.22011 null
2025-05-28 Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection Guiping Cao et.al. 2505.21868 null
2025-05-27 Object Concepts Emerge from Motion Haoqian Liang et.al. 2505.21635 null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations Yue Li Du et.al. 2505.21454 null
2025-05-27 YOLO-SPCI: Enhancing Remote Sensing Object Detection via Selective-Perspective-Class Integration Xinyuan Wang et.al. 2505.21370 null
2025-05-27 Assured Autonomy with Neuro-Symbolic Perception R. Spencer Hallyburton et.al. 2505.21322 null
2025-05-27 Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing Dehao Wang et.al. 2505.21049 null
2025-05-27 Facial Attribute Based Text Guided Face Anonymization Mustafa İzzet Muştu et.al. 2505.21002 null
2025-05-27 YOLO-FireAD: Efficient Fire Detection via Attention-Guided Inverted Residual Learning and Dual-Pooling Feature Preservation Weichao Pan et.al. 2505.20884 null
2025-05-27 Open-Det: An Efficient Learning Framework for Open-Ended Detection Guiping Cao et.al. 2505.20639 null
2025-05-27 Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models Peter Robicheaux et.al. 2505.20612 null
2025-05-26 From Data to Modeling: Fully Open-vocabulary Scene Graph Generation Zuyao Chen et.al. 2505.20106 null
2025-05-26 Target Tracking via LiDAR-RADAR Sensor Fusion for Autonomous Racing Marcello Cellina et.al. 2505.20043 null
2025-05-26 Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement Afrah Shaahid et.al. 2505.19895 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-26 Neural nanophotonic object detector with ultra-wide field-of-view Ji Chen et.al. 2505.19379 null
2025-05-25 What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study Bhanuka Gamage et.al. 2505.19325 null
2025-05-25 VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion Zhiwei Lin et.al. 2505.18986 null
2025-05-24 Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling Hojun Son et.al. 2505.18446 null
2025-05-23 Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms Gefei Shen et.al. 2505.18302 null
2025-05-23 One RL to See Them All: Visual Triple Unified Reinforcement Learning Yan Ma et.al. 2505.18129 null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 null
2025-05-23 RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection Ozsel Kilinc et.al. 2505.17732 null
2025-05-23 Adaptive Semantic Token Communication for Transformer-based Edge Inference Alessio Devoto et.al. 2505.17604 null
2025-05-23 Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras Masataka Kobayashi et.al. 2505.17582 null
2025-05-23 OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics Jiangning Zhu et.al. 2505.17473 null
2025-05-23 Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds Hao Jing et.al. 2505.17442 null
2025-05-23 Optimizing YOLOv8 for Parking Space Detection: Comparative Analysis of Custom YOLOv8 Architecture Apar Pokhrel et.al. 2505.17364 null
2025-05-22 Extending Dataset Pruning to Object Detection: A Variance-based Approach Ryota Yagi et.al. 2505.17245 null
2025-05-22 Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining Shangquan Sun et.al. 2505.16811 null
2025-05-22 Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP Alya Zouzou et.al. 2505.16740 link
2025-05-22 CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving Huitong Yang et.al. 2505.16524 null
2025-05-22 MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection Yichen Li et.al. 2505.16442 null
2025-05-22 AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems Yuanhao Huang et.al. 2505.16402 link
2025-05-22 Self-Classification Enhancement and Correction for Weakly Supervised Object Detection Yufei Yin et.al. 2505.16294 null
2025-05-21 MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling Cheng Yifan et.al. 2505.15772 null
2025-05-21 The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection Tianjiao Cao et.al. 2505.15649 link
2025-05-21 SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks Iuliia Kotseruba et.al. 2505.15628 link
2025-05-21 Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network Chang Liu et.al. 2505.15518 null
2025-05-21 Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches Nudrat Habib et.al. 2505.15422 null
2025-05-21 RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation Naman Patel et.al. 2505.15373 null
2025-05-21 AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection Jiatao Li et.al. 2505.15261 null
2025-05-21 Multispectral Detection Transformer with Infrared-Centric Sensor Fusion Seongmin Hwang et.al. 2505.15137 null
2025-05-20 Colors Matter: AI-Driven Exploration of Human Feature Colors Rama Alyoubi et.al. 2505.14931 link
2025-05-20 Language Models Optimized to Fool Detectors Still Have a Distinct Style (And How to Change It) Rafael Rivera Soto et.al. 2505.14608 null
2025-05-20 SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation Yuyang Dong et.al. 2505.14381 null
2025-05-20 FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning Minh Ngoc Ta et.al. 2505.14271 null
2025-05-20 Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation Bin-Bin Gao et.al. 2505.14239 null
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 link
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 null
2025-05-20 Automated Quality Evaluation of Cervical Cytopathology Whole Slide Images Based on Content Analysis Lanlan Kang et.al. 2505.13875 null
2025-05-20 Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving Jingzheng Li et.al. 2505.13872 null
2025-05-20 Domain Gating Ensemble Networks for AI-Generated Text Detection Arihant Tripathi et.al. 2505.13855 null
2025-05-20 A Challenge to Build Neuro-Symbolic Video Agents Sahil Shah et.al. 2505.13851 null
2025-05-19 Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection Xiao Wang et.al. 2505.12908 link
2025-05-19 Rethinking Features-Fused-Pyramid-Neck for Object Detection Hulin Li et.al. 2505.12820 link
2025-05-19 Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran et.al. 2505.12789 null
2025-05-19 LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking Martha Teiko Teye et.al. 2505.12753 null
2025-05-19 VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection Aditya Taparia et.al. 2505.12715 null
2025-05-18 LM $^2$ otifs : An Explainable Framework for Machine-Generated Texts Detection Xu Zheng et.al. 2505.12507 null
2025-05-17 EarthSynth: Generating Informative Earth Observation with Diffusion Models Jiancheng Pan et.al. 2505.12108 null
2025-05-17 Experimental Study on Automatically Assembling Custom Catering Packages With a 3-DOF Delta Robot Using Deep Learning Methods Reihaneh Yourdkhani et.al. 2505.11879 null
2025-05-16 Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study Rana Poureskandar et.al. 2505.11424 null
2025-05-16 MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection Shrutarv Awasthi et.al. 2505.11282 null
2025-05-16 M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection Chao Wang et.al. 2505.10931 null
2025-05-16 A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation Jinke Li et.al. 2505.10825 null
2025-05-15 StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation Daniel A. P. Oliveira et.al. 2505.10292 link
2025-05-15 Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data Prashant P. Shinde et.al. 2505.10192 null
2025-05-15 Application of YOLOv8 in monocular downward multiple Car Target detection Shijie Lyu et.al. 2505.10016 null
2025-05-14 EdgeAI Drone for Autonomous Construction Site Demonstrator Emre Girgin et.al. 2505.09837 link
2025-05-14 WhatsAI: Transforming Meta Ray-Bans into an Extensible Generative AI Platform for Accessibility Nasif Zaman et.al. 2505.09823 null
2025-05-14 MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection Xiangyuan Peng et.al. 2505.09422 null
2025-05-14 A drone that learns to efficiently find objects in agricultural fields: from simulation to the real world Rick van Essen et.al. 2505.09278 null
2025-05-14 DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection Jianlin Sun et.al. 2505.09168 link
2025-05-14 Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models Lucas Choi et.al. 2505.09139 null
2025-05-14 Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance Guoying Liang et.al. 2505.09123 null
2025-05-13 Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores Hyunsik Na et.al. 2505.08835 null
2025-05-13 Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness Reihaneh Mirjalili et.al. 2505.08627 null
2025-05-14 Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections Xiao Ni et.al. 2505.08568 link
2025-05-13 MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM Saqi Hussain Kalan et.al. 2505.08388 null
2025-05-13 HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective Yu Zhang et.al. 2505.08231 link
2025-05-13 Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix Unai Gurbindo et.al. 2505.08228 null
2025-05-13 MoKD: Multi-Task Optimization for Knowledge Distillation Zeeshan Hayder et.al. 2505.08170 null
2025-05-12 LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention Jiangling Zhang et.al. 2505.07734 null
2025-05-12 Hybrid Spiking Vision Transformer for Object Detection with Event Cameras Qi Xu et.al. 2505.07715 null
2025-05-12 Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs Kamil Jeziorek et.al. 2505.07556 null
2025-05-12 Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies Efe Bozkir et.al. 2505.07552 null
2025-05-12 DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection Mingqian Ji et.al. 2505.07398 null
2025-05-12 Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection Hongda Qin et.al. 2505.07219 link
2025-05-11 Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection Zhengyang Lu et.al. 2505.07040 null
2025-05-11 VALISENS: A Validated Innovative Multi-Sensor System for Cooperative Automated Driving Lei Wan et.al. 2505.06980 null
2025-05-10 M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark Morui Zhu et.al. 2505.06746 null
2025-05-10 Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search XiaoTong Gu et.al. 2505.06694 null
2025-05-09 Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles Anupkumar Bochare et.al. 2505.06113 null
2025-05-09 Artificial intelligence pioneers the double-strangeness factory Yan He et.al. 2505.05802 null
2025-05-09 Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection Zhangchi Hu et.al. 2505.05741 null
2025-05-09 DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer Ho-Joong Kim et.al. 2505.05711 link
2025-05-08 PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model Zhang Zhang et.al. 2505.05397 null
2025-05-08 PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting Elad Feldman et.al. 2505.05183 null
2025-05-08 Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction Xiaowei Zhu et.al. 2505.05084 null
2025-05-08 FG-CLIP: Fine-Grained Visual and Textual Alignment Chunyu Xie et.al. 2505.05071 null
2025-05-08 A Simple Detector with Frame Dynamics is a Strong Tracker Chenxu Peng et.al. 2505.04917 null
2025-05-08 Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model Navin Ranjan et.al. 2505.04861 null
2025-05-07 Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective Songsong Duan et.al. 2505.04758 null
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 null
2025-05-08 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection Zhihao Zhang et.al. 2505.04594 null
2025-05-07 Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration Asma Baobaid et.al. 2505.04524 null
2025-05-07 Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition Asma Baobaid et.al. 2505.04502 null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 null
2025-05-06 LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs Xinyuan Zhang et.al. 2505.03460 null
2025-05-06 Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks Sun Haoxuan et.al. 2505.03435 null
2025-05-06 From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection Guoting Wei et.al. 2505.03334 null
2025-05-06 VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis Xinyuan Yan et.al. 2505.03132 null
2025-05-05 Sim2Real Transfer for Vision-Based Grasp Verification Pau Amargant et.al. 2505.03046 link
2025-05-05 DPNet: Dynamic Pooling Network for Tiny Object Detection Luqi Gong et.al. 2505.02797 null
2025-05-05 RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet Eliraz Orfaig et.al. 2505.02586 null
2025-05-05 Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation Hubert Padusinski et.al. 2505.02476 null
2025-05-04 Robust AI-Generated Face Detection with Imbalanced Data Yamini Sri Krubha et.al. 2505.02182 link
2025-05-04 Transforming faces into video stories -- VideoFace2.0 Branko Brkljač et.al. 2505.02060 null
2025-05-03 DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks Ali Al-Bustami et.al. 2505.01893 link
2025-05-03 OODTE: A Differential Testing Engine for the ONNX Optimizer Nikolaos Louloudakis et.al. 2505.01892 null
2025-05-03 CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture Vladimir Frants et.al. 2505.01882 null
2025-05-03 DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion Haoteng Li et.al. 2505.01857 null
2025-05-03 Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability Wenxuan Zhang et.al. 2505.01650 null
2025-05-02 Efficient Vision-based Vehicle Speed Estimation Andrej Macko et.al. 2505.01203 null
2025-05-02 CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion Boyuan Meng et.al. 2505.00938 null
2025-05-01 Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L Woong-Chan Byun et.al. 2505.00757 null
2025-05-03 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 null
2025-05-01 Visual Trajectory Prediction of Vessels for Inland Navigation Alexander Puzicha et.al. 2505.00599 null
2025-05-01 Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets Mathis Morales et.al. 2505.00584 null
2025-05-01 X-ray illicit object detection using hybrid CNN-transformer neural network architectures Jorgen Cani et.al. 2505.00564 null
2025-05-01 A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic Muhammad Imran Zaman et.al. 2505.00534 null
2025-05-01 Inconsistency-based Active Learning for LiDAR Object Detection Esteban Rivera et.al. 2505.00511 null
2025-05-01 HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection Esteban Rivera et.al. 2505.00507 null
2025-05-01 Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution Luigi Sigillo et.al. 2505.00334 null
2025-04-30 V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving Jannik Lübberstedt et.al. 2505.00156 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 null
2025-04-30 Visual Text Processing: A Comprehensive Review and Unified Evaluation Yan Shu et.al. 2504.21682 null
2025-04-29 T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection Manikanta Varaganti et.al. 2504.21231 null
2025-04-29 FLIM-based Salient Object Detection Networks with Adaptive Decoders Gilson Junior Soares et.al. 2504.20872 null
2025-04-29 A Survey on Event-based Optical Marker Systems Nafiseh Jabbari Tofighi et.al. 2504.20736 null
2025-04-29 Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection Siwei Wang et.al. 2504.20602 null
2025-04-29 Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection Jianhong Han et.al. 2504.20498 null
2025-04-28 More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV Kai Ye et.al. 2504.20032 null
2025-04-28 Lossy Source Coding with Focal Loss Alex Dytso et.al. 2504.19913 null
2025-04-28 Neural network task specialization via domain constraining Roman Malashin et.al. 2504.19592 null
2025-04-28 GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability Sehyeong Jo et.al. 2504.19414 null
2025-04-27 Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation Rayson Laroca et.al. 2504.19347 null
2025-04-27 ODExAI: A Comprehensive Object Detection Explainable AI Evaluation Loc Phuc Truong Nguyen et.al. 2504.19249 null
2025-04-27 Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction Xiaoran Xu et.al. 2504.19086 null
2025-04-26 Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving Gharbi Khamis Alshammari et.al. 2504.18939 null
2025-04-25 Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection Brian K. S. Isaac-Medina et.al. 2504.18746 null
2025-04-25 A Review of 3D Object Detection with Vision-Language Models Ranjan Sapkota et.al. 2504.18738 null
2025-04-25 Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models Patrick Müller et.al. 2504.18510 null
2025-04-25 Iterative Event-based Motion Segmentation by Variational Contrast Maximization Ryo Yamaki et.al. 2504.18447 null
2025-04-25 A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection Carlo Sgaravatti et.al. 2504.18419 null
2025-04-25 A comprehensive review of classifier probability calibration metrics Richard Oliver Lane et.al. 2504.18278 null
2025-04-25 LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring Raul David Dominguez Sanchez et.al. 2504.18203 null
2025-04-25 Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition Yin Tang et.al. 2504.18201 null
2025-04-25 E-InMeMo: Enhanced Prompting for Visual In-Context Learning Jiahao Zhang et.al. 2504.18158 null
2025-04-25 MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View Liugang Lu et.al. 2504.18136 null
2025-04-25 Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization Jiayi Chen et.al. 2504.18057 null
2025-04-25 Direct sampling method to retrieve small objects from two-dimensional limited-aperture scattered field data Won-Kwang Park et.al. 2504.18036 null
2025-04-24 DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks Yinqi Li et.al. 2504.17253 link
2025-04-24 Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Phillip Y. Lee et.al. 2504.17207 null
2025-04-24 AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models Mohammad Zarei et.al. 2504.17179 null
2025-04-23 Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection Jens Petersen et.al. 2504.17076 null
2025-04-23 Gaussian Splatting is an Effective Data Generator for 3D Object Detection Farhad G. Zanjani et.al. 2504.16740 null
2025-04-23 EHGCN: Hierarchical Euclidean-Hyperbolic Fusion via Motion-Aware GCN for Hybrid Event Stream Perception Haosheng Chen et.al. 2504.16616 null
2025-04-23 Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks Murat Bilgehan Ertan et.al. 2504.16557 null
2025-04-23 Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection Md Fahimuzzman Sohan et.al. 2504.16404 null
2025-04-23 Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection Linhua Kong et.al. 2504.16368 null
2025-04-22 Vision Controlled Orthotic Hand Exoskeleton Connor Blais et.al. 2504.16319 null
2025-04-22 $π_{0.5}$ : a Vision-Language-Action Model with Open-World Generalization Physical Intelligence et.al. 2504.16054 null
2025-04-22 SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems Manjunath D et.al. 2504.15728 null
2025-04-22 You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection Jun Dong et.al. 2504.15694 null
2025-04-22 A Vision-Enabled Prosthetic Hand for Children with Upper Limb Disabilities Md Abdul Baset Sarker et.al. 2504.15654 null
2025-04-21 Context Aware Grounded Teacher for Source Free Object Detection Tajamul Ashraf et.al. 2504.15404 null
2025-04-21 SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam Tue Vo et.al. 2504.15252 null
2025-04-21 An Efficient Aerial Image Detection with Variable Receptive Fields Liu Wenbin et.al. 2504.15165 null
2025-04-19 Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization Nazia Aslam et.al. 2504.14301 null
2025-04-19 Visual Consensus Prompting for Co-Salient Object Detection Jie Wang et.al. 2504.14254 link
2025-04-18 Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models Junjie Yang et.al. 2504.13825 null
2025-04-18 Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction Yushen He et.al. 2504.13647 link
2025-04-18 DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection Yang Zhang et.al. 2504.13638 null
2025-04-18 HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection YangChen Zeng et.al. 2504.13469 null
2025-04-18 Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety Shashank Shriram et.al. 2504.13399 link
2025-04-17 VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture Long Li et.al. 2504.13365 null
2025-04-17 SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling Yasin Almalioglu et.al. 2504.13310 null
2025-04-17 Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes Andreas Lau Hansen et.al. 2504.13297 null
2025-04-17 RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity Ranjan Sapkota et.al. 2504.13099 null
2025-04-17 Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving Shumin Wang et.al. 2504.12709 null
2025-04-18 RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding Hang Ji et.al. 2504.12643 null
2025-04-16 Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline Joanne Lin et.al. 2504.12169 null
2025-04-16 RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning Yuan Luo et.al. 2504.12167 null
2025-04-16 pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild Jonas Myhre Schiøtt et.al. 2504.12045 null
2025-04-16 A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions Rahima Khanam et.al. 2504.11995 null
2025-04-16 Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection Qishun Wang et.al. 2504.11779 null
2025-04-15 Multi-level Cellular Automata for FLIM networks Felipe Crispim Salvagnini et.al. 2504.11406 null
2025-04-15 OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution Lucio La Cava et.al. 2504.11369 null
2025-04-15 CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection Jincheng Kang et.al. 2504.11305 null
2025-04-15 TSAL: Few-shot Text Segmentation Based on Attribute Learning Chenming Li et.al. 2504.11164 null
2025-04-15 Flyweight FLIM Networks for Salient Object Detection in Biomedical Images Leonardo M. Joao et.al. 2504.11112 null
2025-04-15 S $^2$ Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection Yu Lin et.al. 2504.11111 null
2025-04-15 DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen Hyejin Lee et.al. 2504.11019 null
2025-04-16 GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* Eunsoo Im et.al. 2504.11014 null
2025-04-15 CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors Jiahuan Long et.al. 2504.10888 null
2025-04-15 Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task Aviral Chharia et.al. 2504.10880 null
2025-04-14 DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing Jinyue Zhang et.al. 2504.10278 null
2025-04-14 Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection Songze Li et.al. 2504.10214 null
2025-04-14 WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs Nguyen Ngoc Dat et.al. 2504.10165 null
2025-04-14 COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts Jiansheng Li et.al. 2504.10158 null
2025-04-14 SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting Dongliang Luo et.al. 2504.09966 null
2025-04-14 Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware Muhammad Fasih Tariq et.al. 2504.09900 null
2025-04-14 Density-based Object Detection in Crowded Scenes Chenyang Zhao et.al. 2504.09819 null
2025-04-13 Uncertainty Guided Refinement for Fine-Grained Salient Object Detection Yao Yuan et.al. 2504.09666 link
2025-04-13 Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds Yanze Jiang et.al. 2504.09506 null
2025-04-13 Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation Yongchao Feng et.al. 2504.09480 null
2025-04-11 TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing Neil Reichlin et.al. 2504.08655 null
2025-04-11 Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Jialu Li et.al. 2504.08641 null
2025-04-10 Enhanced Cooperative Perception Through Asynchronous Vehicle to Infrastructure Framework with Delay Mitigation for Connected and Automated Vehicles Nithish Kumar Saravanan et.al. 2504.08172 null
2025-04-10 Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection Meilun Zhou et.al. 2504.08054 null
2025-04-10 Detect Anything 3D in the Wild Hanxue Zhang et.al. 2504.07958 null
2025-04-11 Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks Erin Carson et.al. 2504.07835 null
2025-04-10 P2Object: Single Point Supervised Object Detection and Instance Segmentation Pengfei Chen et.al. 2504.07813 null
2025-04-10 Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement Daniel Torres et.al. 2504.07810 null
2025-04-10 Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network Peng Jia et.al. 2504.07777 null
2025-04-10 Prediction of Usage Probabilities of Shopping-Mall Corridors Using Heterogeneous Graph Neural Networks Malik M Barakathullah et.al. 2504.07645 null
2025-04-10 VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Haozhan Shen et.al. 2504.07615 link
2025-04-10 RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions Youngwan Jin et.al. 2504.07603 null
2025-04-10 WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer Huilin Yin et.al. 2504.07441 null
2025-04-10 Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction Qingchao Jiang et.al. 2504.07382 link
2025-04-09 Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection Ruoyu Chen et.al. 2504.07060 null
2025-04-09 UAV Position Estimation using a LiDAR-based 3D Object Detection Method Uthman Olawoye et.al. 2504.07028 null
2025-04-09 Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility Yuze Jiang et.al. 2504.06772 null
2025-04-09 Domain-Conditioned Scene Graphs for State-Grounded Task Planning Jonas Herzog et.al. 2504.06661 null
2025-04-09 Visually Similar Pair Alignment for Robust Cross-Domain Object Detection Onkar Krishna et.al. 2504.06607 null
2025-04-08 From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction Vladimir Golovkin et.al. 2504.06357 null
2025-04-08 Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images Hicham Talaoubrid et.al. 2504.06330 null
2025-04-08 Security Analysis of Thumbnail-Preserving Image Encryption and a New Framework Dong Xie et.al. 2504.06083 null
2025-04-08 Balancing long- and short-term dynamics for the modeling of saliency in videos Theodor Wulff et.al. 2504.05913 null
2025-04-08 PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario Sriram Mandalika et.al. 2504.05908 null
2025-04-08 Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation Xiangyu Zheng et.al. 2504.05904 null
2025-04-08 KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection Xingyuan Li et.al. 2504.05878 null
2025-04-08 DefMamba: Deformable Visual State Space Model Leiye Liu et.al. 2504.05794 null
2025-04-08 Event-based Civil Infrastructure Visual Defect Detection: ev-CIVIL Dataset and Benchmark Udayanga G. W. K. N. Gamage et.al. 2504.05679 null
2025-04-08 POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud Yining Shi et.al. 2504.05649 null
2025-04-08 AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes Zhenteng Li et.al. 2504.05601 null
2025-04-07 SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection Bonan Ding et.al. 2504.05170 null
2025-04-07 Inland Waterway Object Detection in Multi-environment: Dataset and Approach Shanshan Wang et.al. 2504.04835 null
2025-04-07 Playing Non-Embedded Card-Based Games with Reinforcement Learning Tianyang Wu et.al. 2504.04783 null
2025-04-07 Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding Zahir Alsulaimawi et.al. 2504.04772 null
2025-04-07 Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection Zhenxing Ming et.al. 2504.04732 null
2025-04-06 Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection Jiancheng Pan et.al. 2504.04517 link
2025-04-06 eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems Shuolong Chen et.al. 2504.04451 link
2025-04-05 Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications Brayan Monroy et.al. 2504.04228 null
2025-04-05 An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 Farbod Younesi et.al. 2504.04217 null
2025-04-05 Learning about the Physical World through Analytic Concepts Jianhua Sun et.al. 2504.04170 null
2025-04-04 VISTA-OCR: Towards generative and interactive end to end OCR models Laziz Hamdi et.al. 2504.03621 null
2025-04-04 PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector Kaidong Li et.al. 2504.03563 null
2025-04-04 ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving Sheng Yang et.al. 2504.03438 null
2025-04-04 Infrared bubble recognition in the Milky Way and beyond using deep learning Shimpei Nishimoto et.al. 2504.03367 null
2025-04-04 Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion Zeyang Zheng et.al. 2504.03171 null
2025-04-04 Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning Lucas Choi et.al. 2504.03168 null
2025-04-03 Attention-Aware Multi-View Pedestrian Tracking Reef Alturki et.al. 2504.03047 null
2025-04-03 LiDAR-based Object Detection with Real-time Voice Specifications Anurag Kulkarni et.al. 2504.02920 null
2025-04-03 BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation Van Nguyen Nguyen et.al. 2504.02812 null
2025-04-03 Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results Andrei Dumitriu et.al. 2504.02558 null
2025-04-03 Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision Xiaofeng Han et.al. 2504.02477 null
2025-04-03 CornerPoint3D: Look at the Nearest Corner Instead of the Center Ruixiao Zhang et.al. 2504.02464 null
2025-04-03 Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline Peifu Liu et.al. 2504.02416 null
2025-04-03 SemiISP/SemiIE: Semi-Supervised Image Signal Processor and Image Enhancement Leveraging One-to-Many Mapping sRGB-to-RAW Masakazu Yoshimura et.al. 2504.02345 null
2025-04-03 Improving Harmful Text Detection with Joint Retrieval and External Knowledge Zidong Yu et.al. 2504.02310 null
2025-04-03 LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection YiMing Yu et.al. 2504.02280 null
2025-04-02 Cat-Eye Inspired Active-Passive-Composite Aperture-Shared Sub-Terahertz Meta-Imager for Non-Interactive Concealed Object Detection Mingshuang Hu et.al. 2504.01473 null
2025-04-02 CFMD: Dynamic Cross-layer Feature Fusion for Salient Object Detection Jin Lian et.al. 2504.01326 null
2025-04-01 Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems Rachmad Vidya Wicaksana Putra et.al. 2504.00957 null
2025-04-01 NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds Mahan Rafidashti et.al. 2504.00859 null
2025-04-01 AttentiveGRU: Recurrent Spatio-Temporal Modeling for Advanced Radar-Based BEV Object Detection Loveneet Saini et.al. 2504.00559 null
2025-04-01 Archival Faces: Detection of Faces in Digitized Historical Documents Marek Vaško et.al. 2504.00558 null
2025-04-01 High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update Xinrun Xu et.al. 2504.00526 null
2025-04-01 Intrinsic-feature-guided 3D Object Detection Wanjing Zhang et.al. 2504.00382 null
2025-04-01 CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection Xin Zhang et.al. 2504.00375 null
2025-03-31 Towards Precise Action Spotting: Addressing Temporal Misalignment in Labels with Dynamic Label Assignment Masato Tamura et.al. 2504.00149 null
2025-03-31 SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection Chenyang Li et.al. 2503.24389 link
2025-03-31 MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing Karim Radouane et.al. 2503.24219 link
2025-03-31 Spectral-Adaptive Modulation Networks for Visual Perception Guhnoo Yun et.al. 2503.23947 null
2025-03-31 Reliable Traffic Monitoring Using Low-Cost Doppler Radar Units Mishay Naidoo et.al. 2503.23926 null
2025-03-31 Expanding-and-Shrinking Binary Neural Networks Xulong Shi et.al. 2503.23709 link
2025-03-30 Beyond Detection: Designing AI-Resilient Assessments with Automated Feedback Tool to Foster Critical Thinking Muhammad Sajjad Akbar et.al. 2503.23622 null
2025-03-30 Re-Aligning Language to Visual Objects with an Agentic Workflow Yuming Chen et.al. 2503.23508 null
2025-03-30 EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing Hongxiang Jiang et.al. 2503.23330 null
2025-03-29 Context in object detection: a systematic literature review Mahtab Jamali et.al. 2503.23249 null
2025-03-29 Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection Marc-Antoine Lavoie et.al. 2503.23220 null
2025-03-28 AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization Martin Kišš et.al. 2503.22526 null
2025-03-28 Data Quality Matters: Quantifying Image Quality Impact on Machine Learning Performance Christian Steinhauser et.al. 2503.22375 null
2025-03-28 ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection Nandakishor M et.al. 2503.22363 null
2025-03-28 SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection Shrikant Malviya et.al. 2503.22338 link
2025-03-28 Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data Juwei Guan et.al. 2503.22180 null
2025-03-28 A Survey on Remote Sensing Foundation Models: From Vision to Multimodality Ziyue Huang et.al. 2503.22081 null
2025-03-27 AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification Earl Ranario et.al. 2503.22019 null
2025-03-27 FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search Tony Tran et.al. 2503.21999 null
2025-03-27 Exponentially Weighted Instance-Aware Repeat Factor Sampling for Long-Tailed Object Detection Model Training in Unmanned Aerial Vehicles Surveillance Scenarios Taufiq Ahmed et.al. 2503.21893 null
2025-03-27 Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection Yun Zhu et.al. 2503.21099 link
2025-03-26 SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments Tanmoy Dam et.al. 2503.20614 link
2025-03-26 Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications Mahya Nikouei et.al. 2503.20516 null
2025-03-25 Gemini Robotics: Bringing AI into the Physical World Gemini Robotics Team et.al. 2503.20020 null
2025-03-25 Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception Luke Chen et.al. 2503.20011 null
2025-03-25 Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models Ilias Stogiannidis et.al. 2503.19707 null
2025-03-25 BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction Jan Kohút et.al. 2503.19658 null
2025-03-25 Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer Deepti Madurai Muthu et.al. 2503.19606 null
2025-03-25 MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection Jee Won Lee et.al. 2503.19330 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-24 Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery Sara Al-Emadi et.al. 2503.19202 null
2025-03-24 Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach Jakob Abeßer et.al. 2503.19161 null
2025-03-24 Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control Tohid Kargar Tasooji et.al. 2503.19135 null
2025-03-24 Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection Moussa Kassem Sbeyti et.al. 2503.18903 null
2025-03-24 LGI-DETR: Local-Global Interaction for UAV Object Detection Zifa Chen et.al. 2503.18785 null
2025-03-25 Frequency Dynamic Convolution for Dense Image Prediction Linwei Chen et.al. 2503.18783 null
2025-03-24 CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection Zhichao Sun et.al. 2503.18430 null
2025-03-24 Vision-Guided Loco-Manipulation with a Snake Robot Adarsh Salagame et.al. 2503.18308 null
2025-03-23 Extended Visibility of Autonomous Vehicles via Optimized Cooperative Perception under Imperfect Communication Ahmad Sarlak et.al. 2503.18192 null
2025-03-22 MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability Paul Hill et.al. 2503.17700 null
2025-03-22 Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving Yanan Ma et.al. 2503.17697 null
2025-03-21 Should we pre-train a decoder in contrastive learning for dense prediction tasks? Sébastien Quetin et.al. 2503.17526 null
2025-03-21 Event-Based Crossing Dataset (EBCD) Joey Mulé et.al. 2503.17499 null
2025-03-21 An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection Louis Y. Kim et.al. 2503.17285 null
2025-03-21 Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection Duanrui Yu et.al. 2503.17175 null
2025-03-21 Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving Alexandra Arzberger et.al. 2503.17168 null
2025-03-21 R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception Jonas Mirlach et.al. 2503.17122 null
2025-03-21 Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes Davide Antonio Mura et.al. 2503.17107 null
2025-03-21 R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model Boyuan Zheng et.al. 2503.17097 null
2025-03-21 Superpowering Open-Vocabulary Object Detectors for X-ray Vision Pablo Garcia-Fernandez et.al. 2503.17071 null
2025-03-21 Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos Yuang Feng et.al. 2503.17050 null
2025-03-21 Salient Object Detection in Traffic Scene through the TSOD10K Dataset Yu Qiu et.al. 2503.16910 null
2025-03-21 Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision Maoji Zheng et.al. 2503.16811 null
2025-03-20 RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles Dawood Wasif et.al. 2503.16251 null
2025-03-20 MapGlue: Multimodal Remote Sensing Image Matching Peihao Wu et.al. 2503.16185 null
2025-03-20 Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection Jiangyi Wang et.al. 2503.16125 null
2025-03-20 Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution Wanshu Fan et.al. 2503.16056 null
2025-03-19 A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition Ritabrata Chakraborty et.al. 2503.15639 null
2025-03-19 DCA: Dividing and Conquering Amnesia in Incremental Object Detection Aoting Zhang et.al. 2503.15295 null
2025-03-19 Test-Time Backdoor Detection for Object Detection Models Hangtao Zhang et.al. 2503.15293 null
2025-03-19 GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector Zechuan Li et.al. 2503.15211 null
2025-03-19 UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework Yang Li et.al. 2503.15161 null
2025-03-19 An Investigation of Beam Density on LiDAR Object Detection Performance Christoph Griesbacher et.al. 2503.15087 null
2025-03-19 SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection Haoyi Li et.al. 2503.15044 null
2025-03-19 Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark Ying Liu et.al. 2503.14862 null
2025-03-19 State Space Model Meets Transformer: A New Paradigm for 3D Object Detection Chuxin Wang et.al. 2503.14493 null
2025-03-18 Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images Nobuhiko Wakai et.al. 2503.14228 null
2025-03-18 A Revisit to the Decoder for Camouflaged Object Detection Seung Woo Ko et.al. 2503.14035 null
2025-03-18 Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator Xi Shen et.al. 2503.14034 null
2025-03-18 LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection Wei Lu et.al. 2503.14012 null
2025-03-18 FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene Lili Yang et.al. 2503.13951 null
2025-03-18 Is Discretization Fusion All You Need for Collaborative Perception? Kang Yang et.al. 2503.13946 null
2025-03-18 PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Barza Nisar et.al. 2503.13914 null
2025-03-18 HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object Detection Yuhao Qiu et.al. 2503.13906 null
2025-03-18 TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection Qiang Qi et.al. 2503.13903 null
2025-03-17 Beyond RGB: Adaptive Parallel Processing for RAW Object Detection Shani Gamrian et.al. 2503.13163 null
2025-03-17 Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa Babangida Sani et.al. 2503.13101 null
2025-03-17 SparseAlign: A Fully Sparse Framework for Cooperative Object Detection Yunshuang Yuan et.al. 2503.12982 null
2025-03-17 Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation Zhuoqun Su et.al. 2503.12914 null
2025-03-16 Point Cloud Based Scene Segmentation: A Survey Dan Halperin et.al. 2503.12595 null
2025-03-16 GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing Zilun Zhang et.al. 2503.12490 null
2025-03-16 Deepfake Detection with Optimized Hybrid Model: EAR Biometric Descriptor via Improved RCNN Ruchika Sharma et.al. 2503.12381 null
2025-03-15 An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation Aziz Amari et.al. 2503.12267 null
2025-03-15 Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing Boyu Chen et.al. 2503.12249 null
2025-03-15 SFMNet: Sparse Focal Modulation for 3D Object Detection Oren Shrout et.al. 2503.12093 null
2025-03-14 FLASHμ: Fast Localizing And Sizing of Holographic Microparticles Ayush Paliwal et.al. 2503.11538 null
2025-03-14 Falcon: A Remote Sensing Vision-Language Foundation Model Kelu Yao et.al. 2503.11070 null
2025-03-14 FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection Ming Deng et.al. 2503.11030 null
2025-03-14 Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime Gian Antariksa et.al. 2503.11008 null
2025-03-14 Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection Chuhan Zhang et.al. 2503.11005 null
2025-03-14 Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume Reef Alturki et.al. 2503.10982 null
2025-03-13 The Power of One: A Single Example is All it Takes for Segmentation in VLMs Mir Rayat Imtiaz Hossain et.al. 2503.10779 null
2025-03-13 HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer Zhang Zhang et.al. 2503.10777 null
2025-03-13 Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection Chaoqun Wang et.al. 2503.10579 null
2025-03-13 RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation Yuwen Du et.al. 2503.10410 link
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 Object detection characteristics in a learning factory environment using YOLOv8 Toni Schneidereit et.al. 2503.10356 null
2025-03-13 TARS: Traffic-Aware Radar Scene Flow Estimation Jialong Wu et.al. 2503.10210 null
2025-03-13 A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection Shenghao Fu et.al. 2503.10152 link
2025-03-13 Deep Learning-Based Direct Leaf Area Estimation using Two RGBD Datasets for Model Development Namal Jayasuriya et.al. 2503.10129 null
2025-03-13 Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection Zihao Zhang et.al. 2503.09968 null
2025-03-12 CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan et.al. 2503.09878 null
2025-03-12 How good are deep learning methods for automated road safety analysis using video data? An experimental study Qingwu Liu et.al. 2503.09807 null
2025-03-12 Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X Katharina Prasse et.al. 2503.09361 null
2025-03-12 Fully-Synthetic Training for Visual Quality Inspection in Automotive Production Christoph Huber et.al. 2503.09354 null
2025-03-12 DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection Chiara Cappellino et.al. 2503.09271 null
2025-03-12 Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection Qipeng Mei et.al. 2503.09187 null
2025-03-12 RFUAV: A Benchmark Dataset for Unmanned Aerial Vehicle Detection and Identification Rui Shi et.al. 2503.09033 null
2025-03-12 Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection Xuzhong Hu et.al. 2503.08992 null
2025-03-11 GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection Dušan Malić et.al. 2503.08639 null
2025-03-11 Referring to Any Person Qing Jiang et.al. 2503.08507 null
2025-03-11 SuperCap: Multi-resolution Superpixel-based Image Captioning Henry Senior et.al. 2503.08496 null
2025-03-13 Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels Qiming Xia et.al. 2503.08421 null
2025-03-11 Embodied Crowd Counting Runling Long et.al. 2503.08367 null
2025-03-11 Physics-based AI methodology for Material Parameter Extraction from Optical Data M. Koumans et.al. 2503.08183 null
2025-03-11 Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method Fei Wang et.al. 2503.08144 null
2025-03-11 Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning Lizhen Xu et.al. 2503.08101 link
2025-03-11 SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection Hyeongseok Son et.al. 2503.08092 null
2025-03-11 Simulating Automotive Radar with Lidar and Camera Inputs Peili Song et.al. 2503.08068 null
2025-03-10 YOLOE: Real-Time Seeing Anything Ao Wang et.al. 2503.07465 link
2025-03-10 HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection Qizhi Zheng et.al. 2503.07371 null
2025-03-10 Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection Weicheng He et.al. 2503.07330 null
2025-03-10 Semantic Communications with Computer Vision Sensing for Edge Video Transmission Yubo Peng et.al. 2503.07252 null
2025-03-10 MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction Hung Q. Vo et.al. 2503.07157 null
2025-03-10 A Light Perspective for 3D Object Detection Marcelo Eduardo Pederiva et.al. 2503.07133 null
2025-03-10 SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements Haiyang Xie et.al. 2503.07101 null
2025-03-10 RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations Ruidan Xing et.al. 2503.07085 null
2025-03-10 Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera Dong-Hee Paek et.al. 2503.07029 null
2025-03-10 Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection Wentao Wu et.al. 2503.06948 null
2025-03-06 Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems Jooyoung Lee et.al. 2503.04945 null
2025-03-06 Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach Soumyadeep Ro et.al. 2503.04918 null
2025-03-06 Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation David T. Hoffmann et.al. 2503.04718 null
2025-03-06 DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO Aditya Prashant Naidu et.al. 2503.04698 null
2025-03-06 Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection Riccardo De Monte et.al. 2503.04688 null
2025-03-06 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et.al. 2503.04500 null
2025-03-06 A lightweight model FDM-YOLO for small target improvement based on YOLOv8 Xuerui Zhang et.al. 2503.04452 null
2025-03-06 Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks Lukáš Gajdošech et.al. 2503.04308 null
2025-03-06 CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection Chupeng Liu et.al. 2503.04154 null
2025-03-06 Robust Computer-Vision based Construction Site Detection for Assistive-Technology Applications Junchi Feng et.al. 2503.04139 null
2025-03-06 Fractional Correspondence Framework in Detection Transformer Masoumeh Zareapoor et.al. 2503.04107 null
2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 null
2025-03-05 4D Radar Ground Truth Augmentation with LiDAR-to-4D Radar Data Synthesis Woo-Jin Jung et.al. 2503.03637 null
2025-03-05 Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Kristian Kuznetsov et.al. 2503.03601 null
2025-03-05 Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case Milin Patel et.al. 2503.03548 link
2025-03-05 AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates Adhish Anitha Vilasan et.al. 2503.03395 null
2025-03-05 MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images Nimra Dilawar et.al. 2503.03370 null
2025-03-05 Automated Attendee Recognition System for Large-Scale Social Events or Conference Gathering Dhruv Motwani et.al. 2503.03330 null
2025-03-05 BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation Hiep Truong Cong et.al. 2503.03280 null
2025-03-05 Find Matching Faces Based On Face Parameters Setu A. Bhatt et.al. 2503.03204 null
2025-03-04 Revolutionizing Traffic Management with AI-Powered Machine Vision: A Step Toward Smart Cities Seyed Hossein Hosseini DolatAbadi et.al. 2503.02967 null
2025-03-04 Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds? Miao Zhang et.al. 2503.02687 null
2025-03-04 Exploring Model Quantization in GenAI-based Image Inpainting and Detection of Arable Plants Sourav Modak et.al. 2503.02420 null
2025-03-04 Robust detection of overlapping bioacoustic sound events Louis Mahon et.al. 2503.02389 null
2025-03-04 YOLO-PRO: Enhancing Instance-Specific Object Detection with Full-Channel Global Self-Attention Lin Huang et.al. 2503.02348 null
2025-03-04 SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images Gargi Panda et.al. 2503.02270 null
2025-03-03 Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection Boyong He et.al. 2503.02101 null
2025-03-03 Uncertainty Representation in a SOTIF-Related Use Case with Dempster-Shafer Theory for LiDAR Sensor-Based Object Detection Milin Patel et.al. 2503.02087 link
2025-03-03 Visual-RFT: Visual Reinforcement Fine-Tuning Ziyu Liu et.al. 2503.01785 link
2025-03-03 Enhancing Object Detection Accuracy in Underwater Sonar Images through Deep Learning-based Denoising Ziyu Wang et.al. 2503.01655 null
2025-03-03 Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR Muhammad Musab Ansari et.al. 2503.01601 null
2025-02-28 The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection Rishi Mukherjee et.al. 2502.20651 null
2025-02-28 RTGen: Real-Time Generative Detection Transformer Chi Ruan et.al. 2502.20622 null
2025-02-28 LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation Zhefan Xu et.al. 2502.20607 null
2025-02-27 Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds Mohamed Abdelsamad et.al. 2502.20316 null
2025-02-27 OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Meng Lou et.al. 2502.20087 link
2025-02-27 Night-Voyager: Consistent and Efficient Nocturnal Vision-Aided State Estimation in Object Maps Tianxiao Gao et.al. 2502.20054 null
2025-02-27 Learning Mask Invariant Mutual Information for Masked Image Modeling Tao Huang et.al. 2502.19718 null
2025-02-27 BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance Xin Ye et.al. 2502.19694 null
2025-02-26 Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras Hoonhee Cho et.al. 2502.19630 null
2025-02-26 Is Your Paper Being Reviewed by an LLM? A New Benchmark Dataset and Approach for Detecting AI Text in Peer Review Sungduk Yu et.al. 2502.19614 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-25 Multi-Perspective Data Augmentation for Few-shot Object Detection Anh-Khoa Nguyen Vu et.al. 2502.18195 null
2025-02-25 Progressive Local Alignment for Medical Multimodal Pre-training Huimin Yan et.al. 2502.18047 null
2025-02-25 Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads Istiaq Ahmed Fahad et.al. 2502.17843 null
2025-02-24 Semi-Supervised Weed Detection in Vegetable Fields: In-domain and Cross-domain Experiments Boyang Deng et.al. 2502.17673 null
2025-02-24 Experimental validation of UAV search and detection system in real wilderness environment Stella Dumenčić et.al. 2502.17372 null
2025-02-24 LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR Xinxin Feng et.al. 2502.17039 null
2025-02-24 Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models Avinash Trivedi et.al. 2502.16857 null
2025-02-23 Geometry-Aware 3D Salient Object Detection Network Chen Wang et.al. 2502.16488 null
2025-02-26 MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering Caixiong Li et.al. 2502.16486 null
2025-02-23 Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment Zeyu Shangguan et.al. 2502.16469 null
2025-02-23 Deep learning approaches to surgical video segmentation and object detection: A Scoping Review Devanish N. Kamtam et.al. 2502.16459 null
2025-02-22 FeatSharp: Your Vision Model Features, Sharper Mike Ranzinger et.al. 2502.16025 null
2025-02-21 Generative AI Framework for 3D Object Generation in Augmented Reality Majid Behravan et.al. 2502.15869 null
2025-02-21 Machine-generated text detection prevents language model collapse George Drayson et.al. 2502.15654 null
2025-02-21 Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection Yue Sun et.al. 2502.15516 null
2025-02-21 Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection Jiangyong Yu et.al. 2502.15488 null
2025-02-21 PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments Yueting Liu et.al. 2502.15342 null
2025-02-20 Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios Richard Marcus et.al. 2502.15076 null
2025-02-20 YOLOv12: A Breakdown of the Key Architectural Features Mujadded Al Rabbani Alif et.al. 2502.14740 null
2025-02-20 LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera Weiyi Xiong et.al. 2502.14503 null
2025-02-20 ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 Tianyou Jiang et.al. 2502.14314 null
2025-02-19 PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection Rui Zhao et.al. 2502.14063 link
2025-02-19 Image compositing is all you need for data augmentation Ang Jia Ning Shermaine et.al. 2502.13936 null
2025-02-19 MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection Shuyong Gao et.al. 2502.13859 null
2025-02-19 An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice Wanke Xia et.al. 2502.13764 null
2025-02-18 Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation Noel Ngu et.al. 2502.13289 null
2025-02-18 RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection Jingtong Yue et.al. 2502.13071 null
2025-02-18 Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection Zijian Cao et.al. 2502.12735 null
2025-02-18 Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training Yuanfan Li et.al. 2502.12734 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 null
2025-02-18 Who Writes What: Unveiling the Impact of Author Roles on AI-generated Text Detection Jiatao Li et.al. 2502.12611 null
2025-02-18 Gaseous Object Detection Kailai Zhou et.al. 2502.12415 null
2025-02-17 AI-generated Text Detection with a GLTR-based Approach Lucía Yan Wu et.al. 2502.12064 null
2025-02-17 Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection Tessa Pulli et.al. 2502.12027 null
2025-02-17 ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability Ryuto Koike et.al. 2502.11336 null
2025-02-16 DAViMNet: SSMs-Based Domain Adaptive Object Detection A. Enes Doruk et.al. 2502.11178 null
2025-02-15 CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs Qizhen Lan et.al. 2502.10683 null
2025-02-14 Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding Wenxuan Guo et.al. 2502.10392 null
2025-02-14 Object Detection and Tracking Md Pranto et.al. 2502.10310 null
2025-02-14 Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study Yin-Chih Chelsea Wang et.al. 2502.10277 null
2025-02-13 Instance Segmentation of Scene Sketches Using Natural Image Priors Mia Tang et.al. 2502.09608 null
2025-02-13 Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection Yi Yu et.al. 2502.09471 link
2025-02-13 Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection Yan Zhang et.al. 2502.09311 null
2025-02-13 Billet Number Recognition Based on Test-Time Adaptation Yuan Wei et.al. 2502.09026 null
2025-02-12 Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection Ziyue Yang et.al. 2502.08373 link
2025-02-12 Modification and Generated-Text Detection: Achieving Dual Detection Capabilities for the Outputs of LLM by Watermark Yuhang Cai et.al. 2502.08332 null
2025-02-12 Plantation Monitoring Using Drone Images: A Dataset and Performance Review Yashwanth Karumanchi et.al. 2502.08233 null
2025-02-12 Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation Xiang Chen et.al. 2502.08221 null
2025-02-13 SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation Zhiming Ma et.al. 2502.08168 null
2025-02-12 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 null
2025-02-11 Visual-based spatial audio generation system for multi-speaker environments Xiaojing Liu et.al. 2502.07538 null
2025-02-11 Quantitative Analysis of Objects in Prisoner Artworks Thea Christoffersen et.al. 2502.07440 null
2025-02-11 Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving Novendra Setyawan et.al. 2502.07417 null
2025-02-11 Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems Ai Chen et.al. 2502.07351 link
2025-02-11 SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer Wenxi Li et.al. 2502.07216 null
2025-02-11 Dense Object Detection Based on De-homogenized Queries Yueming Huang et.al. 2502.07194 null
2025-02-11 Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m Zhenyue Wang et.al. 2502.07175 null
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-10 Multimodal Search on a Line Jared Coleman et.al. 2502.07000 null
2025-02-10 AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection Roohan Ahmed Khan et.al. 2502.06725 null
2025-02-10 EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices Akhila Matathammal et.al. 2502.06493 null
2025-02-10 PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts Badri Vishal Kasuba et.al. 2502.06172 null
2025-02-10 Enhancing Document Key Information Localization Through Data Augmentation Yue Dai et.al. 2502.06132 null
2025-02-10 Improved YOLOv5s model for key components detection of power transmission lines Chen Chen et.al. 2502.06127 null
2025-02-10 A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar Seung-Hyun Song et.al. 2502.06114 null
2025-02-09 Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery Yuhui Zeng et.al. 2502.05843 null
2025-02-08 Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector Qirui Wu et.al. 2502.05540 null
2025-02-07 Invizo: Arabic Handwritten Document Optical Character Recognition Solution Alhossien Waly et.al. 2502.05277 null
2025-02-07 LP-DETR: Layer-wise Progressive Relations for Object Detection Zhengjian Kang et.al. 2502.05147 null
2025-02-07 Counting Fish with Temporal Representations of Sonar Video Kai Van Brunt et.al. 2502.05129 null
2025-02-07 DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection Mingxuan Yan et.al. 2502.04804 null
2025-02-07 MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection Zhiqiang Yang et.al. 2502.04656 null
2025-02-07 AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers Runqing Jiang et.al. 2502.04628 null
2025-02-06 An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras Md. Jahin Alam et.al. 2502.04566 null
2025-02-06 Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection Minseok Jung et.al. 2502.04528 null
2025-02-06 OneTrack-M: A multitask approach to transformer-based MOT models Luiz C. S. de Araujo et.al. 2502.04478 null
2025-02-07 Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances Yi Yu et.al. 2502.04268 null
2025-02-06 An object detection approach for lane change and overtake detection from motion profiles Andrea Benericetti et.al. 2502.04244 null
2025-02-06 YOLOv4: A Breakthrough in Real-Time Object Detection Athulya Sundaresan Geetha et.al. 2502.04161 null
2025-02-06 Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks Yuhui Jin et.al. 2502.03877 null
2025-02-06 Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount Yanbiao Ma et.al. 2502.03852 null
2025-02-06 Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance Zhenwei He et.al. 2502.03835 null
2025-02-06 UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection Xi Song et.al. 2502.03761 null
2025-02-06 RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology Nhat-Tan Do et.al. 2502.03760 null
2025-02-05 An Empirical Study of Methods for Small Object Detection from Satellite Imagery Xiaohui Yuan et.al. 2502.03674 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 null
2025-02-05 RoboGrasp: A Universal Grasping Policy for Robust Robotic Control Yiqi Huang et.al. 2502.03072 null
2025-02-05 Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features Keiichiro Yamamura et.al. 2502.02895 null
2025-02-05 RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images Lei Yang et.al. 2502.02850 null
2025-02-04 Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Connor Schenck et.al. 2502.02562 null
2025-02-04 Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks Huiqun Huang et.al. 2502.02537 null
2025-02-04 Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features Hsin-Cheng Lu et.al. 2502.02322 null
2025-02-04 From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection Ashutosh Kumar et.al. 2502.02027 null
2025-02-04 Memory Efficient Transformer Adapter for Dense Predictions Dong Zhang et.al. 2502.01962 null
2025-02-04 INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy Nastaran Darabi et.al. 2502.01896 null
2025-02-04 SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset Goodarz Mehr et.al. 2502.01894 null
2025-02-03 Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection Reza Sadeghian et.al. 2502.01856 null
2025-02-03 GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection Jeffri Murrugarra-LLerena et.al. 2502.01565 null
2025-02-03 Human Body Restoration with One-Step Diffusion Model and A New Benchmark Jue Gong et.al. 2502.01411 null
2025-01-31 Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Ying Zang et.al. 2501.19329 null
2025-01-31 Beyond checkmate: exploring the creative chokepoints in AI text Nafis Irtiza Tripto et.al. 2501.19301 link
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-31 Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings Ahmed K. Kadhim et.al. 2501.18998 null
2025-01-31 Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques Samitha Vidhanaarachchi et.al. 2501.18835 null
2025-01-30 Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios David El-Chai Ben-Ezra et.al. 2501.18788 null
2025-01-30 Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms Abhinav Pratap et.al. 2501.18444 null
2025-01-29 Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks Donghwa Kang et.al. 2501.18412 null
2025-01-30 IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain Zhe Wang et.al. 2501.18162 null
2025-02-03 Efficient Feature Fusion for UAV Object Detection Xudong Wang et.al. 2501.17983 null
2025-01-29 TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection Lei Cheng et.al. 2501.17977 link
2025-01-28 Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC Tyler Wheeler et.al. 2501.17892 null
2025-01-29 Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms Burak Ulaş et.al. 2501.17538 null
2025-01-30 Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection Alicia Allmendinger et.al. 2501.17387 null
2025-01-28 DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications Muhammad Shahbaz et.al. 2501.17076 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications Chandrasekar Sridhar et.al. 2501.17028 null
2025-01-28 Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection Xiangyu Gao et.al. 2501.16981 null
2025-01-28 B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning Nikolaos Kaparinos et.al. 2501.16917 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-28 DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection MD Sadik Hossain Shanto et.al. 2501.16704 null
2025-01-27 Efficient Object Detection of Marine Debris using Pruned YOLO Model Abi Aryaza et.al. 2501.16571 null
2025-01-27 Object Detection for Medical Image Analysis: Insights from the RT-DETR Model Weijie He et.al. 2501.16469 null
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room Santiago Cepeda et.al. 2501.15994 null
2025-01-26 Classifying Deepfakes Using Swin Transformers Aprille J. Xi et.al. 2501.15656 null
2025-01-26 A Privacy Enhancing Technique to Evade Detection by Street Video Cameras Without Using Adversarial Accessories Jacob Shams et.al. 2501.15653 null
2025-01-26 Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection Zengran Wang et.al. 2501.15449 null
2025-01-26 FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control Zhijian Hao et.al. 2501.15426 null
2025-01-26 Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception Lianqing