Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification | Matt Poyser et.al. | 2506.14667 | null |
2025-06-17 | Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning | Prabhav Sanga et.al. | 2506.14515 | null |
2025-06-17 | Compositional Attribute Imbalance in Vision Datasets | Jiayi Chen et.al. | 2506.14418 | null |
2025-06-17 | One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | Renao Yan et.al. | 2506.14176 | null |
2025-06-17 | SeqPE: Transformer with Sequential Position Encoding | Huayang Li et.al. | 2506.13277 | null |
2025-06-15 | Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | Lu Chen et.al. | 2506.12875 | null |
2025-06-15 | Medical Argument Mining: Exploitation of Scarce Data Using NLI Systems | Maitane Urruela et.al. | 2506.12823 | null |
2025-06-15 | Cross-architecture universal feature coding via distribution alignment | Changsheng Gao et.al. | 2506.12737 | null |
2025-06-15 | Unsupervised Contrastive Learning Using Out-Of-Distribution Data for Long-Tailed Dataset | Cuong Manh Hoang et.al. | 2506.12698 | null |
2025-06-15 | Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Samarth Singhal et.al. | 2506.12683 | null |
2025-06-14 | OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification | Wenxiao Cai et.al. | 2506.12610 | null |
2025-06-14 | DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Darryl Ho et.al. | 2506.12585 | null |
2025-06-14 | MVP-CBM:Multi-layer Visual Preference-enhanced Concept Bottleneck Model for Explainable Medical Image Classification | Chunjiang Wang et.al. | 2506.12568 | null |
2025-06-14 | PLD: A Choice-Theoretic List-Wise Knowledge Distillation | Ejafa Bassam et.al. | 2506.12542 | null |
2025-06-13 | GeistBERT: Breathing Life into German NLP | Raphael Scheible-Schmitt et.al. | 2506.11903 | null |
2025-06-13 | Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression | Kuniko Paxton et.al. | 2506.11627 | null |
2025-06-13 | Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments | Deliang Jin et.al. | 2506.11615 | null |
2025-06-13 | Black-Box Edge AI Model Selection with Conformal Latency and Accuracy Guarantees | Anders E. Kalør et.al. | 2506.11391 | null |
2025-06-12 | SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification | Ali Waqas et.al. | 2506.10699 | null |
2025-06-13 | PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis | Marzieh Oghbaie et.al. | 2506.10669 | link |
2025-06-12 | Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | Chun Liu et.al. | 2506.10459 | null |
2025-06-12 | Can We Infer Confidential Properties of Training Data from LLMs? | Penguin Huang et.al. | 2506.10364 | null |
2025-06-12 | Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages | Ali Almutairi et.al. | 2506.10292 | null |
2025-06-11 | FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification | Jun Bai et.al. | 2506.10207 | null |
2025-06-11 | Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers | Natanael Lucena et.al. | 2506.10119 | null |
2025-06-11 | DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding | Bin Guo et.al. | 2506.10084 | null |
2025-06-11 | Evidential Deep Learning with Spectral-Spatial Uncertainty Disentanglement for Open-Set Hyperspectral Domain Generalization | Amirreza Khoshbakht et.al. | 2506.09460 | null |
2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
2025-06-10 | ScalableHD: Scalable and High-Throughput Hyperdimensional Computing Inference on Multi-Core CPUs | Dhruv Parikh et.al. | 2506.09282 | null |
2025-06-10 | Hyperbolic Dual Feature Augmentation for Open-Environment | Peilin Yu et.al. | 2506.08906 | null |
2025-06-10 | Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification | Matthias Beckmann et.al. | 2506.08761 | null |
2025-06-12 | InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba | Yuhang Wang et.al. | 2506.08735 | null |
2025-06-10 | Biologically Inspired Deep Learning Approaches for Fetal Ultrasound Image Classification | Rinat Prochii et.al. | 2506.08623 | null |
2025-06-10 | mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks | Luel Hagos Beyene et.al. | 2506.08400 | null |
2025-06-10 | An Adaptive Method Stabilizing Activations for Enhanced Generalization | Hyunseok Seung et.al. | 2506.08353 | null |
2025-06-11 | Hyperspectral Image Classification via Transformer-based Spectral-Spatial Attention Decoupling and Adaptive Gating | Guandong Li et.al. | 2506.08324 | null |
2025-06-09 | TokenBreak: Bypassing Text Classification Models Through Token Manipulation | Kasimir Schulz et.al. | 2506.07948 | null |
2025-06-09 | MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification | Iustin Sirbu et.al. | 2506.07801 | null |
2025-06-09 | Improving Memory Efficiency for Training KANs via Meta Learning | Zhangchi Zhao et.al. | 2506.07549 | null |
2025-06-09 | Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks | Shakir Yousefi et.al. | 2506.07500 | null |
2025-06-08 | Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification | Jintao Yan et.al. | 2506.07328 | null |
2025-06-08 | A Stable Whitening Optimizer for Efficient Neural Network Training | Kevin Frans et.al. | 2506.07254 | null |
2025-06-08 | Hierarchical Feature-level Reverse Propagation for Post-Training Neural Networks | Ni Ding et.al. | 2506.07188 | null |
2025-06-08 | CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação | Washington Cunha et.al. | 2506.07169 | null |
2025-06-08 | pFedSOP : Accelerating Training Of Personalized Federated Learning Using Second-Order Optimization | Mrinmay Sen et.al. | 2506.07159 | null |
2025-06-07 | Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry | Mahdi Salmani et.al. | 2506.06933 | null |
2025-06-06 | Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Yuanzhe Hu et.al. | 2506.06280 | null |
2025-06-06 | FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound | Minglang Chen et.al. | 2506.06054 | null |
2025-06-06 | Enhancing Orthopox Image Classification Using Hybrid Machine Learning and Deep Learning Models | Alejandro Puente-Castro et.al. | 2506.06007 | null |
2025-06-06 | LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles | Egil Rønningstad et.al. | 2506.05976 | null |
2025-06-06 | Integer Binary-Range Alignment Neuron for Spiking Neural Networks | Binghao Ye et.al. | 2506.05679 | null |
2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | null |
2025-06-05 | Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum | Snir Hordan et.al. | 2506.05530 | null |
2025-06-05 | Robustness Evaluation for Video Models with Reinforcement Learning | Ashwin Ramesh Babu et.al. | 2506.05431 | null |
2025-06-05 | Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts | Zhong Ji et.al. | 2506.04673 | null |
2025-06-04 | Deep Learning for Absorption-Image Analysis | Jacob Morrey et.al. | 2506.04517 | null |
2025-06-04 | KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Zixuan Xia et.al. | 2506.04432 | null |
2025-06-04 | Benchmarking Time-localized Explanations for Audio Classification Models | Cecilia Bolaños et.al. | 2506.04391 | null |
2025-06-04 | Hierarchical Text Classification Using Contrastive Learning Informed Path Guided Hierarchy | Neeraj Agrawal et.al. | 2506.04381 | null |
2025-06-04 | Recent Advances in Medical Image Classification | Loan Dao et.al. | 2506.04129 | null |
2025-06-04 | Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Mingxuan Xia et.al. | 2506.03857 | null |
2025-06-04 | RhoDARTS: Differentiable Quantum Architecture Search with Density Matrix Simulations | Swagat Kumar et.al. | 2506.03697 | null |
2025-06-04 | Directional Non-Commutative Monoidal Embeddings for MNIST | Mahesh Godavarti et.al. | 2506.03472 | null |
2025-06-03 | RoNFA: Robust Neural Field-based Approach for Few-Shot Image Classification with Noisy Labels | Nan Xiang et.al. | 2506.03461 | null |
2025-06-02 | Quantifying task-relevant representational similarity using decision variable correlation | Yu et.al. | 2506.02164 | null |
2025-06-02 | Towards Better Generalization and Interpretability in Unsupervised Concept-Based Models | Francesco De Santis et.al. | 2506.02092 | null |
2025-06-02 | OD3: Optimization-free Dataset Distillation for Object Detection | Salwa K. Al Khatib et.al. | 2506.01942 | null |
2025-06-02 | Generalized Gradient Norm Clipping & Non-Euclidean |
Thomas Pethick et.al. | 2506.01913 | null |
2025-06-02 | Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research | Jennifer Haase et.al. | 2506.01839 | null |
2025-06-02 | mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection | Dominik Macko et.al. | 2506.01702 | null |
2025-06-02 | Data Pruning by Information Maximization | Haoru Tan et.al. | 2506.01701 | null |
2025-06-02 | Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data | Zixiao Zhu et.al. | 2506.01621 | null |
2025-06-02 | Speed-up of Vision Transformer Models by Attention-aware Token Filtering | Takahiro Naruko et.al. | 2506.01519 | null |
2025-06-02 | A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification | Kamal Basha S et.al. | 2506.01445 | null |
2025-05-30 | Optimal Weighted Convolution for Classification and Denosing | Simone Cammarasana et.al. | 2505.24558 | null |
2025-05-30 | SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification | Zheng Wang et.al. | 2505.24380 | null |
2025-05-30 | Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification | Maciej Wielgosz et.al. | 2505.24375 | null |
2025-05-30 | GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Gilles Quentin Hacheme et.al. | 2505.24340 | null |
2025-05-30 | Provably Improving Generalization of Few-Shot Models with Synthetic Data | Lan-Cuong Nguyen et.al. | 2505.24190 | null |
2025-05-30 | FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System | Bhawana Chhaglani et.al. | 2505.24115 | null |
2025-05-30 | Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting | Chen Huang et.al. | 2505.24088 | null |
2025-05-29 | BIRD: Behavior Induction via Representation-structure Distillation | Galen Pogoncheff et.al. | 2505.23933 | null |
2025-05-29 | Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Qiang Wang et.al. | 2505.23744 | null |
2025-05-29 | Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds | Andrew Chang et.al. | 2505.23509 | link |
2025-05-29 | MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification | Yang Qiao et.al. | 2505.23365 | null |
2025-05-29 | DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification | Daoxi Cao et.al. | 2505.23341 | null |
2025-05-29 | Deep Modeling and Optimization of Medical Image Classification | Yihang Wu et.al. | 2505.23040 | link |
2025-05-28 | Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | Sylvey Lin et.al. | 2505.22926 | null |
2025-05-28 | Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision | Ziyue Kang et.al. | 2505.22701 | null |
2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
2025-05-28 | Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets | Dongyue Li et.al. | 2505.21930 | null |
2025-05-28 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Mehrdad Noori et.al. | 2505.21844 | null |
2025-05-27 | MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis | Yitong Li et.al. | 2505.21698 | null |
2025-05-27 | Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts | Yuxin Zhu et.al. | 2505.21324 | null |
2025-05-27 | Making Every Event Count: Balancing Data Efficiency and Accuracy in Event Camera Subsampling | Hesam Araghi et.al. | 2505.21187 | null |
2025-05-27 | Information-Theoretic Complementary Prompts for Improved Continual Text Classification | Duzhen Zhang et.al. | 2505.20933 | null |
2025-05-27 | Evidential Deep Active Learning for Semi-Supervised Classification | Shenkai Zhao et.al. | 2505.20691 | null |
2025-05-26 | UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | Xueyan Zhang et.al. | 2505.20154 | null |
2025-05-26 | Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases | Cheng-Yu Tai et.al. | 2505.20149 | null |
2025-05-26 | Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models | Antti Koskela et.al. | 2505.19969 | null |
2025-05-26 | Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning | Run Gu et.al. | 2505.19940 | null |
2025-05-26 | Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models | Mobina Mansoori et.al. | 2505.19779 | link |
2025-05-26 | Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Junming Liu et.al. | 2505.19699 | null |
2025-05-26 | Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models | Rui Cai et.al. | 2505.19616 | null |
2025-05-26 | Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning | Jiyu Hu et.al. | 2505.19522 | null |
2025-05-26 | DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models | Niloufar Alipour Talemi et.al. | 2505.19373 | null |
2025-05-25 | Remote Sensing Image Classification with Decoupled Knowledge Distillation | Yaping He et.al. | 2505.19111 | null |
2025-05-24 | MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images | Han Li et.al. | 2505.18741 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
2025-05-23 | KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches | Mingquan Feng et.al. | 2505.17919 | null |
2025-05-23 | Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation | Teruki Sano et.al. | 2505.17579 | null |
2025-05-23 | Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | Cheng Peng et.al. | 2505.17436 | null |
2025-05-23 | EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion | Zichuan Yang et.al. | 2505.17367 | null |
2025-05-22 | Extending Dataset Pruning to Object Detection: A Variance-based Approach | Ryota Yagi et.al. | 2505.17245 | null |
2025-05-23 | TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation | Yuhui Zhang et.al. | 2505.16923 | null |
2025-05-22 | Incremental Sequence Classification with Temporal Consistency | Lucas Maystre et.al. | 2505.16548 | null |
2025-05-22 | Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification | Amirreza Mahbod et.al. | 2505.16338 | null |
2025-05-22 | Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings | Arjhun Swaminathan et.al. | 2505.16313 | link |
2025-05-22 | Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces | Preeti Mehta et.al. | 2505.16253 | null |
2025-05-22 | When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification | Zirui Pang et.al. | 2505.16149 | null |
2025-05-21 | Small Language Models in the Real World: Insights from Industrial Text Classification | Lujun Li et.al. | 2505.16078 | null |
2025-05-21 | GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection | Mariia Seleznova et.al. | 2505.16017 | null |
2025-05-21 | Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers | Mehran Zoravar et.al. | 2505.15997 | null |
2025-05-21 | Large Language Models as Computable Approximations to Solomonoff Induction | Jun Wan et.al. | 2505.15784 | null |
2025-05-21 | FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models | Zhen Sun et.al. | 2505.15644 | null |
2025-05-21 | SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Iuliia Kotseruba et.al. | 2505.15628 | link |
2025-05-21 | Aligning Explanations with Human Communication | Jacopo Teneggi et.al. | 2505.15626 | null |
2025-05-21 | Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification | Conghao Xiong et.al. | 2505.15504 | null |
2025-05-21 | Adaptive Temperature Scaling with Conformal Prediction | Nikita Kotelevskii et.al. | 2505.15437 | null |
2025-05-21 | Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification | Bernardin Ligan et.al. | 2505.15334 | null |
2025-05-21 | Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data | Adib Bazgir et.al. | 2505.15132 | null |
2025-05-20 | Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications | Fadel M. Megahed et.al. | 2505.14918 | null |
2025-05-20 | Solving MNIST with a globally trained Mixture of Quantum Experts | Paolo Alessandro Xavier Tognini et.al. | 2505.14789 | null |
2025-05-20 | Guarded Query Routing for Large Language Models | Richard Šléher et.al. | 2505.14524 | null |
2025-05-20 | PRL: Prompts from Reinforcement Learning | Paweł Batorski et.al. | 2505.14412 | null |
2025-05-20 | Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach | Inder Pal Singh et.al. | 2505.14333 | link |
2025-05-20 | HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | Shamsuddeen Hassan Muhammad et.al. | 2505.14311 | null |
2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
2025-05-20 | Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification | Yibo Gao et.al. | 2505.14049 | null |
2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | null |
2025-05-19 | Synthetic-Powered Predictive Inference | Meshi Bashari et.al. | 2505.13432 | null |
2025-05-20 | Unlabeled Data or Pre-trained Model: Rethinking Semi-Supervised Learning and Pretrain-Finetuning | Song-Lin Li et.al. | 2505.13317 | null |
2025-05-19 | A Physics-Inspired Optimizer: Velocity Regularized Adam | Pranav Vaidhyanathan et.al. | 2505.13196 | null |
2025-05-19 | Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision | Pengcheng Pan et.al. | 2505.13191 | null |
2025-05-19 | Learning to Adapt to Position Bias in Vision Transformer Classifiers | Robert-Jan Bruintjes et.al. | 2505.13137 | link |
2025-05-19 | When majority rules, minority loses: bias amplification of gradient descent | François Bachoc et.al. | 2505.13122 | null |
2025-05-19 | Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification | Xiao Wu et.al. | 2505.13039 | null |
2025-05-19 | EPIC: Explanation of Pretrained Image Classification Networks via Prototype | Piotr Borycki et.al. | 2505.12897 | link |
2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
2025-05-19 | An approach based on class activation maps for investigating the effects of data augmentation on neural networks for image classification | Lucas M. Dorneles et.al. | 2505.12581 | null |
2025-05-16 | Energy efficiency analysis of Spiking Neural Networks for space applications | Paolo Lunghi et.al. | 2505.11418 | null |
2025-05-16 | Harnessing Photon Indistinguishability in Quantum Extreme Learning Machines | Malo Joly et.al. | 2505.11238 | null |
2025-05-16 | CheX-DS: Improving Chest X-ray Image Classification with Ensemble Learning Based on DenseNet and Swin Transformer | Xinran Li et.al. | 2505.11168 | null |
2025-05-16 | Privacy-Aware Lifelong Learning | Ozan Özdenizci et.al. | 2505.10941 | null |
2025-05-16 | MCU: Improving Machine Unlearning through Mode Connectivity | Yingdan Shi et.al. | 2505.10859 | null |
2025-05-15 | CLIP Embeddings for AI-Generated Image Detection: A Few-Shot Study with Lightweight Classifier | Ziyang Ou et.al. | 2505.10664 | null |
2025-05-15 | Research of the Variational Shadow Quantum Circuit Based on the Whale Optimization Algorithm in Image Classification | Shuang Wu et.al. | 2505.09994 | null |
2025-05-14 | Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting | Chen-Yu Liu et.al. | 2505.09395 | null |
2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | link |
2025-05-17 | PrePrompt: Predictive prompting for class incremental learning | Libo Huang et.al. | 2505.08586 | link |
2025-05-13 | Convolutional Spiking Neural Network for Image Classification | Mikhail Kiselev et.al. | 2505.08514 | null |
2025-05-13 | CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets | Aidar Amangeldi et.al. | 2505.08259 | null |
2025-05-13 | Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification | Xiaoshuo Yan et.al. | 2505.08173 | null |
2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
2025-05-12 | Hierarchical Sparse Attention Framework for Computationally Efficient Classification of Biological Cells | Elad Yoshai et.al. | 2505.07661 | null |
2025-05-12 | Synthetic Similarity Search in Automotive Production | Christoph Huber et.al. | 2505.07256 | null |
2025-05-12 | Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models | Yan Xie et.al. | 2505.07209 | null |
2025-05-12 | KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification | Hajar Sakai et.al. | 2505.07162 | null |
2025-05-11 | A Vision-Language Foundation Model for Leaf Disease Identification | Khang Nguyen Quoc et.al. | 2505.07019 | null |
2025-05-11 | Image Classification Using a Diffusion Model as a Pre-Training Model | Kosuke Ukita et.al. | 2505.06890 | null |
2025-05-11 | NeuRN: Neuro-inspired Domain Generalization for Image Classification | Hamd Jalil et.al. | 2505.06881 | null |
2025-05-11 | Active Learning for Multi-class Image Classification | Thien Nhan Vo et.al. | 2505.06825 | null |
2025-05-10 | FNBench: Benchmarking Robust Federated Learning against Noisy Labels | Xuefeng Jiang et.al. | 2505.06684 | link |
2025-05-10 | The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification | Arezoo Hatefi et.al. | 2505.06624 | null |
2025-05-09 | Adapting a Segmentation Foundation Model for Medical Image Classification | Pengfei Gu et.al. | 2505.06217 | null |
2025-05-09 | Towards Robust Few-Shot Text Classification Using Transformer Architectures and Dual Loss Strategies | Xu Han et.al. | 2505.06145 | null |
2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | link |
2025-05-09 | Efficient Quantum Convolutional Neural Networks for Image Classification: Overcoming Hardware Constraints | Peter Röseler et.al. | 2505.05957 | null |
2025-05-09 | Achieving 3D Attention via Triplet Squeeze and Excitation Block | Maan Alhazmi et.al. | 2505.05943 | null |
2025-05-09 | Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes | Youngjoon Lee et.al. | 2505.05798 | null |
2025-05-09 | Variational Bayesian Logistic Tensor Regression with Application to Image Recognition | Yunzhi Jin et.al. | 2505.05730 | null |
2025-05-08 | V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models | Guilherme Vieira Neto et.al. | 2505.05659 | link |
2025-05-08 | KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification | Qianbo Zang et.al. | 2505.05583 | link |
2025-05-08 | Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It | Marvin F. da Silva et.al. | 2505.05409 | null |
2025-05-08 | Quantum Surrogate-Driven Image Classifier: A Gradient-Free Approach to Avoid Barren Plateaus | Yichen Xie et.al. | 2505.05249 | null |
2025-05-08 | Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models | Wei Peng et.al. | 2505.05189 | null |
2025-05-08 | CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models | Mengjun Yi et.al. | 2505.05130 | null |
2025-05-08 | Direct Image Classification from Fourier Ptychographic Microscopy Measurements without Reconstruction | Navya Sonal Agarwal et.al. | 2505.05054 | null |
2025-05-07 | ORXE: Orchestrating Experts for Dynamically Configurable Efficiency | Qingyuan Wang et.al. | 2505.04850 | null |
2025-05-07 | Label-efficient Single Photon Images Classification via Active Learning | Zili Zhang et.al. | 2505.04376 | null |
2025-05-07 | FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning | Sanghyeon Park et.al. | 2505.04223 | null |
2025-05-06 | Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment | João Alves et.al. | 2505.03554 | null |
2025-05-06 | Noisy HQNNs: A Comprehensive Analysis of Noise Robustness in Hybrid Quantum Neural Networks | Tasnim Ahmed et.al. | 2505.03378 | null |
2025-05-06 | A Vision-Language Model for Focal Liver Lesion Classification | Song Jian et.al. | 2505.03350 | null |
2025-05-06 | Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices | Tasnim Shahriar et.al. | 2505.03303 | null |
2025-05-06 | Survey of Abstract Meaning Representation: Then, Now, Future | Behrooz Mansouri et.al. | 2505.03229 | null |
2025-05-06 | seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models | Hafez Ghaemi et.al. | 2505.03176 | null |
2025-05-06 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al. | 2505.03134 | null |
2025-05-05 | Bayesian Robust Aggregation for Federated Learning | Aleksandr Karakulev et.al. | 2505.02490 | null |
2025-05-06 | Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets | Wei Liu et.al. | 2505.02118 | null |
2025-05-03 | Backdoor Attacks Against Patch-based Mixture of Experts | Cedric Chan et.al. | 2505.01811 | null |
2025-05-03 | Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge | Florian Schmid et.al. | 2505.01747 | null |
2025-05-03 | CLOG-CD: Curriculum Learning based on Oscillating Granularity of Class Decomposed Medical Image Classification | Asmaa Abbas et.al. | 2505.01741 | null |
2025-05-02 | TActiLE: Tiny Active LEarning for wearable devices | Massimo Pavan et.al. | 2505.01160 | null |
2025-04-30 | Towards Improved Cervical Cancer Screening: Vision Transformer-Based Classification and Interpretability | Khoa Tuan Nguyen et.al. | 2504.21340 | null |
2025-04-28 | AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection | Jianbo Gao et.al. | 2504.21044 | null |
2025-04-29 | Photonic Quantum Convolutional Neural Networks with Adaptive State Injection | Léo Monbroussou et.al. | 2504.20989 | null |
2025-04-30 | DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition | Yanghui Song et.al. | 2504.20948 | link |
2025-04-29 | MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification | Yichu Xu et.al. | 2504.20509 | null |
2025-04-28 | DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes | Junlin Guo et.al. | 2504.20303 | null |
2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null |
2025-04-28 | Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications | Zhilin Lu et.al. | 2504.19806 | null |
2025-04-28 | Explaining Vision GNNs: A Semantic and Visual Analysis of Graph-based Image Classification | Nikolaos Chaidos et.al. | 2504.19682 | null |
2025-04-28 | Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs | Muhammad Sabih et.al. | 2504.19659 | null |
2025-04-28 | Neural network task specialization via domain constraining | Roman Malashin et.al. | 2504.19592 | null |
2025-04-28 | GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability | Sehyeong Jo et.al. | 2504.19414 | null |
2025-04-27 | Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype | Anyong Qin et.al. | 2504.19074 | null |
2025-04-26 | Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting | Zhyar Rzgar K Rostam et.al. | 2504.19021 | null |
2025-04-26 | A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification | Junichiro Niimi et.al. | 2504.18884 | link |
2025-04-26 | IoT Botnet Detection: Application of Vision Transformer to Classification of Network Flow Traffic | Hassan Wasswa et.al. | 2504.18781 | null |
2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
2025-04-25 | Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training | Hiroki Naganuma et.al. | 2504.18454 | null |
2025-04-25 | Passive All-Optical Nonlinear Neuron Activation via PPLN Nanophotonic Waveguides | Wujie Fu et.al. | 2504.18145 | null |
2025-04-25 | DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification | Guohao Huo et.al. | 2504.18046 | null |
2025-04-24 | Disaggregated Deep Learning via In-Physics Computing at Radio Frequency | Zhihui Gao et.al. | 2504.17752 | null |
2025-04-24 | Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction | Farhad Pourkamali-Anaraki et.al. | 2504.17655 | null |
2025-04-24 | Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled yet Hard-to-Learn Samples in Noisy Data | Weiran Pan et.al. | 2504.17474 | null |
2025-04-24 | Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks | Tran Thuy Nga Truong et.al. | 2504.17346 | null |
2025-04-24 | Evaluating and Mitigating Bias in AI-Based Medical Text Generation | Xiuying Chen et.al. | 2504.17279 | null |
2025-04-24 | Group Downsampling with Equivariant Anti-aliasing | Md Ashiqur Rahman et.al. | 2504.17258 | link |
2025-04-24 | Multi-Modal Traffic Analysis: Integrating Time-Series Forecasting, Accident Prediction, and Image Classification | Nivedita M et.al. | 2504.17232 | null |
2025-04-23 | A Diff-Attention Aware State Space Fusion Model for Remote Sensing Classification | Wenping Ma et.al. | 2504.16665 | null |
2025-04-23 | Streetscape Analysis with Generative AI (SAGAI): Vision-Language Assessment and Mapping of Urban Scenes | Joan Perez et.al. | 2504.16538 | null |
2025-04-24 | An Effective Gram Matrix Characterizes Generalization in Deep Networks | Rubing Yang et.al. | 2504.16450 | null |
2025-04-23 | FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing | Hariseetharam Gunduboina et.al. | 2504.16433 | null |
2025-04-22 | CLIP-IT: CLIP-based Pairing for Histology Images Classification | Banafsheh Karimian et.al. | 2504.16181 | null |
2025-04-22 | Automated Bug Report Prioritization in Large Open-Source Projects | Riley Pierson et.al. | 2504.15912 | null |
2025-04-22 | Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases | Modhurita Mitra et.al. | 2504.15829 | null |
2025-04-22 | DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers | Xuyang Zhong et.al. | 2504.15827 | null |
2025-04-22 | HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification | Hongxing Peng et.al. | 2504.15612 | null |
2025-04-22 | LLM-based Semantic Augmentation for Harmful Content Detection | Elyas Meguellati et.al. | 2504.15548 | null |
2025-04-21 | Feeding LLM Annotations to BERT Classifiers at Your Own Risk | Yucheng Lu et.al. | 2504.15432 | null |
2025-04-21 | Dynamic 3D KAN Convolution with Adaptive Grid Optimization for Hyperspectral Image Classification | Guandong Li et.al. | 2504.15155 | null |
2025-04-21 | Application of Sensitivity Analysis Methods for Studying Neural Network Models | Jiaxuan Miao et.al. | 2504.15100 | null |
2025-04-21 | Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks | Keisuke Murota et.al. | 2504.14995 | null |
2025-04-21 | ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages | Zhoujie Qian et.al. | 2504.14825 | null |
2025-04-21 | What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale | Xiaoyong Yuan et.al. | 2504.14815 | null |
2025-04-21 | A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm | Kazuhisa Fujita et.al. | 2504.14814 | null |
2025-04-19 | Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation | Muhammad Haseeb Aslam et.al. | 2504.14307 | null |
2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | null |
2025-04-19 | Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization | Huiyi Chen et.al. | 2504.14200 | null |
2025-04-19 | ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification | Hai Pham-Ngoc et.al. | 2504.14139 | null |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Yang Yue et.al. | 2504.13820 | link |
2025-04-18 | Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis | Zhu Zhu et.al. | 2504.13754 | null |
2025-04-18 | Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration | Gianluca Carloni et.al. | 2504.13717 | null |
2025-04-18 | Word Embedding Techniques for Classification of Star Ratings | Hesham Abdelmotaleb et.al. | 2504.13653 | null |
2025-04-18 | Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification | Pengxiang Gao et.al. | 2504.13608 | null |
2025-04-18 | MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.13574 | null |
2025-04-18 | Bayesian continual learning and forgetting in neural networks | Djohan Bonnet et.al. | 2504.13569 | null |
2025-04-17 | Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2504.13242 | null |
2025-04-17 | Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya et.al. | 2504.13181 | null |
2025-04-17 | Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image Classification | Guandong Li et.al. | 2504.13045 | null |
2025-04-17 | Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification | Reek Majumder et.al. | 2504.12644 | null |
2025-04-16 | GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image Classification | Thanh-Dung Le et.al. | 2504.12484 | null |
2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | null |
2025-04-16 | Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision | Linhao Qu et.al. | 2504.12132 | null |
2025-04-16 | Exploring Video-Based Driver Activity Recognition under Noisy Labels | Linjuan Fan et.al. | 2504.11966 | link |
2025-04-17 | Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification | Yue Li et.al. | 2504.11793 | null |
2025-04-15 | The Pontryagin Maximum Principle for Training Convolutional Neural Networks | Sebastian Hofmann et.al. | 2504.11647 | null |
2025-04-15 | Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey | Siteng Ma et.al. | 2504.11588 | null |
2025-04-15 | Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models | Gergely D. Németh et.al. | 2504.11216 | null |
2025-04-15 | Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification | Zhenyu Yang et.al. | 2504.10916 | null |
2025-04-15 | Progressive Rock Music Classification | Arpan Nagar et.al. | 2504.10821 | null |
2025-04-15 | 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification | Guandong Li et.al. | 2504.10795 | null |
2025-04-14 | Quantum Image Classification: Experiments on Utility-Scale Quantum Computers | Hrant Gharibyan et.al. | 2504.10595 | null |
2025-04-14 | LEMUR Neural Network Dataset: Towards Seamless AutoML | Arash Torabi Goodarzi et.al. | 2504.10552 | null |
2025-04-13 | An Efficient Quantum Classifier Based on Hamiltonian Representations | Federico Tiblias et.al. | 2504.10542 | null |
2025-04-14 | Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning | LeiLei Ma et.al. | 2504.09990 | null |
2025-04-14 | GFT: Gradient Focal Transformer | Boris Kriuk et.al. | 2504.09852 | null |
2025-04-13 | PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification | Pengfei Wang et.al. | 2504.09502 | null |
2025-04-13 | InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection | Lin Zhu et.al. | 2504.09448 | null |
2025-04-13 | Sparse Deformable Mamba for Hyperspectral Image Classification | Lincoln Linlin Xu et.al. | 2504.09446 | null |
2025-04-12 | Cycle Training with Semi-Supervised Domain Adaptation: Bridging Accuracy and Efficiency for Real-Time Mobile Scene Detection | Huu-Phong Phan-Nguyen et.al. | 2504.09297 | null |
2025-04-12 | Sparse Hybrid Linear-Morphological Networks | Konstantinos Fotopoulos et.al. | 2504.09289 | null |
2025-04-12 | Mixture of Group Experts for Learning Invariant Representations | Lei Kang et.al. | 2504.09265 | null |
2025-04-12 | Langformers: Unified NLP Pipelines for Language Models | Rabindra Lamsal et.al. | 2504.09170 | null |
2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | null |
2025-04-11 | Hypergraph Vision Transformers: Images are More than Nodes, More than Edges | Joshua Fixelle et.al. | 2504.08710 | null |
2025-04-11 | Integrated ensemble of BERT- and features-based models for authorship attribution in Japanese literary works | Taisei Kanda et.al. | 2504.08527 | null |
2025-04-11 | An Early Experience with Confidential Computing Architecture for On-Device Model Protection | Sina Abdollahi et.al. | 2504.08508 | null |
2025-04-11 | The inherent convolution property of quantum neural networks | Guangkai Qu et.al. | 2504.08487 | null |
2025-04-11 | A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification | Kerol Djoumessi et.al. | 2504.08481 | null |
2025-04-11 | FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations | Cheng-Yu Hsieh et.al. | 2504.08368 | null |
2025-04-11 | Comparative Analysis of Different Methods for Classifying Polychromatic Sketches | Fahd Baba et.al. | 2504.08186 | null |
2025-04-11 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | null |
2025-04-10 | Traversal Learning Coordination For Lossless And Efficient Distributed Learning | Erdenebileg Batbaatar et.al. | 2504.07471 | null |
2025-04-09 | Identifying regions of interest in whole slide images of renal cell carcinoma | Mohammed Lamine Benomar et.al. | 2504.07313 | null |
2025-04-09 | A new training approach for text classification in Mental Health: LatentGLoss | Korhan Sevinç et.al. | 2504.07245 | null |
2025-04-09 | Deep Learning for Cardiovascular Risk Assessment: Proxy Features from Carotid Sonography as Predictors of Arterial Damage | Christoph Balada et.al. | 2504.06680 | null |
2025-04-08 | Memory-Modular Classification: Learning to Generalize with Memory Replacement | Dahyun Kang et.al. | 2504.06021 | null |
2025-04-08 | Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-Gradients | Alessio Mora et.al. | 2504.05822 | null |
2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
2025-04-08 | Layer-Aware Embedding Fusion for LLMs in Text Classifications | Jiho Gwak et.al. | 2504.05764 | null |
2025-04-07 | REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding | Sakib Reza et.al. | 2504.05491 | null |
2025-04-07 | Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability | Mohammad Hossein Najafi et.al. | 2504.05483 | null |
2025-04-07 | Explaining Low Perception Model Competency with High-Competency Counterfactuals | Sara Pohland et.al. | 2504.05254 | null |
2025-04-07 | Federated Learning for Medical Image Classification: A Comprehensive Benchmark | Zhekai Zhou et.al. | 2504.05238 | null |
2025-04-07 | Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data | Charco Hui et.al. | 2504.05020 | null |
2025-04-07 | RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Congcong Wen et.al. | 2504.04988 | null |
2025-04-06 | Your Image Generator Is Your New Private Dataset | Nicolo Resmini et.al. | 2504.04582 | null |
2025-04-06 | Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification | Shijian Wang et.al. | 2504.04510 | null |
2025-04-06 | Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image Classification | Guandong Li et.al. | 2504.04463 | null |
2025-04-05 | A Comparative Study of Explainable AI Methods: Model-Agnostic vs. Model-Specific Approaches | Keerthi Devireddy et.al. | 2504.04276 | null |
2025-04-05 | GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models | Hengyu Luo et.al. | 2504.04155 | null |
2025-04-05 | Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology Images | Andrei-Alexandru Preda et.al. | 2504.04130 | null |
2025-04-04 | Adaptive Classification of Interval-Valued Time Series | Wan Tian et.al. | 2504.03318 | null |
2025-04-04 | Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Junlang Qian et.al. | 2504.03159 | null |
2025-04-03 | HQViT: Hybrid Quantum Vision Transformer for Image Classification | Hui Zhang et.al. | 2504.02730 | null |
2025-04-03 | LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection | YiMing Yu et.al. | 2504.02280 | null |
2025-04-02 | Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs | Lewis Matheson Creed et.al. | 2504.02163 | null |
2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | link |
2025-04-02 | A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning | Yuyang Qiu et.al. | 2504.01839 | null |
2025-04-02 | A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines | Calvin Kinateder et.al. | 2504.01798 | null |
2025-04-02 | Token Pruning in Audio Transformers: Optimizing Performance and Decoding Patch Importance | Taehan Lee et.al. | 2504.01690 | link |
2025-04-02 | All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning | Zheng Yang et.al. | 2504.01396 | null |
2025-04-01 | TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification | Kimia haghjooei et.al. | 2504.01228 | null |
2025-04-01 | PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image Classification | Salim Khazem et.al. | 2504.01214 | link |
2025-04-01 | Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | Rachmad Vidya Wicaksana Putra et.al. | 2504.00957 | null |
2025-04-01 | Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models | Alireza Aghabagherloo et.al. | 2504.00638 | null |
2025-04-01 | Geometric Median Matching for Robust k-Subset Selection from Noisy Data | Anish Acharya et.al. | 2504.00564 | null |
2025-03-31 | NoProp: Training Neural Networks without Back-propagation or Forward-propagation | Qinyu Li et.al. | 2503.24322 | null |
2025-03-31 | CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | Yingrui Ji et.al. | 2503.24182 | null |
2025-03-31 | PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization | Alexis Guichemerre et.al. | 2503.24135 | link |
2025-03-31 | Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | Chenqi Guo et.al. | 2503.24017 | null |
2025-03-31 | FlexiMo: A Flexible Remote Sensing Foundation Model | Xuyang Li et.al. | 2503.23844 | null |
2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | link |
2025-03-31 | WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation | Zhengyi Zhao et.al. | 2503.23673 | null |
2025-03-30 | Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Guandong Li et.al. | 2503.23472 | null |
2025-03-30 | KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters | Haiduo Huang et.al. | 2503.23379 | link |
2025-03-29 | Optimizing Distributed Training Approaches for Scaling Neural Networks | Vishnu Vardhan Baligodugula et.al. | 2503.23186 | null |
2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205 | link |
2025-03-28 | Route-and-Aggregate Decentralized Federated Learning Under Communication Errors | Weicai Li et.al. | 2503.22186 | null |
2025-03-27 | On Large Multimodal Models as Open-World Image Classifiers | Alessandro Conti et.al. | 2503.21851 | link |
2025-03-27 | Bayesian Pseudo Posterior Mechanism for Differentially Private Machine Learning | Robert Chew et.al. | 2503.21528 | null |
2025-03-27 | Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures | Deependra Singh et.al. | 2503.21465 | link |
2025-03-27 | Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records | Noah Losch et.al. | 2503.21349 | null |
2025-03-27 | Improving |
Mario García-Márquez et.al. | 2503.21244 | link |
2025-03-27 | Neural Architecture Search by Learning a Hierarchical Search Space | Mehraveh Javan Roshtkhari et.al. | 2503.21061 | null |
2025-03-26 | TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models | Caspar Meijer et.al. | 2503.20952 | link |
2025-03-26 | VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers | Ching-Yao Chen et.al. | 2503.20246 | null |
2025-03-26 | BeLightRec: A lightweight recommender system enhanced with BERT | Manh Mai Van et.al. | 2503.20206 | null |
2025-03-25 | Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders | Paul Koch et.al. | 2503.19947 | null |
2025-03-25 | Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Daniel G. P. Petrini et.al. | 2503.19945 | null |
2025-03-25 | Extensions of regret-minimization algorithm for optimal design | Youguang Chen et.al. | 2503.19874 | null |
2025-03-25 | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | Suhas G Hegde et.al. | 2503.19530 | null |
2025-03-25 | LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Weizhi Chen et.al. | 2503.19311 | null |
2025-03-25 | Face Spoofing Detection using Deep Learning | Najeebullah et.al. | 2503.19223 | link |
2025-03-24 | Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | DeShin Hwa et.al. | 2503.18862 | null |
2025-03-24 | Latent Space Class Dispersion: Effective Test Data Quality Assessment for DNNs | Vivek Vekariya et.al. | 2503.18799 | null |
2025-03-24 | Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks | Nina Shvetsova et.al. | 2503.18637 | null |
2025-03-24 | Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Zequn Zeng et.al. | 2503.18483 | null |
2025-03-24 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Junsong Li et.al. | 2503.18432 | null |
2025-03-24 | Sun-Shine: A Large Language Model for Tibetan Culture | Cheng Huang et.al. | 2503.18288 | null |
2025-03-23 | Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry | Chi-Ning Chou et.al. | 2503.18114 | null |
2025-03-23 | What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images | Dongheng Lin et.al. | 2503.17899 | null |
2025-03-21 | Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis | Gary Y. Li et.al. | 2503.17475 | null |
2025-03-21 | Leveraging Text-to-Image Generation for Handling Spurious Correlation | Aryan Yazdan Parast et.al. | 2503.17226 | null |
2025-03-21 | CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images | Tonmoy Hossain ana Miaomiao Zhang et.al. | 2503.17162 | null |
2025-03-21 | Beyond Accuracy: What Matters in Designing Well-Behaved Models? | Robin Hesse et.al. | 2503.17110 | null |
2025-03-21 | Symbolic Audio Classification via Modal Decision Tree Learning | Enrico Marzano et.al. | 2503.17018 | null |
2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975 | null |
2025-03-21 | City2Scene: Improving Acoustic Scene Classification with City Features | Yiqiang Cai et.al. | 2503.16862 | null |
2025-03-20 | MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification | Moshiur Rahman Tonmoy et.al. | 2503.16628 | null |
2025-03-20 | PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Sharon Peled et.al. | 2503.16284 | link |
2025-03-20 | CLS-RL: Image Classification with Rule-Based Reinforcement Learning | Ming Li et.al. | 2503.16188 | null |
2025-03-20 | Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models | Mario Sanz-Guerrero et.al. | 2503.16022 | link |
2025-03-20 | Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation | Clive Tinashe Marimo et.al. | 2503.15969 | null |
2025-03-19 | Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification | Yuqing Zhang et.al. | 2503.15731 | null |
2025-03-20 | Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification | ZhengLin Lai et.al. | 2503.15469 | link |
2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | null |
2025-03-19 | Efficient allocation of image recognition and LLM tasks on multi-GPU system | Marcin Lawenda et.al. | 2503.15252 | null |
2025-03-19 | Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks | Yuting Guo et.al. | 2503.15169 | null |
2025-03-19 | ARC: Anchored Representation Clouds for High-Resolution INR Classification | Joost Luijmes et.al. | 2503.15156 | null |
2025-03-19 | Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models | Tingxiu Chen et.al. | 2503.14966 | null |
2025-03-19 | Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification | Zhong Ji et.al. | 2503.14938 | null |
2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | null |
2025-03-18 | LipShiFT: A Certifiably Robust Shift-based Vision Transformer | Rohan Menon et.al. | 2503.14751 | null |
2025-03-18 | Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | Gihan Jayatilaka et.al. | 2503.14500 | null |
2025-03-17 | Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition | Atharva Agashe et.al. | 2503.13763 | null |
2025-03-17 | Micro Text Classification Based on Balanced Positive-Unlabeled Learning | Lin-Han Jia et.al. | 2503.13562 | null |
2025-03-17 | Escaping Plato's Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes | Nhi Pham et.al. | 2503.13429 | null |
2025-03-17 | Do Vision Models Develop Human-Like Progressive Difficulty Understanding? | Zeyi Huang et.al. | 2503.13058 | null |
2025-03-16 | Domain Generalization for Improved Human Activity Recognition in Office Space Videos Using Adaptive Pre-processing | Partho Ghosh et.al. | 2503.12678 | null |
2025-03-16 | Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance | Anthony Lamelas et.al. | 2503.12617 | null |
2025-03-16 | Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy | Jian-Ping Mei et.al. | 2503.12497 | null |
2025-03-16 | GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Zilun Zhang et.al. | 2503.12490 | null |
2025-03-16 | Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation | Edgar Heinert et.al. | 2503.12453 | null |
2025-03-16 | MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jianwei Zhao et.al. | 2503.12401 | null |
2025-03-15 | TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Ans Munir et.al. | 2503.12206 | null |
2025-03-15 | Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification | Ahcen Aliouat et.al. | 2503.11954 | null |
2025-03-14 | Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification | Tobias Morocutti et.al. | 2503.11363 | null |
2025-03-14 | PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models | Mayank Nautiyal et.al. | 2503.11360 | null |
2025-03-14 | APLA: A Simple Adaptation Method for Vision Transformers | Moein Sorkhei et.al. | 2503.11335 | null |
2025-03-14 | Open-Set Plankton Recognition | Joona Kareinen et.al. | 2503.11318 | null |
2025-03-14 | MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery | Yansheng Li et.al. | 2503.11219 | null |
2025-03-14 | Falcon: A Remote Sensing Vision-Language Foundation Model | Kelu Yao et.al. | 2503.11070 | null |
2025-03-13 | Juan Felipe Gomez et.al. | 2503.10945 | null | |
2025-03-13 | Learning Interpretable Logic Rules from Deep Vision Models | Chuqin Geng et.al. | 2503.10547 | null |
2025-03-13 | Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification | Rajiv Krishnakumar et.al. | 2503.10510 | null |
2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | link |
2025-03-13 | PS3C: An Ensemble-Based Two-Step Framework for Classification of Pep Smear Cell Images | Theo Di Piazza et.al. | 2503.10312 | link |
2025-03-13 | Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions | Hsuvas Borkakoty et.al. | 2503.10294 | null |
2025-03-13 | A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2503.10262 | null |
2025-03-13 | Interpretable Image Classification via Non-parametric Part Prototype Learning | Zhijie Zhu et.al. | 2503.10247 | null |
2025-03-13 | Multiplicative Learning | Han Kim et.al. | 2503.10144 | null |
2025-03-13 | Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text | Avinash Patil et.al. | 2503.10095 | null |
2025-03-13 | Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild | Damien Teney et.al. | 2503.10065 | null |
2025-03-12 | Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Nannan Wu et.al. | 2503.09587 | null |
2025-03-12 | Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework | Bakary Badjie et.al. | 2503.09504 | null |
2025-03-12 | ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation | Tobias Christian Nauen et.al. | 2503.09399 | null |
2025-03-12 | Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Daniel Jiménez-López et.al. | 2503.09365 | null |
2025-03-12 | Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Katharina Prasse et.al. | 2503.09361 | null |
2025-03-12 | Bayesian Test-Time Adaptation for Vision-Language Models | Lihua Zhou et.al. | 2503.09248 | null |
2025-03-12 | Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information | Youngju Joung et.al. | 2503.09068 | null |
2025-03-12 | Discovering Influential Neuron Path in Vision Transformers | Yifan Wang et.al. | 2503.09046 | null |
2025-03-11 | KAN-Mixers: a new deep learning architecture for image classification | Jorge Luiz dos Santos Canuto et.al. | 2503.08939 | null |
2025-03-12 | MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification | Jiangping Wen et.al. | 2503.08581 | null |
2025-03-11 | Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview | Ahmad Chaddad et.al. | 2503.08420 | null |
2025-03-11 | Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification | Susu Sun et.al. | 2503.08384 | null |
2025-03-11 | Tangentially Aligned Integrated Gradients for User-Friendly Explanations | Lachlan Simpson et.al. | 2503.08240 | null |
2025-03-11 | EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification | Saad Sohail et.al. | 2503.08239 | null |
2025-03-11 | Identification of Star Clusters in M31 from PAndAS Images Based on Deep Learning | Baisong Zhang et.al. | 2503.08130 | null |
2025-03-11 | LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking | Yan Yan et.al. | 2503.07968 | null |
2025-03-12 | Measuring directional bias amplification in image captions using predictability | Rahul Nair et.al. | 2503.07878 | null |
2025-03-10 | Fair Text Classification via Transferable Representations | Thibaud Leteno et.al. | 2503.07691 | null |
2025-03-10 | Keeping Representation Similarity in Finetuning for Medical Image Analysis | Wenqiang Zu et.al. | 2503.07399 | null |
2025-03-10 | Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification | Kexin Di et.al. | 2503.07396 | null |
2025-03-10 | Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Gonzalo Mancera et.al. | 2503.07384 | null |
2025-03-10 | Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | Thomas Boucher et.al. | 2503.07294 | null |
2025-03-10 | A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | Bingchen Liu et.al. | 2503.07202 | null |
2025-03-10 | Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization | Ziqing Xu et.al. | 2503.06982 | null |
2025-03-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al. | 2503.06921 | null |
2025-03-10 | MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | Xiangyan Qu et.al. | 2503.06847 | null |
2025-03-09 | Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | Hanze Li et.al. | 2503.06473 | null |
2025-03-09 | M |
Mingxiang Cao et.al. | 2503.06446 | null |
2025-03-07 | Similarity-Based Domain Adaptation with LLMs | Jie He et.al. | 2503.05281 | null |
2025-03-07 | Spatial Context-Driven Positive Pair Sampling for Enhanced Histopathology Image Classification | Willmer Rafell Quinones Robles et.al. | 2503.05170 | null |
2025-03-07 | Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy | Ruixi Lin et.al. | 2503.05157 | null |
2025-03-07 | Grouped Sequential Optimization Strategy -- the Application of Hyperparameter Importance Assessment in Deep Learning | Ruinan Wang et.al. | 2503.05106 | null |
2025-03-06 | HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model | Xuheng Cai et.al. | 2503.04996 | null |
2025-03-06 | Label Distribution Learning-Enhanced Dual-KNN for Text Classification | Bo Yuan et.al. | 2503.04869 | null |
2025-03-06 | Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification | Van Bach Nguyen et.al. | 2503.04463 | null |
2025-03-06 | WeakSupCon: Weakly Supervised Contrastive Learning for Encoder Pre-training | Bodong Zhang et.al. | 2503.04165 | null |
2025-03-04 | Measurement noise scaling laws for cellular representation learning | Gokul Gowri et.al. | 2503.02726 | null |
2025-03-04 | XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification | Xiaoyu Zheng et.al. | 2503.02619 | null |
2025-03-04 | Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques | Mustafa Majeed Abd Zaid et.al. | 2503.02510 | null |
2025-03-06 | Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Yujiao Yang et.al. | 2503.02495 | null |
2025-03-04 | Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts | Tong Liang et.al. | 2503.02248 | null |
2025-03-04 | Sharpness-Aware Minimization: General Analysis and Improved Rates | Dimitris Oikonomou et.al. | 2503.02225 | null |
2025-03-03 | Mathematical Foundation of Interpretable Equivariant Surrogate Models | Jacopo Joy Colombini et.al. | 2503.01942 | null |
2025-03-03 | Visual-RFT: Visual Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2503.01785 | link |
2025-03-03 | Mamba base PKD for efficient knowledge compression | José Medina et.al. | 2503.01727 | null |
2025-03-04 | SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting | Ali Caglayan et.al. | 2503.01181 | null |
2025-03-03 | Large Language Models for Healthcare Text Classification: A Systematic Review | Hajar Sakai et.al. | 2503.01159 | null |
2025-03-03 | Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning | Jiuyang Dong et.al. | 2502.21130 | null |
2025-02-28 | Comparative study of the ansätze in quantum language models | Jordi Del Castillo et.al. | 2502.20744 | null |
2025-02-28 | Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness | Hao Xuan et.al. | 2502.20604 | null |
2025-02-27 | In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models | Hu Wang et.al. | 2502.20516 | null |
2025-02-27 | Online Meta-learning for AutoML in Real-time (OnMAR) | Mia Gerber et.al. | 2502.20279 | null |
2025-03-03 | Gradient-Guided Annealing for Domain Generalization | Aristotelis Ballas et.al. | 2502.20162 | link |
2025-02-27 | QPM: Discrete Optimization for Globally Interpretable Image Classification | Thomas Norrenbrock et.al. | 2502.20130 | link |
2025-02-27 | ProAPO: Progressively Automatic Prompt Optimization for Visual Classification | Xiangyan Qu et.al. | 2502.19844 | link |
2025-02-27 | Text classification using machine learning methods | Bogdan Oancea et.al. | 2502.19801 | null |
2025-02-27 | InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models | Shuchang Zhou et.al. | 2502.19777 | null |
2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | null |
2025-02-27 | Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | Yimin Zhu et.al. | 2502.19700 | null |
2025-02-27 | Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification | Yimin Zhu et.al. | 2502.19699 | null |
2025-02-27 | A Residual Multi-task Network for Joint Classification and Regression in Medical Imaging | Junji Lin et.al. | 2502.19692 | null |
2025-02-26 | I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | Stephan Rabanser et.al. | 2502.19335 | null |
2025-02-26 | Active Few-Shot Learning for Text Classification | Saeed Ahmadnia et.al. | 2502.18782 | null |
2025-02-25 | Enhancing Image Classification with Augmentation: Data Augmentation Techniques for Improved Image Classification | Saorj Kumar et.al. | 2502.18691 | null |
2025-02-25 | Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT | Hediyeh Baban et.al. | 2502.18653 | null |
2025-02-25 | MedKAN: An Advanced Kolmogorov-Arnold Network for Medical Image Classification | Zhuoqin Yang et.al. | 2502.18416 | null |
2025-02-26 | A Fusion Model for Art Author Identification Based on Convolutional Neural Networks and Transformers | Zhenyu Wang et.al. | 2502.18083 | null |
2025-02-25 | MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification | Varun Vashisht et.al. | 2502.17987 | null |
2025-02-25 | Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification | Rong Liu et.al. | 2502.17879 | null |
2025-02-24 | Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? | Sushmita Sarker et.al. | 2502.17727 | null |
2025-02-24 | A Priori Generalizability Estimate for a CNN | Cito Balsells et.al. | 2502.17622 | null |
2025-02-24 | Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models | Andrew DiGiugno et.al. | 2502.17206 | null |
2025-02-24 | Disentangling Visual Transformers: Patch-level Interpretability for Image Classification | Guillaume Jeanneret et.al. | 2502.17196 | null |
2025-02-24 | Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Chenghao Fan et.al. | 2502.16894 | null |
2025-02-24 | Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data | Yejian Zhang et.al. | 2502.16892 | null |
2025-02-24 | A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | Dewan Tauhid Rahman et.al. | 2502.16762 | null |
2025-02-23 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
2025-02-22 | MOB-GCN: A Novel Multiscale Object-Based Graph Neural Network for Hyperspectral Image Classification | Tuan-Anh Yang et.al. | 2502.16289 | link |
2025-02-22 | A Multi-Scale Isolation Forest Approach for Real-Time Detection and Filtering of FGSM Adversarial Attacks in Video Streams of Autonomous Vehicles | Richard Abhulimhen et.al. | 2502.16044 | null |
2025-02-21 | MMRAG: Multi-Mode Retrieval-Augmented Generation with Large Language Models for Biomedical In-Context Learning | Zaifu Zhan et.al. | 2502.15954 | null |
2025-02-21 | Directional Gradient Projection for Robust Fine-Tuning of Foundation Models | Chengyue Huang et.al. | 2502.15895 | null |
2025-02-21 | MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models | Suraj Racha et.al. | 2502.15418 | null |
2025-02-21 | A Novel Riemannian Sparse Representation Learning Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.15302 | null |
2025-02-21 | Quantum autoencoders for image classification | Hinako Asaoka et.al. | 2502.15254 | null |
2025-02-21 | Steganographic Embeddings as an Effective Data Augmentation | Nicholas DiSalvo et.al. | 2502.15245 | null |
2025-02-21 | Learning to Collaborate: A Capability Vectors-based Architecture for Adaptive Human-AI Decision Making | Renlong Jie et.al. | 2502.15196 | null |
2025-02-21 | TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba | Xiuwei Chen et.al. | 2502.15130 | null |
2025-02-20 | Fundamental Survey on Neuromorphic Based Audio Classification | Amlan Basu et.al. | 2502.15056 | null |
2025-02-20 | Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Maha Ezzelarab et.al. | 2502.14995 | null |
2025-02-20 | Sparse Activations as Conformal Predictors | Margarida M. Campos et.al. | 2502.14773 | link |
2025-02-20 | An Enhancement of Jiang, Z., et al.s Compression-Based Classification Algorithm Applied to News Article Categorization | Sean Lester C. Benavides et.al. | 2502.14444 | null |
2025-02-20 | Stochastic Resonance Improves the Detection of Low Contrast Images in Deep Learning Models | Siegfried Ludwig et.al. | 2502.14442 | null |
2025-02-20 | Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Artem Vazhentsev et.al. | 2502.14427 | null |
2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
2025-02-20 | QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification | Hajar Sakai et.al. | 2502.14189 | null |
2025-02-19 | Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification | Xuansheng Wu et.al. | 2502.14133 | null |
2025-02-19 | Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention | Omid Nejati Manzari et.al. | 2502.13693 | link |
2025-02-18 | Language Models Can Predict Their Own Behavior | Dhananjay Ashok et.al. | 2502.13329 | null |
2025-02-18 | Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models | Sirisha Velampalli et.al. | 2502.13278 | null |
2025-02-18 | Private Text Generation by Seeding Large Language Model Prompts | Supriya Nagesh et.al. | 2502.13193 | null |
2025-02-18 | RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals | Jaemu Heo et.al. | 2502.13181 | null |
2025-02-18 | Benchmarking MedMNIST dataset on real quantum hardware | Gurinder Singh et.al. | 2502.13056 | null |
2025-02-18 | Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts | Sunay Joshi et.al. | 2502.13030 | null |
2025-02-18 | A Survey of Text Classification Under Class Distribution Shift | Adriana Valentina Costache et.al. | 2502.12965 | null |
2025-02-18 | Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Andrei Jarca et.al. | 2502.12953 | null |
2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
2025-02-18 | When Segmentation Meets Hyperspectral Image: New Paradigm for Hyperspectral Image Classification | Weilian Zhou et.al. | 2502.12541 | null |
2025-02-17 | Achieving Upper Bound Accuracy of Joint Training in Continual Learning | Saleh Momeni et.al. | 2502.12388 | null |
2025-02-17 | OCT Data is All You Need: How Vision Transformers with and without Pre-training Benefit Imaging | Zihao Han et.al. | 2502.12379 | null |
2025-02-17 | AdaSplash: Adaptive Sparse Flash Attention | Nuno Gonçalves et.al. | 2502.12082 | null |
2025-02-17 | Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Aurian Quelennec et.al. | 2502.12031 | null |
2025-02-17 | Text Classification in the LLM Era - Where do we stand? | Sowmya Vajjala et.al. | 2502.11830 | null |
2025-02-17 | Variable-frame CNNLSTM for Breast Nodule Classification using Ultrasound Videos | Xiangxiang Cui et.al. | 2502.11481 | null |
2025-02-16 | Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | Thanushon Sivakaran et.al. | 2502.11258 | null |
2025-02-16 | UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation | Arka Mukherjee et.al. | 2502.11132 | null |
2025-02-16 | Towards Achieving Concept Completeness for Unsupervised Textual Concept Bottleneck Models | Milan Bhan et.al. | 2502.11100 | null |
2025-02-16 | Leveraging Large Language Models for Cybersecurity: Enhancing SMS Spam Detection with Robust and Context-Aware Text Classification | Mohsen Ahmadi et.al. | 2502.11014 | null |
2025-02-15 | Simulations of Common Unsupervised Domain Adaptation Algorithms for Image Classification | Ahmad Chaddad et.al. | 2502.10694 | null |
2025-02-15 | REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation | Ran Li et.al. | 2502.10663 | null |
2025-02-14 | Simplifying DINO via Coding Rate Regularization | Ziyang Wu et.al. | 2502.10385 | null |
2025-02-14 | Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network | Arun Kunwar et.al. | 2502.10334 | null |
2025-02-14 | SeWA: Selective Weight Average via Probabilistic Masking | Peng Wang et.al. | 2502.10119 | null |
2025-02-14 | On Space Folds of ReLU Neural Networks | Michal Lewandowski et.al. | 2502.09954 | null |
2025-02-13 | A CNN Approach to Automated Detection and Classification of Brain Tumors | Md. Zahid Hasan et.al. | 2502.09731 | null |
2025-02-13 | GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis | Angelos Zavras et.al. | 2502.09598 | link |
2025-02-14 | Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering | Mark Beliaev et.al. | 2502.09573 | null |
2025-02-13 | Feature-based Graph Attention Networks Improve Online Continual Learning | Adjovi Sim et.al. | 2502.09143 | null |
2025-02-13 | A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning | Jia Gao et.al. | 2502.09086 | null |
2025-02-13 | Hierarchical Vision Transformer with Prototypes for Interpretable Medical Image Classification | Luisa Gallée et.al. | 2502.08997 | null |
2025-02-13 | Quantum Approaches for Dysphonia Assessment in Small Speech Datasets | Ha Tran et.al. | 2502.08968 | null |
2025-02-12 | Measuring Diversity in Synthetic Datasets | Yuchang Zhu et.al. | 2502.08512 | null |
2025-02-12 | ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Jiangbo Shi et.al. | 2502.08391 | null |
2025-02-12 | Keep your distance: learning dispersed embeddings on |
Evgeniia Tokarchuk et.al. | 2502.08231 | null |
2025-02-12 | Riemannian Complex Hermit Positive Definite Convolution Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.08137 | null |
2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
2025-02-12 | Can Machine Learning Support the Selection of Studies for Systematic Literature Review Updates? | Marcelo Costalonga et.al. | 2502.08050 | null |
2025-02-11 | ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans | Ashkan Shahbazi et.al. | 2502.07962 | null |
2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | null |
2025-02-11 | MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks | Lotfi Abdelkrim Mecharbat et.al. | 2502.07422 | null |
2025-02-11 | MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Anh-Tien Nguyen et.al. | 2502.07409 | null |
2025-02-11 | Don't Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification | Peipei Wei et.al. | 2502.07165 | null |
2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | null |
2025-02-10 | Krum Federated Chain (KFC): Using blockchain to defend against adversarial attacks in Federated Learning | Mario García-Márquez et.al. | 2502.06917 | null |
2025-02-10 | Enhancing Performance of Explainable AI Models with Constrained Concept Refinement | Geyu Liang et.al. | 2502.06775 | null |
2025-02-10 | Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments | Marc Felix Brinner et.al. | 2502.06551 | null |
2025-02-10 | Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2502.06427 | null |
2025-02-10 | Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead | Won-Jun Jang et.al. | 2502.06349 | null |
2025-02-10 | From Pixels to Components: Eigenvector Masking for Visual Representation Learning | Alice Bizeul et.al. | 2502.06314 | null |
2025-02-10 | Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation | Lingkun Luo et.al. | 2502.06272 | null |
2025-02-10 | Multi-Scale Transformer Architecture for Accurate Medical Image Classification | Jiacheng Hu et.al. | 2502.06243 | null |
2025-02-10 | Low Tensor-Rank Adaptation of Kolmogorov--Arnold Networks | Yihang Gao et.al. | 2502.06153 | null |
2025-02-09 | Benchmarking Prompt Sensitivity in Large Language Models | Amirhossein Razavi et.al. | 2502.06065 | null |
2025-02-09 | ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification | Yashwanth M. et.al. | 2502.05923 | null |
2025-02-07 | Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights | Ondřej Týbl et.al. | 2502.04975 | null |
2025-02-07 | Enhancing Disinformation Detection with Explainable AI and Named Entity Replacement | Santiago González-Silot et.al. | 2502.04863 | null |
2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
2025-02-06 | Augmented Conditioning Is Enough For Effective Training Image Generation | Jiahui Chen et.al. | 2502.04475 | null |
2025-02-06 | How does a Multilingual LM Handle Multiple Languages? | Santhosh Kakarla et.al. | 2502.04269 | null |
2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | null |
2025-02-06 | Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis | Juming Xiong et.al. | 2502.04199 | null |
2025-02-06 | Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis | Lin Yuan et.al. | 2502.03843 | null |
2025-02-06 | Self-Supervised Learning for Solar Radio Spectrum Classification | Siqi Li et.al. | 2502.03778 | null |
2025-02-06 | Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free | Gian Mario Favero et.al. | 2502.03687 | null |
2025-02-05 | A Study in Dataset Distillation for Image Super-Resolution | Tobias Dietz et.al. | 2502.03656 | null |
2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | null |
2025-02-05 | Clinically-Inspired Hierarchical Multi-Label Classification of Chest X-rays with a Penalty-Based Loss Function | Mehrdad Asadi et.al. | 2502.03591 | link |
2025-02-05 | Optimal Task Order for Continual Learning of Multiple Tasks | Ziyan Li et.al. | 2502.03350 | null |
2025-02-05 | Out-of-Distribution Detection using Synthetic Data Generation | Momin Abbas et.al. | 2502.03323 | null |
2025-02-05 | Long-tailed Medical Diagnosis with Relation-aware Representation Learning and Iterative Classifier Calibration | Li Pan et.al. | 2502.03238 | null |
2025-02-05 | Adversarial Dependence Minimization | Pierre-François De Plaen et.al. | 2502.03227 | null |
2025-02-05 | Disentangling CLIP Features for Enhanced Localized Understanding | Samyak Rawelekar et.al. | 2502.02977 | null |
2025-02-05 | Slowing Learning by Erasing Simple Features | Lucia Quirke et.al. | 2502.02820 | null |
2025-02-04 | The Skin Game: Revolutionizing Standards for AI Dermatology Model Comparison | Łukasz Miętkiewicz et.al. | 2502.02500 | null |
2025-02-04 | BRIDLE: Generalized Self-supervised Learning with Quantization | Hoang M. Nguyen et.al. | 2502.02118 | null |
2025-02-04 | DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification | Weijia Cao et.al. | 2502.01986 | null |
2025-02-04 | Generative Data Mining with Longtail-Guided Diffusion | David S. Hayden et.al. | 2502.01980 | null |
2025-02-03 | A Multi-Scale Feature Fusion Framework Integrating Frequency Domain and Cross-View Attention for Dual-View X-ray Security Inspections | Shilong Hong et.al. | 2502.01710 | null |
2025-02-03 | Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss | Sangyeon Park et.al. | 2502.01342 | null |
2025-02-03 | A Framework for Double-Blind Federated Adaptation of Foundation Models | Nurbek Tastan et.al. | 2502.01289 | null |
2025-02-02 | Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications | Yixin Wu et.al. | 2502.00808 | null |
2025-02-02 | Enhanced Convolutional Neural Networks for Improved Image Classification | Xiaoran Yang et.al. | 2502.00663 | null |
2025-02-01 | Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing | Saarthak Kapse et.al. | 2502.00594 | null |
2025-01-31 | Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach | Yingdan Shi et.al. | 2501.19403 | null |
2025-01-31 | An All-digital 65-nm Tsetlin Machine Image Classification Accelerator with 8.6 nJ per MNIST Frame at 60.3k Frames per Second | Svein Anders Tunheim et.al. | 2501.19347 | null |
2025-01-31 | Through the Looking Glass: LLM-Based Analysis of AR/VR Android Applications Privacy Policies | Abdulaziz Alghamdi et.al. | 2501.19223 | null |
2025-01-31 | Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification | Xiangyu Sun et.al. | 2501.19086 | null |
2025-01-31 | Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin et.al. | 2501.18824 | null |
2025-01-30 | OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization | Kelvin Kan et.al. | 2501.18793 | null |
2025-01-29 | Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis | Kunrong Li et.al. | 2501.17598 | null |
2025-01-28 | Extending Information Bottleneck Attribution to Video Sequences | Veronika Solopova et.al. | 2501.16889 | link |
2025-01-28 | Misspellings in Natural Language Processing: A survey | Gianluca Sperduti et.al. | 2501.16836 | null |
2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
2025-01-28 | Toward Relative Positional Encoding in Spiking Transformers | Changze Lv et.al. | 2501.16745 | null |
2025-01-28 | Improving Interpretability and Accuracy in Neuro-Symbolic Rule Extraction Using Class-Specific Sparse Filters | Parth Padalkar et.al. | 2501.16677 | null |
2025-01-27 | Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM | Payal Kamboj et.al. | 2501.16481 | link |
2025-01-28 | SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP | Li Pang et.al. | 2501.16222 | link |
2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
2025-01-27 | Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data | Judith Sáinz-Pardo Díaz et.al. | 2501.15949 | null |
2025-01-26 | Quantum-Enhanced Attention Mechanism in NLP: A Hybrid Classical-Quantum Approach | S. M. Yousuf Iqbal Tomal et.al. | 2501.15630 | null |
2025-01-26 | Building Efficient Lightweight CNN Models | Nathan Isong et.al. | 2501.15547 | null |
2025-01-26 | Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition | Ying Zheng et.al. | 2501.15519 | null |
2025-01-26 | Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer | Hu Hu et.al. | 2501.15496 | null |
2025-01-25 | Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | Yu Qiao et.al. | 2501.15257 | null |
2025-01-24 | Feasible Learning | Juan Ramirez et.al. | 2501.14912 | link |
2025-01-24 | Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST | Fuping Wu et.al. | 2501.14685 | null |
2025-01-24 | Geometric Mean Improves Loss For Few-Shot Learning | Tong Wu et.al. | 2501.14593 | null |
2025-01-24 | Idiom Detection in Sorani Kurdish Texts | Skala Kamaran Omer et.al. | 2501.14528 | null |
2025-01-24 | Guobin Shen et.al. | 2501.14484 | null | |
2025-01-24 | Impact of Batch Normalization on Convolutional Network Representations | Hermanus L. Potgieter et.al. | 2501.14441 | null |
2025-01-24 | Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation | Tasnim Ahmed et.al. | 2501.14412 | null |
2025-01-24 | Correlation-Based Band Selection for Hyperspectral Image Classification | Dibyabha Deb et.al. | 2501.14338 | link |
2025-01-24 | Relative Layer-Wise Relevance Propagation: a more Robust Neural Networks eXplaination | Eric Nyiri et.al. | 2501.14322 | null |
2025-01-24 | A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques | Lifu Gao et.al. | 2501.14288 | null |
2025-01-24 | TLXML: Task-Level Explanation of Meta-Learning via Influence Functions | Yoshihiro Mitsuka et.al. | 2501.14271 | null |
2025-01-23 | A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference | Duc Hau Nguyen et.al. | 2501.13735 | null |
2025-01-23 | A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification | Younes Yousef et.al. | 2501.13598 | link |
2025-01-23 | Multi-Level Attention and Contrastive Learning for Enhanced Text Classification with an Optimized Transformer | Jia Gao et.al. | 2501.13467 | null |
2025-01-23 | Atmospheric Noise-Resilient Image Classification in a Real-World Scenario: Using Hybrid CNN and Pin-GTSVM | Shlok Mehendale et.al. | 2501.13422 | null |
2025-01-23 | AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning | Arpit Garg et.al. | 2501.13389 | null |
2025-01-23 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | null |
2025-01-22 | Revisiting Data Augmentation for Ultrasound Images | Adam Tupper et.al. | 2501.13193 | link |
2025-01-22 | Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation | Duc Hau Nguyen et.al. | 2501.12775 | link |
2025-01-22 | Estimating the Conformal Prediction Threshold from Noisy Labels | Coby Penso et.al. | 2501.12749 | link |
2025-01-22 | Adapting OpenAI's CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples | Fadel M. Megahed et.al. | 2501.12596 | null |
2025-01-21 | Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor | Jiaqi Guo et.al. | 2501.12524 | null |
2025-01-21 | CCESAR: Coastline Classification-Extraction From SAR Images Using CNN-U-Net Combination | Vidhu Arora et.al. | 2501.12384 | null |
2025-01-21 | CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification | Cristiano Patrício et.al. | 2501.12266 | null |
2025-01-21 | Early Detection and Classification of Breast Cancer Using Deep Learning Techniques | Mst. Mumtahina Labonno et.al. | 2501.12217 | null |
2025-01-21 | UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model | Branislava Jankovic et.al. | 2501.12087 | null |
2025-01-20 | Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification | Jonas Klotz et.al. | 2501.11493 | null |
2025-01-22 | QGAIC: Quantum Inspired Genetic Algorithm for Image Classification | Akhilesh Kumar Singh et.al. | 2501.11477 | null |
2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
2025-01-20 | KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jiaxiang Liu et.al. | 2501.11231 | link |
2025-01-19 | CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning | William Doherty et.al. | 2501.11140 | link |
2025-01-19 | Leveraging counterfactual concepts for debugging and improving CNN model performance | Syed Ali Tariq et.al. | 2501.11087 | null |
2025-01-17 | A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features | Enes Karanfil et.al. | 2501.10144 | null |
2025-01-17 | Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification | Michael Schulze et.al. | 2501.10089 | null |
2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | link |
2025-01-16 | Empirical Evaluation of Embedding Models in the Context of Text Classification in Document Review in Construction Delay Disputes | Fusheng Wei et.al. | 2501.09859 | null |
2025-01-16 | SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification | Yuexi Du et.al. | 2501.09753 | link |
2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | link |
2025-01-16 | Multimodal Marvels of Deep Learning in Medical Diagnosis: A Comprehensive Review of COVID-19 Detection | Md Shofiqul Islama et.al. | 2501.09506 | link |
2025-01-16 | HydraMix: Multi-Image Feature Mixing for Small Data Image Classification | Christoph Reinders et.al. | 2501.09504 | null |
2025-01-16 | Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments | Minh K. Quan et.al. | 2501.09394 | null |
2025-01-16 | Shape-Based Single Object Classification Using Ensemble Method Classifiers | Nur Shazwani Kamarudin et.al. | 2501.09311 | null |
2025-01-16 | Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning | Harrison Fuller et.al. | 2501.09294 | null |
2025-01-16 | A Simple Graph Contrastive Learning Framework for Short Text Classification | Yonghao Liu et.al. | 2501.09219 | link |
2025-01-16 | Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning | Yonghao Liu et.al. | 2501.09214 | link |
2025-01-15 | Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment | Conrad Borchers et.al. | 2501.09126 | null |
2025-01-15 | IDEA: Image Description Enhanced CLIP-Adapter | Zhipeng Ye et.al. | 2501.08816 | null |
2025-01-15 | MIAFEx: An Attention-based Feature Extraction Method for Medical Image Classification | Oscar Ramos-Soto et.al. | 2501.08562 | null |
2025-01-14 | Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time | Mihai Masala et.al. | 2501.08460 | null |
2025-01-14 | Large Language Models For Text Classification: Case Study And Comprehensive Review | Arina Kostina et.al. | 2501.08457 | null |
2025-01-14 | READ: Reinforcement-based Adversarial Learning for Text Classification with Limited Labeled Data | Rohit Sharma et.al. | 2501.08035 | null |
2025-01-14 | Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins | Ilker Oguz et.al. | 2501.07991 | null |
2025-01-14 | deepTerra -- AI Land Classification Made Easy | Andrew Keith Wilkinson et.al. | 2501.07859 | null |
2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
2025-01-14 | Balance Divergence for Knowledge Distillation | Yafei Qi et.al. | 2501.07804 | null |
2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | link |
2025-01-13 | Universal Training of Neural Networks to Achieve Bayes Optimal Classification Accuracy | Mohammadreza Tavasoli Naeini et.al. | 2501.07754 | null |
2025-01-13 | Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction | Paul Melki et.al. | 2501.07185 | null |
2025-01-13 | Adaptive Noise-Tolerant Network for Image Segmentation | Weizhi Li et.al. | 2501.07163 | null |
2025-01-12 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier | Haojun Yu et.al. | 2501.06862 | link |
2025-01-12 | Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures | Samia Mehnaz et.al. | 2501.06740 | null |
2025-01-12 | Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution | Ashitha Mudraje et.al. | 2501.06720 | null |
2025-01-11 | Synthetic Feature Augmentation Improves Generalization Performance of Language Models | Ashok Choudhary et.al. | 2501.06434 | null |
2025-01-10 | Kolmogorov-Arnold networks for metal surface defect classification | Maciej Krzywda et.al. | 2501.06389 | null |
2025-01-10 | Merging Feed-Forward Sublayers for Compressed Transformers | Neha Verma et.al. | 2501.06126 | link |
2025-01-10 | Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problems | Steffen Dereich et.al. | 2501.06081 | link |
2025-01-10 | Constrained Over-the-Air Model Updating for Wireless Online Federated Learning with Delayed Information | Juncheng Wang et.al. | 2501.05637 | null |
2025-01-10 | The Impact of Model Scaling on Seen and Unseen Language Performance | Rhitabrat Pokharel et.al. | 2501.05629 | null |
2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
2025-01-09 | Spatial Information Integration in Small Language Models for Document Layout Generation and Classification | Pablo Melendez et.al. | 2501.05497 | null |
2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453 | null |
2025-01-09 | A 1Mb mixed-precision quantized encoder for image classification and patch-based compression | Van Thien Nguyen et.al. | 2501.05097 | null |
2025-01-09 | A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field | Ziyang Gao et.al. | 2501.04996 | null |
2025-01-09 | MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification | Yapeng Li et.al. | 2501.04944 | null |
2025-01-09 | A New Perspective on Privacy Protection in Federated Learning with Granular-Ball Computing | Guannan Lai et.al. | 2501.04940 | link |
2025-01-09 | ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries | Keke Huang et.al. | 2501.04901 | null |
2025-01-09 | Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Seyed Amir Bidaki et.al. | 2501.04897 | link |
2025-01-08 | Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures | Ziyuan Huang et.al. | 2501.04700 | null |
2025-01-08 | Discrete Wavelet Transform-Based Capsule Network for Hyperspectral Image Classification | Zhiqiang Gao et.al. | 2501.04643 | null |
2025-01-08 | Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images | Yuze Wang et.al. | 2501.04283 | null |
2025-01-08 | Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection | Jimi Togni et.al. | 2501.04196 | null |
2025-01-07 | Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification | Satchel French et.al. | 2501.03967 | link |
2025-01-07 | Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback | Jiakang Yuan et.al. | 2501.03916 | null |
2025-01-07 | MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | Aadya Arora et.al. | 2501.03839 | null |
2025-01-07 | LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | Shubhr Singh et.al. | 2501.03464 | null |
2025-01-06 | FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification | Keyvan RahimiZadeh et.al. | 2501.03349 | link |
2025-01-06 | CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets | Tanay Agrawal et.al. | 2501.03332 | null |
2025-01-06 | Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective | Md. Jalal Uddin Chowdhury et.al. | 2501.03305 | null |
2025-01-06 | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | Muyun Li et.al. | 2501.03162 | null |
2025-01-06 | Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification | Yubo Wang et.al. | 2501.02844 | null |
2025-01-06 | TARDiS : Text Augmentation for Refining Diversity and Separability | Kyungmin Kim et.al. | 2501.02739 | null |
2025-01-05 | FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models | Hui Lin et.al. | 2501.02461 | null |
2025-01-04 | Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 | Umesh Yadav et.al. | 2501.02147 | null |
2025-01-03 | A Separable Self-attention Inspired by the State Space Model for Computer Vision | Juntao Zhang et.al. | 2501.02040 | link |
2025-01-03 | Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model | Haixu Liu et.al. | 2501.01611 | null |
2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422 | null |
2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | link |
2025-01-02 | Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers | Bohang Sun et.al. | 2501.01311 | null |
2025-01-02 | FAST: Fast Audio Spectrogram Transformer | Anugunj Naman et.al. | 2501.01104 | null |
2025-01-01 | A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia | Hirthik Mathesh GV et.al. | 2501.00876 | null |
2025-01-01 | Ensuring superior learning outcomes and data security for authorized learner | Jeongho Bang et.al. | 2501.00754 | null |
2024-12-31 | TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification | Nishit Anand et.al. | 2501.00398 | null |
2024-12-31 | Exploring Variability in Fine-Tuned Models for Text Classification with DistilBERT | Giuliano Lorenzoni et.al. | 2501.00241 | null |
2024-12-30 | The Text Classification Pipeline: Starting Shallow going Deeper | Marco Siino et.al. | 2501.00174 | null |
2024-12-30 | Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models | Christos Petridis et.al. | 2412.21022 | null |
2024-12-30 | FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Zhengdong Li et.al. | 2412.20974 | null |
2024-12-30 | Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes | Yang Chen et.al. | 2412.20918 | null |
2024-12-30 | UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language Models | Yujie Li et.al. | 2412.20742 | null |
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-29 | Hilbert Curve Based Molecular Sequence Analysis | Sarwan Ali et.al. | 2412.20616 | null |
2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
2024-12-29 | HindiLLM: Large Language Model for Hindi | Sanjay Chouhan et.al. | 2412.20357 | null |
2024-12-29 | Deep Learning in Image Classification: Evaluating VGG19's Performance on Complex Visual Data | Weijie He et.al. | 2412.20345 | null |
2024-12-28 | Few-shot Algorithm Assurance | Dang Nguyen et.al. | 2412.20275 | null |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | null |
2024-12-27 | Enhancing Fine-grained Image Classification through Attentive Batch Training | Duy M. Le et.al. | 2412.19606 | null |
2024-12-27 | A Comparative Study of Machine Unlearning Techniques for Image and Text Classification Models | Omar M. Safa et.al. | 2412.19583 | null |
2024-12-27 | Multi-label Classification using Deep Multi-order Context-aware Kernel Networks | Mingyuan Jiu et.al. | 2412.19491 | null |
2024-12-27 | Residual Feature-Reutilization Inception Network for Image Classification | Yuanpeng He et.al. | 2412.19433 | null |
2024-12-27 | An In-Depth Analysis of Adversarial Discriminative Domain Adaptation for Digit Classification | Eugene Choi et.al. | 2412.19391 | link |
2024-12-26 | Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components | Tengxue Zhang et.al. | 2412.19085 | null |
2024-12-26 | Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability | Ruixi Lin et.al. | 2412.19018 | null |
2024-12-25 | Injecting Bias into Text Classification Models using Backdoor Attacks | A. Dilara Yavuz et.al. | 2412.18975 | null |
2024-12-25 | Research Experiment on Multi-Model Comparison for Chinese Text Classification Tasks | JiaCheng Li et.al. | 2412.18908 | null |
2024-12-24 | VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Shicheng Yin et.al. | 2412.18178 | link |
2024-12-24 | Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Francois Chaubard et.al. | 2412.18052 | null |
2024-12-23 | Explainability in Neural Networks for Natural Language Processing Tasks | Melkamu Mersha et.al. | 2412.18036 | null |
2024-12-23 | COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning | Arnav M. Das et.al. | 2412.17684 | null |
2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
2024-12-23 | Domain-Incremental Learning for Audio Classification | Manjunath Mulimani et.al. | 2412.17424 | null |
2024-12-23 | An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification | Andre Rusli et.al. | 2412.17361 | link |
2024-12-23 | DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2412.17350 | link |
2024-12-22 | Survey on Abstractive Text Summarization: Dataset, Models, and Metrics | Gospel Ozioma Nnadi et.al. | 2412.17165 | link |
2024-12-22 | LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning | Fanshuang Kong et.al. | 2412.16963 | link |
2024-12-22 | Predicting the Reliability of an Image Classifier under Image Distortion | Dang Nguyen et.al. | 2412.16881 | null |
2024-12-21 | Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification | Changchang Sun et.al. | 2412.16780 | null |
2024-12-21 | UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning | Long Zhou et.al. | 2412.16739 | link |
2024-12-20 | Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks | Enis Baty et.al. | 2412.16146 | null |
2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
2024-12-20 | A Thorough Investigation into the Application of Deep CNN for Enhancing Natural Language Processing Capabilities | Chang Weng et.al. | 2412.15900 | null |
2024-12-20 | Continual Learning Using a Kernel-Based Method Over Foundation Models | Saleh Momeni et.al. | 2412.15571 | link |
2024-12-19 | Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models | Tianchen Zhang et.al. | 2412.15431 | null |
2024-12-19 | Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers | Zhu Liao et.al. | 2412.15077 | null |
2024-12-18 | Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | Anna Scius-Bertrand et.al. | 2412.13859 | null |
2024-12-18 | Modelling Multi-modal Cross-interaction for ML-FSIC Based on Local Feature Selection | Kun Yan et.al. | 2412.13732 | null |
2024-12-18 | MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency | Fatemeh Froughirad et.al. | 2412.13703 | null |
2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
2024-12-17 | Token-Level Graphs for Short Text Classification | Gregor Donabauer et.al. | 2412.12754 | link |
2024-12-17 | Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification | Nan Li et.al. | 2412.12744 | link |
2024-12-17 | ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Wangyu Xue et.al. | 2412.12675 | null |
2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
2024-12-19 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guangwenjie Zou et.al. | 2412.12603 | link |
2024-12-17 | Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling | Iman Khazrak et.al. | 2412.12532 | link |
2024-12-16 | Gramian Multimodal Representation Learning and Alignment | Giordano Cicchetti et.al. | 2412.11959 | null |
2024-12-16 | The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification | Ahmad Hassanpour et.al. | 2412.11951 | null |
2024-12-16 | Does VLM Classification Benefit from LLM Description Semantics? | Pingchuan Ma et.al. | 2412.11917 | link |
2024-12-16 | Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning | RunLin Yu et.al. | 2412.11715 | null |
2024-12-16 | LMM-Regularized CLIP Embeddings for Image Classification | Maria Tzelepi et.al. | 2412.11663 | null |
2024-12-16 | Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning | Dipanwita Thakur et.al. | 2412.11660 | null |
2024-12-16 | CNNtention: Can CNNs do better with Attention? | Julian Glattki et.al. | 2412.11657 | null |
2024-12-16 | Explicit and Implicit Graduated Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11501 | link |
2024-12-16 | Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models | Zaifu Zhan et.al. | 2412.11455 | null |
2024-12-16 | Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11400 | null |
2024-12-13 | Robust image classification with multi-modal large language models | Francesco Villani et.al. | 2412.10353 | null |
2024-12-13 | MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
2024-12-13 | Label-template based Few-Shot Text Classification with Contrastive Learning | Guanghua Hou et.al. | 2412.10110 | null |
2024-12-13 | Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification | Zi Yang et.al. | 2412.10091 | link |
2024-12-13 | Low-Resource Fast Text Classification Based on Intra-Class and Inter-Class Distance Calculation | Yanxu Mao et.al. | 2412.09922 | null |
2024-12-12 | DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations | Wenhao Hu et.al. | 2412.09687 | null |
2024-12-12 | Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis | Raj Hansini Khoiwal et.al. | 2412.09445 | null |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation | Davor Vukadin et.al. | 2412.09311 | link |
2024-12-13 | An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Chunxiao Li et.al. | 2412.09063 | null |
2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | null |
2024-12-12 | Stochastic Learning of Non-Conjugate Variational Posterior for Image Classification | Kart-Leong Lim et.al. | 2412.08951 | null |
2024-12-11 | BDA: Bangla Text Data Augmentation Framework | Md. Tariquzzaman et.al. | 2412.08753 | null |
2024-12-11 | Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning | Hang Zhao et.al. | 2412.08587 | null |
2024-12-11 | ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts | Sinan Du et.al. | 2412.08341 | null |
2024-12-11 | Online training and pruning of photonic neural networks | Jiawei Zhang et.al. | 2412.08184 | null |
2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
2024-12-11 | Concept Bottleneck Large Language Models | Chung-En Sun et.al. | 2412.07992 | link |
2024-12-10 | FastDDS-Based Middleware System for Remote X-Ray Image Classification Using Raspberry Pi | Omar H. Khater et.al. | 2412.07818 | null |
2024-12-10 | Leveraging Content and Context Cues for Low-Light Image Enhancement | Igor Morawski et.al. | 2412.07693 | link |
2024-12-10 | Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Ahmed Luqman et.al. | 2412.07391 | null |
2024-12-10 | Image Classification Using Singular Value Decomposition and Optimization | Isabela M. Yepes et.al. | 2412.07288 | link |
2024-12-10 | An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications | Kayne Uriel K. Rodrigo et.al. | 2412.07182 | null |
2024-12-09 | Convolution goes higher-order: a biologically inspired mechanism empowers image classification | Simone Azeglio et.al. | 2412.06740 | null |
2024-12-09 | Impact of Privacy Parameters on Deep Learning Models for Image Classification | Basanta Chaulagain et.al. | 2412.06689 | null |
2024-12-10 | Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy | Min Zeng et.al. | 2412.06575 | null |
2024-12-09 | How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning | Yuanyuan Wang et.al. | 2412.06451 | null |
2024-12-09 | Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models | Zhen Qi et.al. | 2412.06249 | null |
2024-12-08 | Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis | Yuemei Ren et.al. | 2412.06075 | null |
2024-12-08 | Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Joohyuk Park et.al. | 2412.06038 | null |
2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link |
2024-12-06 | MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents | Avaneesh Devkota et.al. | 2412.04847 | null |
2024-12-05 | Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Shaunak Halbe et.al. | 2412.04429 | link |
2024-12-05 | FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning | Pranab Sahoo et.al. | 2412.04416 | link |
2024-12-05 | Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation | Ilán Carretero et.al. | 2412.04260 | null |
2024-12-05 | Demonstration Selection for In-Context Learning via Reinforcement Learning | Xubin Wang et.al. | 2412.03966 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-05 | Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification | Zhu Han et.al. | 2412.03897 | null |
2024-12-05 | Dual-Branch Subpixel-Guided Network for Hyperspectral Image Classification | Zhu Han et.al. | 2412.03893 | link |
2024-12-04 | Language Model Meets Prototypes: Towards Interpretable Text Classification Models through Prototypical Networks | Ximing Wen et.al. | 2412.03761 | null |
2024-12-05 | Continual Low-Rank Scaled Dot-product Attention | Ginés Carreto Picón et.al. | 2412.03214 | null |
2024-12-04 | Multi-Level Correlation Network For Few-Shot Image Classification | Yunkai Dang et.al. | 2412.03159 | link |
2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
2024-12-04 | Higher Order Transformers: Efficient Attention Mechanism for Tensor Structured Data | Soroush Omranpour et.al. | 2412.02919 | null |
2024-12-03 | Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing | Nanyang Ye et.al. | 2412.02779 | null |
2024-12-03 | Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning | Zhaozhi Wang et.al. | 2412.02759 | null |
2024-12-03 | Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Jinjin Cai et.al. | 2412.02531 | null |
2024-12-04 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
2024-12-03 | Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model | Xi Cao et.al. | 2412.02343 | null |
2024-12-03 | Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval | Leah Bar et.al. | 2412.02310 | link |
2024-12-03 | A Classic-Quantum Hybrid Network Framework: CQH-Net | Ao Liu et.al. | 2412.02059 | null |
2024-12-02 | PROFIT: A PROximal FIne Tuning Optimizer for Multi-Task Learning | Anirudh S Chakravarthy et.al. | 2412.01930 | null |
2024-12-02 | Concept Based Continuous Prompts for Interpretable Text Classification | Qian Chen et.al. | 2412.01644 | link |
2024-12-02 | NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers | Angel Yahir Loredo Lopez et.al. | 2412.01621 | null |
2024-12-02 | Explaining the Unexplained: Revealing Hidden Correlations for Better Interpretability | Wen-Dong Jiang et.al. | 2412.01365 | null |
2024-12-02 | Class Distance Weighted Cross Entropy Loss for Classification of Disease Severity | Gorkem Polat et.al. | 2412.01246 | null |
2024-11-29 | LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Taja Kuzman et.al. | 2411.19638 | link |
2024-11-29 | FairDD: Fair Dataset Distillation via Synchronized Matching | Qihang Zhou et.al. | 2411.19623 | null |
2024-11-29 | Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency | Akshaya Rajesh et.al. | 2411.19611 | null |
2024-11-29 | Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising | Md. Touhidul Islam et.al. | 2411.19549 | link |
2024-11-28 | CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Mohamed Fazli Imam et.al. | 2411.19346 | link |
2024-11-28 | Quantum Neural Networks in Practice: A Comparative Study with Classical Models from Standard Data Sets to Industrial Images | Daniel Basilewitsch et.al. | 2411.19276 | null |
2024-11-28 | Controlling Participation in Federated Learning with Feedback | Michael Cummins et.al. | 2411.19242 | null |
2024-11-28 | Introducing Three New Benchmark Datasets for Hierarchical Text Classification | Jaco du Toit et.al. | 2411.19119 | null |
2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | null |
2024-11-27 | Fall Leaf Adversarial Attack on Traffic Sign Classification | Anthony Etim et.al. | 2411.18776 | null |
2024-11-27 | Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | Aoran Shen et.al. | 2411.18622 | null |
2024-11-27 | Pruning Deep Convolutional Neural Network Using Conditional Mutual Information | Tien Vu-Van et.al. | 2411.18578 | null |
2024-11-27 | Mixture of Experts in Image Classification: What's the Sweet Spot? | Mathurin Videau et.al. | 2411.18322 | null |
2024-11-27 | KANs for Computer Vision: An Experimental Study | Karthik Mohan et.al. | 2411.18224 | null |
2024-11-27 | Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2411.18115 | link |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-27 | Optimized Tradeoffs for Private Prediction with Majority Ensembling | Shuli Jiang et.al. | 2411.17965 | null |
2024-11-26 | What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics | Jordan J. Bird et.al. | 2411.17593 | null |
2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | link |
2024-11-26 | SpikeAtConv: An Integrated Spiking-Convolutional Attention Architecture for Energy-Efficient Neuromorphic Vision Processing | Wangdan Liao et.al. | 2411.17439 | null |
2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
2024-11-26 | BadScan: An Architectural Backdoor Attack on Visual State Space Models | Om Suhas Deshmukh et.al. | 2411.17283 | null |
2024-11-26 | An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models | Yunzhe Hu et.al. | 2411.17182 | null |
2024-11-25 | Contrastive Multi-graph Learning with Neighbor Hierarchical Sifting for Semi-supervised Text Classification | Wei Ai et.al. | 2411.16787 | null |
2024-11-25 | A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports | Gabriel Okasa et.al. | 2411.16662 | link |
2024-11-25 | Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models | Donggeun Ko et.al. | 2411.16079 | null |
2024-11-24 | Context-Aware Detection of Mixed Critical Events using Video Classification | Filza Akhlaq et.al. | 2411.15773 | null |
2024-11-23 | MUNBa: Machine Unlearning via Nash Bargaining | Jing Wu et.al. | 2411.15537 | null |
2024-11-23 | Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Zhiying Li et.al. | 2411.15439 | null |
2024-11-22 | MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs | Chaoyou Fu et.al. | 2411.15296 | null |
2024-11-21 | CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning | Marco Paul E. Apolinario et.al. | 2411.15235 | null |
2024-11-21 | BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models | Taha Koleilat et.al. | 2411.15232 | null |
2024-11-22 | FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification | Zhengrui Guo et.al. | 2411.14743 | link |
2024-11-21 | Adaptable Embeddings Network (AEN) | Stan Loosmore et.al. | 2411.13786 | null |
2024-11-20 | Hierarchical Text Classification (HTC) vs. eXtreme Multilabel Classification (XML): Two Sides of the Same Medal | Nerijus Bertalis et.al. | 2411.13687 | link |
2024-11-20 | Combining Autoregressive and Autoencoder Language Models for Text Classification | João Gonçalves et.al. | 2411.13282 | link |
2024-11-20 | MEGL: Multimodal Explanation-Guided Learning | Yifei Zhang et.al. | 2411.13053 | null |
2024-11-19 | Problem-dependent convergence bounds for randomized linear gradient compression | Thomas Flynn et.al. | 2411.12898 | null |
2024-11-19 | Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs | Ahmed Akib Jawad Karim et.al. | 2411.12712 | null |
2024-11-22 | STREAM: A Universal State-Space Model for Sparse Geometric Data | Mark Schöne et.al. | 2411.12603 | null |
2024-11-19 | AdaCM |
Yuanbin Man et.al. | 2411.12593 | null |
2024-11-19 | Zero-Shot Crate Digging: DJ Tool Retrieval Using Speech Activity, Music Structure And CLAP Embeddings | Iroro Orife et.al. | 2411.12209 | link |
2024-11-19 | Invariant Shape Representation Learning For Image Classification | Tonmoy Hossain et.al. | 2411.12201 | link |
2024-11-19 | Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification | Yuyang Xiao et.al. | 2411.12151 | null |
2024-11-18 | Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning | Arundhati S. Shanbhag et.al. | 2411.12073 | link |
2024-11-18 | Vision Language Models Are Few-Shot Audio Spectrogram Classifiers | Satvik Dixit et.al. | 2411.12058 | null |
2024-11-18 | Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging | Milad Masroor et.al. | 2411.11939 | null |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-16 | MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Yuhong Chou et.al. | 2411.10741 | null |
2024-11-16 | Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image | Jiawen Li et.al. | 2411.10709 | null |
2024-11-16 | Multi-perspective Contrastive Logit Distillation | Qi Wang et.al. | 2411.10693 | null |
2024-11-15 | Vision Eagle Attention: A New Lens for Advancing Image Classification | Mahmudul Hasan et.al. | 2411.10564 | link |
2024-11-15 | On the Cost of Model-Serving Frameworks: An Experimental Evaluation | Pasquale De Rosa et.al. | 2411.10337 | null |
2024-11-15 | Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin | Youngjoon Lee et.al. | 2411.10212 | link |
2024-11-15 | Outliers resistant image classification by anomaly detection | Anton Sergeev et.al. | 2411.10150 | null |
2024-11-15 | Adapting the Biological SSVEP Response to Artificial Neural Networks | Emirhan Böge et.al. | 2411.10084 | null |
2024-11-15 | Evidential Federated Learning for Skin Lesion Image Classification | Rutger Hendrix et.al. | 2411.10071 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | ResidualDroppath: Enhancing Feature Reuse over Residual Connections | Sejik Park et.al. | 2411.09475 | null |
2024-11-14 | SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers | Shravan Venkatraman et.al. | 2411.09420 | null |
2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | link |
2024-11-13 | Computed tomography using meta-optics | Maksym Zhelyeznuyakov et.al. | 2411.08995 | null |
2024-11-13 | CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt | Mohammad Mahdi Mohajeri et.al. | 2411.08979 | null |
2024-11-13 | ScaleNet: Scale Invariance Learning in Directed Graphs | Qin Jiang et.al. | 2411.08758 | link |
2024-11-13 | Efficient Whole Slide Image Classification through Fisher Vector Representation | Ravi Kant Gupta et.al. | 2411.08530 | null |
2024-11-12 | HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification | Cheng Jin et.al. | 2411.07660 | null |
2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | null |
2024-11-11 | The Inherent Adversarial Robustness of Analog In-Memory Computing | Corey Lammie et.al. | 2411.07023 | null |
2024-11-11 | ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Jiawei Fan et.al. | 2411.06786 | link |
2024-11-11 | A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts | Liu Zhuoxian et.al. | 2411.06772 | null |
2024-11-11 | Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision | Yueyang Cang et.al. | 2411.06727 | null |
2024-11-10 | Deep Active Learning in the Open World | Tian Xie et.al. | 2411.06353 | null |
2024-11-09 | Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs | Shan Zhong et.al. | 2411.06175 | null |
2024-11-09 | AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | Zhiyu Zhu et.al. | 2411.06146 | null |
2024-11-09 | Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures | Evgeny Sedov et.al. | 2411.06124 | null |
2024-11-09 | Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning | Yuanxiu Wang et.al. | 2411.06100 | null |
2024-11-08 | GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification | Priya Mishra et.al. | 2411.05991 | link |
2024-11-08 | FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information | Shreen Gul et.al. | 2411.05752 | link |
2024-11-08 | Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification | Antonio De Santis et.al. | 2411.05698 | null |
2024-11-08 | Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2411.05603 | null |
2024-11-08 | Training objective drives the consistency of representational similarity across datasets | Laure Ciernik et.al. | 2411.05561 | link |
2024-11-08 | Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach | Gideon Yoffe et.al. | 2411.04950 | null |
2024-11-07 | Attention Masks Help Adversarial Attacks to Bypass Safety Detectors | Yunfan Shi et.al. | 2411.04772 | link |
2024-11-07 | Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks | Sanja Karilanova et.al. | 2411.04760 | null |
2024-11-07 | Is network fragmentation a useful complexity measure? | Coenraad Mouton et.al. | 2411.04695 | null |
2024-11-07 | DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models | Zijian Zhang et.al. | 2411.04649 | null |
2024-11-07 | Neural Fingerprints for Adversarial Attack Detection | Haim Fisher et.al. | 2411.04533 | link |
2024-11-06 | Multimodal Structure-Aware Quantum Data Processing | Hala Hawashin et.al. | 2411.04242 | null |
2024-11-06 | RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Maya Varma et.al. | 2411.04097 | link |
2024-11-06 | Overcoming label shift in targeted federated learning | Edvin Listo Zec et.al. | 2411.03799 | null |
2024-11-06 | Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization | Yuhao He et.al. | 2411.03752 | null |
2024-11-05 | Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification | Zhang Qixiang et.al. | 2411.03041 | null |
2024-11-06 | Confidence Calibration of Classifiers with Many Classes | Adrien LeCoz et.al. | 2411.02988 | link |
2024-11-05 | Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization | Pengkun Jiao et.al. | 2411.02920 | null |
2024-11-05 | ADOPT: Modified Adam Can Converge with Any |
Shohei Taniguchi et.al. | 2411.02853 | link |
2024-11-05 | Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion | Yaowen Hu et.al. | 2411.02734 | null |
2024-11-06 | Wave Network: An Ultra-Small Language Model | Xin Zhang et.al. | 2411.02674 | null |
2024-11-04 | FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification | Bidisha Chakraborty et.al. | 2411.02637 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-04 | A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification | Sorouralsadat Fatemi et.al. | 2411.02476 | null |
2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-03 | Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification | Vaneeta Ahlawat et.al. | 2411.01652 | null |
2024-11-03 | ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis | Xinyu Geng et.al. | 2411.01564 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-11-02 | Combining Financial Data and News Articles for Stock Price Movement Prediction Using Large Language Models | Ali Elahi et.al. | 2411.01368 | null |
2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
2024-11-02 | MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN | Nafiz Fahad et.al. | 2411.01163 | null |
2024-11-02 | Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement | Bryan Bo Cao et.al. | 2411.01099 | link |
2024-11-01 | Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning | Yuqing Zhou et.al. | 2411.01045 | null |
2024-11-01 | FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection | Simon Gutwein et.al. | 2411.01025 | link |
2024-10-31 | Video Token Merging for Long-form Video Understanding | Seon-Ho Lee et.al. | 2410.23782 | null |
2024-10-31 | Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 | Weijie Ke et.al. | 2410.23776 | null |
2024-10-31 | QUEST-A: Untrained Filtering with Trained Focusing led to Enhanced Quantum Architectures | Lian-Hui Yu et.al. | 2410.23560 | link |
2024-11-01 | Large Language Models for Patient Comments Multi-Label Classification | Hajar Sakai et.al. | 2410.23528 | null |
2024-10-30 | Multilingual Vision-Language Pre-training for the Remote Sensing Domain | João Daniel Silva et.al. | 2410.23370 | null |
2024-10-30 | Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks | Axel Klawonn et.al. | 2410.23359 | null |
2024-10-30 | CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP | Tianyu Yang et.al. | 2410.23330 | null |
2024-10-30 | Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification | Debjyoti Saharoy et.al. | 2410.23066 | null |
2024-10-30 | Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers | Lam Nguyen Tung et.al. | 2410.22663 | null |
2024-10-29 | Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm | Zaniar Sharifi et.al. | 2410.22487 | null |
2024-10-29 | EfficientNet with Hybrid Attention Mechanisms for Enhanced Breast Histopathology Classification: A Comprehensive Approach | Naren Sengodan et.al. | 2410.22392 | null |
2024-10-29 | DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R. Menon et.al. | 2410.22239 | null |
2024-10-29 | Class-Aware Contrastive Optimization for Imbalanced Text Classification | Grigorii Khvatskii et.al. | 2410.22197 | null |
2024-10-29 | Active Learning for Vision-Language Models | Bardia Safaei et.al. | 2410.22187 | null |
2024-10-29 | Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets | Adrian Iordache et.al. | 2410.22184 | link |
2024-10-29 | Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review | Muhammad Bilal et.al. | 2410.22180 | null |
2024-10-29 | FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection | Dat Nguyen et.al. | 2410.21964 | null |
2024-10-29 | Bayesian Optimization for Hyperparameters Tuning in Neural Networks | Gabriele Onorato et.al. | 2410.21886 | null |
2024-10-29 | Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning | Yinyi Lai et.al. | 2410.21872 | null |
2024-10-28 | Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks | Noel Elias et.al. | 2410.21561 | null |
2024-10-30 | A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth | Noel Elias et.al. | 2410.21557 | null |
2024-10-28 | Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models | Piotr Przybyła et.al. | 2410.20940 | null |
2024-10-28 | Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning | Bing Han et.al. | 2410.20775 | null |
2024-10-28 | Interpretable Image Classification with Adaptive Prototype-based Vision Transformers | Chiyu Ma et.al. | 2410.20722 | null |
2024-10-27 | Graph Neural Networks on Discriminative Graphs of Words | Yassine Abbahaddou et.al. | 2410.20469 | null |
2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | null |
2024-10-27 | Sequential Large Language Model-Based Hyper-Parameter Optimization | Kanan Mahammadli et.al. | 2410.20302 | link |
2024-10-26 | Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search | Akhilbaran Ghosh et.al. | 2410.20234 | null |
2024-10-26 | Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Adit Jain et.al. | 2410.20041 | null |
2024-10-26 | Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions | Poojitha Thota et.al. | 2410.20019 | null |
2024-10-26 | Vulnerability of LLMs to Vertically Aligned Text Manipulations | Zhecheng Li et.al. | 2410.20016 | null |
2024-10-25 | Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective | Ethan Harvey et.al. | 2410.19675 | null |
2024-10-24 | Noise Adaption Network for Morse Code Image Classification | Xiaxia Wang et.al. | 2410.19180 | link |
2024-10-24 | Hybrid Quantum-Classical Feature Extraction approach for Image Classification using Autoencoders and Quantum SVMs | Donovan Slabbert et.al. | 2410.18814 | null |
2024-10-24 | Spatial-Temporal Search for Spiking Neural Networks | Kaiwei Che et.al. | 2410.18580 | null |
2024-10-25 | Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Lehan Wang et.al. | 2410.18387 | null |
2024-10-23 | Using Cartesian slice plots of a cosmological simulation as input of a convolutional neural network | Guillermo Arreaga-Garcia et.al. | 2410.18320 | null |
2024-10-25 | Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | Dongliang Guo et.al. | 2410.18267 | null |
2024-10-23 | Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction | Nicholas Walker et.al. | 2410.18160 | null |
2024-10-23 | Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | Edoardo Legnaro et.al. | 2410.17816 | null |
2024-10-23 | New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture | Ach. Khozaimi et.al. | 2410.17735 | null |
2024-10-24 | Advancing Interpretability in Text Classification through Prototype Learning | Bowen Wei et.al. | 2410.17546 | null |
2024-10-23 | Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning | Jun-En Ding et.al. | 2410.17494 | null |
2024-10-22 | Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection | Mahesh Vaijainthymala Krishnamoorthy et.al. | 2410.17459 | null |
2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251 | null |
2024-10-22 | KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Md Meftahul Ferdaus et.al. | 2410.17172 | link |
2024-10-22 | Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | Ganga Prasad Basyal et.al. | 2410.16711 | null |
2024-10-21 | Efficient Neural Network Training via Subset Pretraining | Jan Spörer et.al. | 2410.16523 | null |
2024-10-21 | 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification | Ram Mohan Rao Kadiyala et.al. | 2410.15998 | null |
2024-10-21 | Visual Representation Learning Guided By Multi-modal Prior Knowledge | Hongkuan Zhou et.al. | 2410.15981 | null |
2024-10-21 | AutoTrain: No-code training for state-of-the-art models | Abhishek Thakur et.al. | 2410.15735 | link |
2024-10-21 | ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | Xumeng Han et.al. | 2410.15732 | null |
2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
2024-10-20 | Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Yusuke Hosoya et.al. | 2410.15315 | link |
2024-10-19 | Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Chaodong Xiao et.al. | 2410.15091 | link |
2024-10-19 | PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification | Ashish Seth et.al. | 2410.15062 | null |
2024-10-19 | Weakly-supervised diagnosis identification from Italian discharge letters | Vittorio Torri et.al. | 2410.15051 | null |
2024-10-19 | Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | Seulbi Lee et.al. | 2410.14975 | null |
2024-10-18 | A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification | Maksuda Akter et.al. | 2410.14536 | null |
2024-10-18 | Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Shuai Zhao et.al. | 2410.14425 | link |
2024-10-18 | A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference | Bin Zhang et.al. | 2410.14236 | null |
2024-10-18 | Comparative Evaluation of Clustered Federated Learning Method | Michael Ben Ali et.al. | 2410.14212 | link |
2024-10-17 | Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Luan Fletcher et.al. | 2410.13989 | link |
2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
2024-10-17 | Augmentation Policy Generation for Image Classification Using Large Language Models | Ant Duru et.al. | 2410.13453 | null |
2024-10-17 | Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification | Guangming Huang et.al. | 2410.13439 | null |
2024-10-16 | Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Fawaz Sammani et.al. | 2410.13016 | link |
2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-17 | From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs | Lukas Birkenmaier et.al. | 2410.12622 | null |
2024-10-16 | Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Yong Zhang et.al. | 2410.12396 | null |
2024-10-15 | Clustering doc2vec output for topic-dimensionality reduction: A MITRE ATT&CK calibration | Nathan Monnet et.al. | 2410.11573 | null |
2024-10-15 | LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models | Hossein Abdi et.al. | 2410.11551 | null |
2024-10-15 | Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning | Minoo Jafarlou et.al. | 2410.11355 | null |
2024-10-14 | Towards a More Complete Theory of Function Preserving Transforms | Michael Painter et.al. | 2410.11038 | null |
2024-10-14 | Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | Etai Littwin et.al. | 2410.10773 | null |
2024-10-15 | Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Yosuke Yamagishi et.al. | 2410.10710 | link |
2024-10-14 | Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification | Jiaxiang Gou et.al. | 2410.10573 | null |
2024-10-14 | Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC Units | Maedeh Ghaderi et.al. | 2410.10545 | null |
2024-10-14 | Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks | Xinyue Liu et.al. | 2410.10454 | link |
2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | link |
2024-10-14 | A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | Nikolaos Mylonas et.al. | 2410.10290 | null |
2024-10-14 | big.LITTLE Vision Transformer for Efficient Visual Recognition | He Guo et.al. | 2410.10267 | null |
2024-10-14 | SkillAggregation: Reference-free LLM-Dependent Aggregation | Guangzhi Sun et.al. | 2410.10215 | null |
2024-10-14 | Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | Zeliang Zhang et.al. | 2410.10160 | null |
2024-10-11 | Efficient Hyperparameter Importance Assessment for CNNs | Ruinan Wang et.al. | 2410.08920 | null |
2024-10-11 | Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning | Nusrat Jahan Prottasha et.al. | 2410.08598 | null |
2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | link |
2024-10-11 | Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | Yiyue Chen et.al. | 2410.08508 | null |
2024-10-11 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-10 | Bilinear MLPs enable weight-based mechanistic interpretability | Michael T. Pearce et.al. | 2410.08417 | null |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-10 | Time Traveling to Defend Against Adversarial Example Attacks in Image Classification | Anthony Etim et.al. | 2410.08338 | null |
2024-10-10 | More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Sagi Shaier et.al. | 2410.08003 | null |
2024-10-10 | When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections | Keryan Chelouche et.al. | 2410.07689 | null |
2024-10-10 | Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks | Minxing Zhang et.al. | 2410.07670 | null |
2024-10-10 | StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models | Minchan Kwon et.al. | 2410.07652 | null |
2024-10-10 | Explainability of Deep Neural Networks for Brain Tumor Detection | S. Park et.al. | 2410.07613 | link |
2024-10-10 | CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | Po-han Li et.al. | 2410.07610 | null |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
2024-10-09 | Optimizing Estimators of Squared Calibration Errors in Classification | Sebastian G. Gruber et.al. | 2410.07014 | null |
2024-10-09 | Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks | Friedrich Wolf-Monheim et.al. | 2410.06927 | null |
2024-10-09 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | null |
2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
2024-10-08 | A Comparative Study of Hybrid Models in Health Misinformation Text Classification | Mkululi Sikosana et.al. | 2410.06311 | null |
2024-10-08 | Conformal Structured Prediction | Botong Zhang et.al. | 2410.06296 | link |
2024-10-08 | TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data | Jeremy Andrew Irvin et.al. | 2410.06234 | null |
2024-10-08 | Manual Verbalizer Enrichment for Few-Shot Text Classification | Quang Anh Nguyen et.al. | 2410.06173 | null |
2024-10-07 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-07 | Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge | Senorita Deb et.al. | 2410.05189 | null |
2024-10-07 | IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification | Yan He et.al. | 2410.05100 | null |
2024-10-07 | Explanation sensitivity to the randomness of large language models: the case of journalistic text classification | Jeremie Bogaert et.al. | 2410.05085 | null |
2024-10-07 | Control-oriented Clustering of Visual Latent Representation | Han Qi et.al. | 2410.05063 | null |
2024-10-07 | SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification | Benjamin Feuer et.al. | 2410.05057 | link |
2024-10-07 | Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks | Sandro Boccuzzo et.al. | 2410.04866 | null |
2024-10-06 | MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network | Doanh C. Bui et.al. | 2410.04507 | null |
2024-10-06 | Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Zhaorui Tan et.al. | 2410.04492 | link |
2024-10-05 | IT |
Nikita Durasov et.al. | 2410.04201 | null |
2024-10-04 | Classification-Denoising Networks | Louis Thiry et.al. | 2410.03505 | null |
2024-10-04 | A Multimodal Framework for Deepfake Detection | Kashish Gandhi et.al. | 2410.03487 | null |
2024-10-04 | On Uncertainty In Natural Language Processing | Dennis Ulmer et.al. | 2410.03446 | link |
2024-10-04 | Comparing zero-shot self-explanations with human rationales in multilingual text classification | Stephanie Brandl et.al. | 2410.03296 | null |
2024-10-04 | Sm: enhanced localization in Multiple Instance Learning for medical imaging classification | Francisco M. Castro-Macías et.al. | 2410.03276 | null |
2024-10-04 | Selective Transformer for Hyperspectral Image Classification | Yichu Xu et.al. | 2410.03171 | null |
2024-10-03 | CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification | Jinghao Shi et.al. | 2410.03038 | null |
2024-10-03 | On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions | Huy Nguyen et.al. | 2410.02935 | null |
2024-10-03 | Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | Zakhar Shumaylov et.al. | 2410.02698 | null |
2024-10-03 | LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | Duy M. H. Nguyen et.al. | 2410.02615 | null |
2024-10-03 | Personalized Quantum Federated Learning for Privacy Image Classification | Jinjing Shi et.al. | 2410.02547 | null |
2024-10-03 | BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning | Gustav Wagner Zakarias et.al. | 2410.02387 | null |
2024-10-03 | CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration | Thomas Buddenkotte et.al. | 2410.02316 | link |
2024-10-03 | Hard Negative Sample Mining for Whole Slide Image Classification | Wentao Huang et.al. | 2410.02212 | link |
2024-10-02 | Kolmogorov-Arnold Network Autoencoders | Mohammadamin Moradi et.al. | 2410.02077 | link |
2024-10-02 | Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data | Sreyan Ghosh et.al. | 2410.02056 | null |
2024-10-02 | FLAG: Financial Long Document Classification via AMR-based GNN | Bolun et.al. | 2410.02024 | link |
2024-10-02 | MONICA: Benchmarking on Long-tailed Medical Image Classification | Lie Ju et.al. | 2410.02010 | null |
2024-10-02 | Revisiting Hierarchical Text Classification: Inference and Metrics | Roman Plaud et.al. | 2410.01305 | link |
2024-10-02 | Automatic deductive coding in discourse analysis: an application of large language models in learning analytics | Lishan Zhang et.al. | 2410.01240 | null |
2024-10-01 | Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang et.al. | 2410.01083 | link |
2024-10-01 | Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | Mostafa Hajighasemloua et.al. | 2410.00779 | null |
2024-10-01 | NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Chi-Sheng Chen et.al. | 2410.00712 | null |
2024-10-01 | TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids | Mazen Balat et.al. | 2410.00403 | null |
2024-09-30 | KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA | Sachin Karmani et.al. | 2410.00267 | null |
2024-09-30 | A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification | Marina Ribeiro et.al. | 2410.00250 | null |
2024-09-30 | Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques | Tin Yuet Chung et.al. | 2410.00207 | null |
2024-10-02 | Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification | Kush Dubey et.al. | 2410.00179 | link |
2024-09-30 | POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator | Eugenio Lomurno et.al. | 2409.20447 | null |
2024-09-30 | Satellite image classification with neural quantum kernels | Pablo Rodriguez-Grasa et.al. | 2409.20356 | null |
2024-09-30 | All-optical autoencoder machine learning framework using diffractive processors | Peijie Feng et.al. | 2409.20346 | null |
2024-09-30 | Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients | Youssef Allouah et.al. | 2409.20329 | null |
2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
2024-09-30 | Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language | Vincent Beliveau et.al. | 2409.20147 | null |
2024-09-30 | SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers | Nick Nikzad et.al. | 2409.19850 | null |
2024-09-29 | Adversarial Examples for DNA Classification | Hyunwoo Yoo et.al. | 2409.19788 | null |
2024-09-29 | FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification | Kexue Fu et.al. | 2409.19720 | null |
2024-09-29 | Vision-Language Models are Strong Noisy Label Detectors | Tong Wei et.al. | 2409.19696 | link |
2024-09-27 | Unconditional stability of a recurrent neural circuit implementing divisive normalization | Shivang Rawat et.al. | 2409.18946 | null |
2024-09-27 | Subspace Preserving Quantum Convolutional Neural Network Architectures | Léo Monbroussou et.al. | 2409.18918 | null |
2024-09-27 | Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation | Md. Farhadul Islam et.al. | 2409.18506 | null |
2024-09-26 | Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective | Yu Wang et.al. | 2409.18316 | null |
2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | null |
2024-09-26 | DARE: Diverse Visual Question Answering with Robustness Evaluation | Hannah Sterz et.al. | 2409.18023 | null |
2024-09-26 | The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis et.al. | 2409.17929 | null |
2024-09-26 | Cascade Prompt Learning for Vision-Language Model Adaptation | Ge Wu et.al. | 2409.17805 | null |
2024-09-26 | Byzantine-Robust Aggregation for Securing Decentralized Federated Learning | Diego Cajaraville-Aboy et.al. | 2409.17754 | null |
2024-09-26 | Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components | Peiyong Wang et.al. | 2409.17583 | link |
2024-09-26 | Leveraging Annotator Disagreement for Text Classification | Jin Xu et.al. | 2409.17577 | null |
2024-09-26 | Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Xun Zhu et.al. | 2409.17508 | null |
2024-09-26 | Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification | Guanyi Mou et.al. | 2409.17474 | null |
2024-09-26 | Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models | Yuqing Zhou et.al. | 2409.17455 | null |
2024-09-25 | Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting | Jay Zoellin et.al. | 2409.17332 | null |
2024-09-25 | BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Yongqi Xu et.al. | 2409.17093 | link |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-26 | HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space | Jacob Fein-Ashley et.al. | 2409.16897 | link |
2024-09-25 | Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification | Aram Khaksar et.al. | 2409.16884 | null |
2024-09-25 | Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | Lucas Piper et.al. | 2409.16838 | link |
2024-09-24 | Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification | Leire Benito-Del-Valle et.al. | 2409.16002 | link |
2024-09-24 | An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images | Dibyasree Guha et.al. | 2409.15958 | null |
2024-09-24 | iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification | Yuanzhe Jin et.al. | 2409.15848 | link |
2024-09-23 | Optimizing News Text Classification with Bi-LSTM and Attention Mechanism for Efficient Data Processing | Bingyao Liu et.al. | 2409.15576 | null |
2024-09-23 | Critic Loss for Image Classification | Brendan Hogan Rappazzo et.al. | 2409.15565 | null |
2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
2024-09-23 | HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning | Naghmeh Shafiee Roudbari et.al. | 2409.15213 | null |
2024-09-23 | Benchmarking Edge AI Platforms for High-Performance ML Inference | Rakshith Jayanth et.al. | 2409.14803 | null |
2024-09-23 | Less yet robust: crucial region selection for scene recognition | Jianqi Zhang et.al. | 2409.14741 | null |
2024-09-22 | Low-Light Enhancement Effect on Classification and Detection: An Empirical Study | Xu Wu et.al. | 2409.14461 | null |
2024-09-18 | Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes | Nikita Kiselev et.al. | 2409.11995 | link |
2024-09-18 | Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | Jin Jie Sean Yeo et.al. | 2409.11964 | null |
2024-09-18 | Agglomerative Token Clustering | Joakim Bruslund Haurum et.al. | 2409.11923 | null |
2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | null |
2024-09-18 | Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks | Amirhossein Dezhboro et.al. | 2409.11665 | null |
2024-09-18 | Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images | A. A. G. Yogi Pramana et.al. | 2409.11644 | null |
2024-09-18 | Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network | Yang Liu et.al. | 2409.11619 | null |
2024-09-17 | Multi-Cohort Framework with Cohort-Aware Attention and Adversarial Mutual-Information Minimization for Whole Slide Image Classification | Sharon Peled et.al. | 2409.11119 | null |
2024-09-17 | Anti-ESIA: Analyzing and Mitigating Impacts of Electromagnetic Signal Injection Attacks | Denglin Kang et.al. | 2409.10922 | null |
2024-09-16 | Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? | Kaleb Kassaw et.al. | 2409.10775 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-16 | InfoDisent: Explainability of Image Classification Models by Information Disentanglement | Łukasz Struski et.al. | 2409.10329 | null |
2024-09-16 | Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation | Neil De La Fuente et.al. | 2409.10286 | null |
2024-09-15 | Finetuning CLIP to Reason about Pairwise Differences | Dylan Sam et.al. | 2409.09721 | null |
2024-09-15 | Compositional Audio Representation Learning | Sripathi Sridhar et.al. | 2409.09619 | null |
2024-09-14 | One missing piece in Vision and Language: A Survey on Comics Understanding | Emanuele Vivoli et.al. | 2409.09502 | link |
2024-09-14 | Real-world Adversarial Defense against Patch Attacks based on Diffusion Model | Xingxing Wei et.al. | 2409.09406 | null |
2024-09-14 | Turbo your multi-modal classification with contrastive learning | Zhiyu Zhang et.al. | 2409.09282 | null |
2024-09-14 | Leveraging Foundation Models for Efficient Federated Learning in Resource-restricted Edge Networks | S. Kawa Atapour et.al. | 2409.09273 | null |
2024-09-13 | ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Sreyan Ghosh et.al. | 2409.09213 | link |
2024-09-13 | Pushing the boundaries of event subsampling in event-based video classification using CNNs | Hesam Araghi et.al. | 2409.08953 | link |
2024-09-13 | Pushing Joint Image Denoising and Classification to the Edge | Thomas C Markhorst et.al. | 2409.08943 | null |
2024-09-13 | Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering | Changxin Liu et.al. | 2409.08640 | null |
2024-09-13 | Anytime Continual Learning for Open Vocabulary Classification | Zhen Zhu et.al. | 2409.08518 | link |
2024-09-12 | Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms | Fatemeh Askari et.al. | 2409.07989 | link |
2024-09-12 | Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters | Shun Zou et.al. | 2409.07896 | link |
2024-09-12 | Classifying Images with CoLaNET Spiking Neural Network -- the MNIST Example | Mikhail Kiselev et.al. | 2409.07833 | null |
2024-09-12 | Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption | Zhizheng Lai et.al. | 2409.07751 | null |
2024-09-12 | DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning | Kangyang Luo et.al. | 2409.07734 | null |
2024-09-12 | Cooperative Inference with Interleaved Operator Partitioning for CNNs | Zhibang Liu et.al. | 2409.07693 | null |
2024-09-11 | Token Turing Machines are Efficient Vision Models | Purvish Jajal et.al. | 2409.07613 | null |
2024-09-11 | Minimizing Embedding Distortion for Robust Out-of-Distribution Performance | Tom Shaked et.al. | 2409.07582 | null |
2024-09-11 | A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks | Erik B. Terres-Escudero et.al. | 2409.07387 | null |
2024-09-11 | Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding | Ronald Katende et.al. | 2409.07310 | null |
2024-09-11 | LLM-based feature generation from text for interpretable machine learning | Vojtěch Balek et.al. | 2409.07132 | null |
2024-09-11 | Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | Kangyang Luo et.al. | 2409.06955 | null |
2024-09-10 | Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm | Jinwei Zhao et.al. | 2409.06542 | null |
2024-09-10 | Seam Carving as Feature Pooling in CNN | Mohammad Imrul Jubair et.al. | 2409.06311 | null |
2024-09-10 | EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification | Suorong Yang et.al. | 2409.06290 | link |
2024-09-09 | A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets | Mariana Yukari Noguti et.al. | 2409.05972 | null |
2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
2024-09-09 | Adversarial Attacks on Data Attribution | Xinhe Wang et.al. | 2409.05657 | null |
2024-09-09 | Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition | Shiming Ge et.al. | 2409.05384 | null |
2024-09-09 | RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU | Chengyuan Liu et.al. | 2409.05275 | null |
2024-09-09 | Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space | Junho Lee et.al. | 2409.05260 | null |
2024-09-08 | PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels | Aayushman et.al. | 2409.04975 | link |
2024-09-07 | Activation Function Optimization Scheme for Image Classification | Abdur Rahman et.al. | 2409.04915 | null |
2024-09-07 | LoCa: Logit Calibration for Knowledge Distillation | Runming Yang et.al. | 2409.04778 | null |
2024-09-07 | Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis | Preetu Mehta et.al. | 2409.04734 | null |
2024-09-06 | Connectivity-Inspired Network for Context-Aware Recognition | Gianluca Carloni et.al. | 2409.04360 | null |
2024-09-06 | An optically accelerated extreme learning machine using hot atomic vapors | Pierre Azam et.al. | 2409.04312 | null |
2024-09-06 | PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Tianqi Wei et.al. | 2409.04038 | null |
2024-09-05 | Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning | Isaac Ray et.al. | 2409.03938 | null |
2024-09-05 | WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking | Carl De Sousa Trias et.al. | 2409.03902 | null |
2024-09-05 | On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models | Thanh-Dung Le et.al. | 2409.03901 | null |
2024-09-05 | Have Large Vision-Language Models Mastered Art History? | Ombretta Strafforello et.al. | 2409.03521 | null |
2024-09-05 | Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks | Akshay Jain et.al. | 2409.03458 | link |
2024-09-05 | Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications | Tong Bu et.al. | 2409.03368 | null |
2024-09-05 | PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning | Bowen Tian et.al. | 2409.03192 | null |
2024-09-05 | The AdEMAMix Optimizer: Better, Faster, Older | Matteo Pagliardini et.al. | 2409.03137 | null |
2024-09-04 | iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo et.al. | 2409.02838 | null |
2024-09-03 | MedUnA: Language guided Unsupervised Adaptation of Vision-Language Models for Medical Image Classification | Umaima Rahman et.al. | 2409.02729 | null |
2024-09-05 | OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation | Włodzimierz Lewoniewski et.al. | 2409.02649 | null |
2024-09-04 | Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization | Cho-Ying Wu et.al. | 2409.02486 | null |
2024-09-03 | Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems | Sanjita Prajapati et.al. | 2409.02278 | null |
2024-09-05 | Robust Clustering on High-Dimensional Data with Stochastic Quantization | Anton Kozyriev et.al. | 2409.02066 | link |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | null |
2024-09-03 | State-of-the-art Advances of Deep-learning Linguistic Steganalysis Research | Yihao Wang et.al. | 2409.01780 | null |
2024-09-03 | Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization | Avraham Chapman et.al. | 2409.01672 | null |
2024-09-03 | ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition | Shiting Xiao et.al. | 2409.01564 | null |
2024-08-30 | Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Francesca Grasso et.al. | 2408.17362 | link |
2024-08-30 | Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification | Zhiwei Zhang et.al. | 2408.17197 | null |
2024-08-30 | Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study | Shubham Agarwal et.al. | 2408.17181 | null |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Generative Modeling Perspective for Control and Reasoning in Robotics | Takuma Yoneda et.al. | 2408.17041 | null |
2024-08-29 | Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector | Deepak Dagar et.al. | 2408.16892 | null |
2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | null |
2024-08-29 | Android Malware Detection Based on RGB Images and Multi-feature Fusion | Zhiqiang Wang et.al. | 2408.16555 | null |
2024-08-29 | SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models | Guangxi Li et.al. | 2408.16273 | link |
2024-08-29 | Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation | Yanghao Wang et.al. | 2408.16266 | null |
2024-08-29 | Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification | Yu Liang et.al. | 2408.16265 | null |
2024-08-28 | EMP: Enhance Memory in Data Pruning | Jinying Xiao et.al. | 2408.16031 | null |
2024-08-28 | Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning | Bingchen Yan et.al. | 2408.15924 | null |
2024-08-28 | ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | Tiantian Feng et.al. | 2408.15803 | null |
2024-08-28 | Visual Prompt Engineering for Medical Vision Language Models in Radiology | Stefan Denner et.al. | 2408.15802 | null |
2024-08-28 | Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings | Lingyu Gao et.al. | 2408.15650 | null |
2024-08-27 | DCT-CryptoNets: Scaling Private Inference in the Frequency Domain | Arjun Roy et.al. | 2408.15231 | null |
2024-08-27 | A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships | Gracile Astlin Pereira et.al. | 2408.15178 | null |
2024-08-28 | AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection | Mykhailo Koshil et.al. | 2408.15113 | null |
2024-08-27 | Data downlink prioritization using image classification on-board a 6U CubeSat | Keenan A. A. Chatar et.al. | 2408.14865 | null |
2024-08-27 | Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification | Yiqiang Cai et.al. | 2408.14862 | null |
2024-08-27 | Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | Sirui Li et.al. | 2408.14770 | null |
2024-08-26 | On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise | M. Reza Eslami et.al. | 2408.14680 | null |
2024-08-26 | Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2408.14441 | null |
2024-08-26 | Uncertainties of Latent Representations in Computer Vision | Michael Kirchhof et.al. | 2408.14281 | null |
2024-08-26 | MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification | Feng Gao et.al. | 2408.14255 | null |
2024-08-26 | Feature Aligning Few shot Learning Method Using Local Descriptors Weighted Rules | Bingchen Yan et.al. | 2408.14192 | null |
2024-08-26 | GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | Sven Oehri et.al. | 2408.14131 | null |
2024-08-25 | Few-Shot Histopathology Image Classification: Evaluating State-of-the-Art Methods and Unveiling Performance Insights | Ardhendu Sekhar et.al. | 2408.13816 | null |
2024-08-25 | On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective | Tal Alter et.al. | 2408.13809 | null |
2024-08-25 | Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion | Xu Zhang et.al. | 2408.13744 | link |
2024-08-25 | 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Haizhao Jing et.al. | 2408.13728 | null |
2024-08-24 | Enhanced Astronomical Source Classification with Integration of Attention Mechanisms and Vision Transformers | Srinadh Reddy Bhavanam et.al. | 2408.13634 | null |
2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D'Cruz et.al. | 2408.13253 | null |
2024-08-23 | EAViT: External Attention Vision Transformer for Audio Classification | Aquib Iqbal et.al. | 2408.13201 | null |
2024-08-23 | A gradient system based on anisotropic monochrome image processing with orientation auto-adjustment | Harbir Antil et.al. | 2408.12847 | null |
2024-08-23 | Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence | Purushothaman Natarajan et.al. | 2408.12837 | null |
2024-08-23 | VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models | Purushothaman Natarajan et.al. | 2408.12808 | null |
2024-08-23 | BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models | Yige Li et.al. | 2408.12798 | null |
2024-08-23 | Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling | Zongyao Lyu et.al. | 2408.12774 | null |
2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | null |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-22 | The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design | Artem Snegirev et.al. | 2408.12503 | null |
2024-08-22 | Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification | Sudi Murindanyi et.al. | 2408.12426 | null |
2024-08-22 | AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network | Donghwa Kang et.al. | 2408.12293 | null |
2024-08-22 | Whole Slide Image Classification of Salivary Gland Tumours | John Charlton et.al. | 2408.12275 | null |
2024-08-22 | Query-Efficient Video Adversarial Attack with Stylized Logo | Duoxun Tang et.al. | 2408.12099 | null |
2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
2024-08-21 | SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Zhiqiang Wu et.al. | 2408.11760 | null |
2024-08-21 | Improving Calibration by Relating Focal Loss, Temperature Scaling, and Properness | Viacheslav Komisarenko et.al. | 2408.11598 | link |
2024-08-21 | MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Minghao Han et.al. | 2408.11505 | null |
2024-08-21 | Enabling Small Models for Zero-Shot Classification through Model Label Learning | Jia Zhang et.al. | 2408.11449 | null |
2024-08-21 | Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond | Minghao Liu et.al. | 2408.11338 | null |
2024-08-21 | Towards Evaluating Large Language Models on Sarcasm Understanding | Yazhou Zhang et.al. | 2408.11319 | null |
2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
2024-08-20 | A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping | Kevin Kam Fung Yuen et.al. | 2408.10572 | null |
2024-08-20 | NoMatterXAI: Generating "No Matter What" Alterfactual Examples for Explaining Black-Box Text Classification Models | Tuc Nguyen et.al. | 2408.10528 | null |
2024-08-20 | Cervical Cancer Detection Using Multi-Branch Deep Learning Model | Tatsuhiro Baba et.al. | 2408.10498 | null |
2024-08-19 | HaSPeR: An Image Repository for Hand Shadow Puppet Recognition | Syed Rifat Raiyan et.al. | 2408.10360 | link |
2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | null |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Towards Robust Federated Image Classification: An Empirical Study of Weight Selection Strategies in Manufacturing | Vinit Hegiste et.al. | 2408.10024 | null |
2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021 | null |
2024-08-19 | Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning | David Hanny et.al. | 2408.09914 | null |
2024-08-19 | Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions | Sebastian Heineking et.al. | 2408.09831 | null |
2024-08-19 | AutoML-guided Fusion of Entity and LLM-based representations | Boshko Koloski et.al. | 2408.09794 | null |
2024-08-19 | Dataset Distillation for Histopathology Image Classification | Cong Cong et.al. | 2408.09709 | null |
2024-08-19 | A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification | Claudio M. V. de Andrade et.al. | 2408.09629 | null |
2024-08-18 | Attention Is Not What You Need: Revisiting Multi-Instance Learning for Whole Slide Image Classification | Xin Liu et.al. | 2408.09449 | null |
2024-08-17 | Narrowing the Focus: Learned Optimizers for Pretrained Models | Gus Kristiansen et.al. | 2408.09310 | null |
2024-08-16 | DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models | Eman Ali et.al. | 2408.08855 | null |
2024-08-16 | LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | Mohamad Fares El Hajj Chehade et.al. | 2408.08824 | null |
2024-08-16 | Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification | Abdullah Al Imran et.al. | 2408.08803 | null |
2024-08-16 | Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers | Zihang Song et.al. | 2408.08794 | null |
2024-08-16 | Quantum convolutional neural networks for jet images classification | Hala Elhag et.al. | 2408.08701 | null |
2024-08-16 | MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation | Zunjie Xiao et.al. | 2408.08600 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness | Hefei Mei et.al. | 2408.08502 | link |
2024-08-15 | Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention | Zohaib Khan et.al. | 2408.08454 | null |
2024-08-15 | Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts | Abdur R. Fayjie et.al. | 2408.08432 | null |
2024-08-15 | SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training | Gengwei Zhang et.al. | 2408.08295 | link |
2024-08-15 | Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices | Tess Watt et.al. | 2408.08215 | null |
2024-08-15 | Towards flexible perception with visual memory | Robert Geirhos et.al. | 2408.08172 | null |
2024-08-15 | Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification | Jiexuan Yan et.al. | 2408.08125 | link |
2024-08-15 | HAIR: Hypernetworks-based All-in-One Image Restoration | Jin Cao et.al. | 2408.08091 | link |
2024-08-14 | Large Language Models Prompting With Episodic Memory | Dai Do et.al. | 2408.07465 | null |
2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et.al. | 2408.07243 | null |
2024-08-13 | Efficient Search for Customized Activation Functions with Gradient Descent | Lukas Strack et.al. | 2408.06820 | link |
2024-08-13 | Do Vision-Language Foundational models show Robust Visual Perception? | Shivam Chandhok et.al. | 2408.06781 | link |
2024-08-13 | Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model | Yongcheng Li et.al. | 2408.06716 | link |
2024-08-13 | Coherence Awareness in Diffractive Neural Networks | Matan Kleiner et.al. | 2408.06681 | null |
2024-08-12 | Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks | Lucas Félix et.al. | 2408.06341 | null |
2024-08-12 | Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance | Manuel Milling et.al. | 2408.06264 | null |
2024-08-12 | Deep Learning System Boundary Testing through Latent Space Style Mixing | Amr Abdellatif et.al. | 2408.06258 | null |
2024-08-12 | Global-to-Local Support Spectrums for Language Model Explainability | Lucas Agussurja et.al. | 2408.05976 | null |
2024-08-12 | A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class | Qian Qiao et.al. | 2408.05953 | null |
2024-08-12 | Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information | Mingkun Zhang et.al. | 2408.05900 | null |
2024-08-11 | HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning | Zhijian Chen et.al. | 2408.05786 | null |
2024-08-11 | PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings | Vaibhav Ganatra et.al. | 2408.05754 | null |
2024-08-11 | Disposable-key-based image encryption for collaborative learning of Vision Transformer | Rei Aso et.al. | 2408.05737 | null |
2024-08-11 | A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation | Koushik Biswas et.al. | 2408.05692 | null |
2024-08-09 | A conformalized learning of a prediction set with applications to medical imaging classification | Roy Hirsch et.al. | 2408.05037 | null |
2024-08-09 | Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks | Verna Dankers et.al. | 2408.04965 | null |
2024-08-09 | LiD-FL: Towards List-Decodable Federated Learning | Hong Liu et.al. | 2408.04963 | null |
2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et.al. | 2408.04961 | link |
2024-08-08 | Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes | Bhushan Atote et.al. | 2408.04606 | null |
2024-08-08 | SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals | Haoran Zheng et.al. | 2408.04575 | null |
2024-08-08 | An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification | Ben Crulis et.al. | 2408.04460 | null |
2024-08-08 | Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction | Yuchen Wang et.al. | 2408.04294 | null |
2024-08-07 | FMiFood: Multi-modal Contrastive Learning for Food Image Classification | Xinyue Pan et.al. | 2408.03922 | null |
2024-08-07 | Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning | Simret Araya Gebreegziabher et.al. | 2408.03819 | null |
2024-08-07 | Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification | Georgia Sovatzidi et.al. | 2408.03745 | null |
2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | link |
2024-08-07 | Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Jaewook Lee et.al. | 2408.03663 | null |
2024-08-07 | Making Robust Generalizers Less Rigid with Soft Ascent-Descent | Matthew J. Holland et.al. | 2408.03619 | null |
2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | null |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | A Debiased Nearest Neighbors Framework for Multi-Label Text Classification | Zifeng Cheng et.al. | 2408.03202 | null |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | null |
2024-08-06 | L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Elvys Linhares Pontes et.al. | 2408.03033 | null |
2024-08-06 | Adversarial Robustness of Open-source Text Classification Models and Fine-Tuning Chains | Hao Qin et.al. | 2408.02963 | null |
2024-08-06 | Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration | Xiaoqing Zhang et.al. | 2408.02906 | null |
2024-08-05 | Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space | Eduardo Sanchez-Karhunen et.al. | 2408.02838 | null |
2024-08-05 | Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services | Shaopeng Fu et.al. | 2408.02814 | null |
2024-08-05 | FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification | Yijin Huang et.al. | 2408.02426 | null |
2024-08-05 | On the Robustness of Malware Detectors to Adversarial Samples | Muhammad Salman et.al. | 2408.02310 | null |
2024-08-05 | Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Hojung Lee et.al. | 2408.02307 | null |
2024-08-05 | Network Fission Ensembles for Low-Cost Self-Ensembles | Hojung Lee et.al. | 2408.02301 | null |
2024-08-04 | VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Somnath Sendhil Kumar et.al. | 2408.02140 | null |
2024-08-04 | DeMansia: Mamba Never Forgets Any Tokens | Ricky Fang et.al. | 2408.01986 | null |
2024-08-06 | A Survey and Evaluation of Adversarial Attacks for Object Detection | Khoi Nguyen Tiet Nguyen et.al. | 2408.01934 | null |
2024-08-03 | Safe Semi-Supervised Contrastive Learning Using In-Distribution Data as Positive Examples | Min Gu Kwak et.al. | 2408.01872 | null |
2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
2024-08-02 | Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder | Matan Atad et.al. | 2408.01571 | null |
2024-08-02 | Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01372 | null |
2024-08-02 | WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01231 | null |
2024-08-02 | Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01224 | null |
2024-08-02 | Rethinking Pre-trained Feature Extractor Selection in Multiple Instance Learning for Whole Slide Image Classification | Bryan Wong et.al. | 2408.01167 | null |
2024-08-01 | CERT-ED: Certifiably Robust Text Classification for Edit Distance | Zhuoqun Huang et.al. | 2408.00728 | null |
2024-08-01 | Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | Xiaoyi Liu et.al. | 2408.00636 | null |
2024-08-01 | DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Rakshith Subramanyam et.al. | 2408.00331 | null |
2024-07-31 | Vera Verto: Multimodal Hijacking Attack | Minxing Zhang et.al. | 2408.00129 | null |
2024-07-31 | Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin et.al. | 2407.21757 | null |
2024-07-30 | Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation | Marcelo Matheus Gauy et.al. | 2407.20989 | null |
2024-07-30 | Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Adam Wojciechowski et.al. | 2407.20899 | null |
2024-08-01 | DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention | Wei Wang et.al. | 2407.20843 | null |
2024-08-01 | The Susceptibility of Example-Based Explainability Methods to Class Outliers | Ikhtiyor Nematov et.al. | 2407.20678 | null |
2024-07-30 | Knowledge Fused Recognition: Fusing Hierarchical Knowledge for Image Recognition through Quantitative Relativity Modeling and Deep Metric Learning | Yunfeng Zhao et.al. | 2407.20600 | null |
2024-07-30 | Exploring Liquid Neural Networks on Loihi-2 | Wiktoria Agata Pawlak et.al. | 2407.20590 | null |
2024-07-29 | Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation | Ashirbad Mishra et.al. | 2407.20462 | null |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | null |
2024-07-29 | Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism | Tianhang Nan et.al. | 2407.19821 | null |
2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
2024-07-28 | Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Tianxiao Zhang et.al. | 2407.19394 | link |
2024-07-27 | Inference-Time Selective Debiasing | Gleb Kuzmin et.al. | 2407.19345 | null |
2024-07-27 | Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes | Chinedu Eleh et.al. | 2407.19297 | null |
2024-07-27 | Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation | Riyansha Singh et.al. | 2407.19265 | null |
2024-07-27 | A Survey of Malware Detection Using Deep Learning | Ahmed Bensaoud et.al. | 2407.19153 | null |
2024-07-26 | UniForensics: Face Forgery Detection via General Facial Representation | Ziyuan Fang et.al. | 2407.19079 | null |
2024-07-26 | A Scalable Quantum Non-local Neural Network for Image Classification | Sparsh Gupta et.al. | 2407.18906 | link |
2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
2024-07-26 | Local Binary Pattern(LBP) Optimization for Feature Extraction | Zeinab Sedaghatjoo et.al. | 2407.18665 | null |
2024-07-26 | Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Bo Wang et.al. | 2407.18625 | null |
2024-07-26 | Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification | Huiyan Bai et.al. | 2407.18593 | null |
2024-07-26 | VSSD: Vision Mamba with Non-Casual State Space Duality | Yuheng Shi et.al. | 2407.18559 | link |
2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
2024-07-25 | Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network | Sukwon Yun et.al. | 2407.17857 | link |
2024-07-25 | SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification | Heng Fang et.al. | 2407.17689 | link |
2024-07-26 | Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Qing Su et.al. | 2407.17671 | link |
2024-07-24 | Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference | Catherine Huang et.al. | 2407.17663 | null |
2024-07-23 | S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks | Neha A S et.al. | 2407.17587 | null |
2024-07-24 | A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks | Fabiano Belém et.al. | 2407.17284 | null |
2024-07-24 | Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? | Johannes Kiechle et.al. | 2407.17219 | link |
2024-07-24 | Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | Alessandro Sebastianelli et.al. | 2407.17108 | null |
2024-07-24 | An Adaptive Gradient Regularization Method | Huixiu Jiang et.al. | 2407.16944 | null |
2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | null |
2024-07-23 | Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging | Daniela L. Ramos et.al. | 2407.16608 | null |
2024-07-23 | Designing robust diffractive neural networks with improved transverse shift tolerance | Daniil V. Soshnikov et.al. | 2407.16456 | null |
2024-07-23 | Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks | Ayan Igali et.al. | 2407.16268 | null |
2024-07-23 | HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification | Shuyi Ouyang et.al. | 2407.16244 | null |
2024-07-23 | Improved Few-Shot Image Classification Through Multiple-Choice Questions | Dipika Khullar et.al. | 2407.16145 | null |
2024-07-22 | Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network | Zhen Wang et.al. | 2407.16021 | null |
2024-07-22 | AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations | Ikhtiyor Nematov et.al. | 2407.16010 | null |
2024-07-22 | Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Aayush Saxena et.al. | 2407.15904 | null |
2024-07-22 | Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning | Josiah Couch et.al. | 2407.15724 | null |
2024-07-22 | Retinomorphic Feature Detection and Machine Vision in a Network Laser | Wai Kit Ng et.al. | 2407.15558 | null |
2024-07-22 | Learning deep illumination-robust features from multispectral filter array images | Anis Amziane et.al. | 2407.15472 | null |
2024-07-22 | Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Junha Song et.al. | 2407.15383 | null |
2024-07-22 | FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification | Weiping Ding et.al. | 2407.15312 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | null |
2024-07-21 | A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts | Gokcen Gokceoglu et.al. | 2407.15136 | null |
2024-07-20 | Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns | Christos Kyrkou et.al. | 2407.14831 | link |
2024-07-20 | Subgraph Clustering and Atom Learning for Improved Image Classification | Aryan Singh et.al. | 2407.14772 | null |
2024-07-20 | A Comprehensive Review of Few-shot Action Recognition | Yuyang Wanyan et.al. | 2407.14744 | null |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
2024-07-19 | Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models | Xuenan Xu et.al. | 2407.14355 | null |
2024-07-19 | EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Youssef Doulfoukar et.al. | 2407.14314 | null |
2024-07-18 | CoAPT: Context Attribute words for Prompt Tuning | Gun Lee et.al. | 2407.13808 | null |
2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772 | link |
2024-07-18 | Addressing Imbalance for Class Incremental Learning in Medical Image Classification | Xuze Hao et.al. | 2407.13768 | null |
2024-07-18 | Differential Privacy Mechanisms in Neural Tangent Kernel Regression | Jiuxiang Gu et.al. | 2407.13621 | null |
2024-07-18 | CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data | Aristotelis Ballas et.al. | 2407.13421 | link |
2024-07-17 | LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner et.al. | 2407.12753 | null |
2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
2024-07-17 | Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? | Aman Sinha et.al. | 2407.12626 | null |
2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
2024-07-17 | Non-parametric regularization for class imbalance federated medical image classification | Jeffry Wicaksana et.al. | 2407.12446 | link |
2024-07-17 | FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification | Markus Weißflog et.al. | 2407.12375 | null |
2024-07-17 | Adaptive Cascading Network for Continual Test-Time Adaptation | Kien X. Nguyen et.al. | 2407.12240 | null |
2024-07-16 | Generalized Coverage for More Robust Low-Budget Active Learning | Wonho Bae et.al. | 2407.12212 | null |
2024-07-18 | A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification | Markus Marks et.al. | 2407.12210 | null |
2024-07-16 | Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces | Shumei Liu et.al. | 2407.11701 | null |
2024-07-16 | Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification | Naif Alkhunaizi et.al. | 2407.11573 | null |
2024-07-16 | TCFormer: Visual Recognition via Token Clustering Transformer | Wang Zeng et.al. | 2407.11321 | link |
2024-07-16 | PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Pierre-David Letourneau et.al. | 2407.11306 | null |
2024-07-15 | Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion | Philipp Allgeuer et.al. | 2407.11211 | null |
2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
2024-07-15 | Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | Linhao Qu et.al. | 2407.10814 | null |
2024-07-15 | Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain | Paweł Zyblewski et.al. | 2407.10807 | null |
2024-07-15 | Anticipating Future Object Compositions without Forgetting | Youssef Zahran et.al. | 2407.10723 | null |
2024-07-15 | GeoMix: Towards Geometry-Aware Data Augmentation | Wentao Zhao et.al. | 2407.10681 | link |
2024-07-15 | Learning Natural Consistency Representation for Face Forgery Video Detection | Daichi Zhang et.al. | 2407.10550 | null |
2024-07-15 | Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | Yifei Yang et.al. | 2407.10495 | null |
2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
2024-07-14 | Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia | Dimitris Papaioannou et.al. | 2407.10251 | null |
2024-07-14 | Advancing Continual Learning for Robust Deepfake Audio Classification | Feiyi Dong et.al. | 2407.10108 | null |
2024-07-12 | Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Levente Halmosi et.al. | 2407.09150 | link |
2024-07-12 | Open Vocabulary Multi-Label Video Classification | Rohit Gupta et.al. | 2407.09073 | null |
2024-07-12 | GPC: Generative and General Pathology Image Classifier | Anh Tien Nguyen et.al. | 2407.09035 | null |
2024-07-12 | CAMP: Continuous and Adaptive Learning Model in Pathology | Anh Tien Nguyen et.al. | 2407.09030 | null |
2024-07-12 | SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification | Tong Shu et.al. | 2407.08968 | null |
2024-07-12 | Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification | Ke Ji et.al. | 2407.08959 | null |
2024-07-11 | Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique | Jackson Hamel et.al. | 2407.08800 | null |
2024-07-11 | Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification | Wenshuo Peng et.al. | 2407.08787 | null |
2024-07-11 | ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions | Jiu Feng et.al. | 2407.08691 | link |
2024-07-11 | Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks | Andrey Ignatov et.al. | 2407.08625 | link |
2024-07-11 | BiasPruner: Debiased Continual Learning for Medical Image Classification | Nourhan Bayasi et.al. | 2407.08609 | link |
2024-07-11 | GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification | Aitao Yang et.al. | 2407.08255 | link |
2024-07-11 | Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis | Gerard Christopher Yeo et.al. | 2407.08182 | null |
2024-07-11 | Enrich the content of the image Using Context-Aware Copy Paste | Qiushi Guo et.al. | 2407.08151 | null |
2024-07-10 | MambaVision: A Hybrid Mamba-Transformer Vision Backbone | Ali Hatamizadeh et.al. | 2407.08083 | link |
2024-07-10 | The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others | Daniel Sikar et.al. | 2407.07818 | null |
2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | null |
2024-07-10 | HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Omar S. EL-Assiouti et.al. | 2407.07516 | null |
2024-07-10 | Towards a text-based quantitative and explainable histopathology image analysis | Anh Tien Nguyen et.al. | 2407.07360 | null |
2024-07-11 | FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification | Doanh C. Bui et.al. | 2407.07340 | link |
2024-07-10 | Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu et.al. | 2407.07307 | link |
2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
2024-07-09 | CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion | Hosam S. EL-Assiouti et.al. | 2407.06673 | null |
2024-07-09 | NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification | Hongfei Huang et.al. | 2407.06579 | null |
2024-07-08 | Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches | Y. Cordero et.al. | 2407.06416 | null |
2024-07-08 | GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jon Crall et.al. | 2407.06337 | null |
2024-07-08 | Multi-Label Plant Species Classification with Self-Supervised Vision Transformers | Murilo Gustineli et.al. | 2407.06298 | link |
2024-07-08 | Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise | Bidur Khanal et.al. | 2407.05973 | null |
2024-07-08 | Wavelet Convolutions for Large Receptive Fields | Shahaf E. Finder et.al. | 2407.05848 | link |
2024-07-08 | Evaluating the Fairness of Neural Collapse in Medical Image Classification | Kaouther Mouheb et.al. | 2407.05843 | null |
2024-07-08 | Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification | Jiaying Shi et.al. | 2407.05647 | null |
2024-07-08 | New Directions in Text Classification Research: Maximizing The Performance of Sentiment Classification from Limited Data | Surya Agustian et.al. | 2407.05627 | null |
2024-07-08 | Momentum Auxiliary Network for Supervised Local Learning | Junhao Su et.al. | 2407.05623 | link |
2024-07-08 | Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li et.al. | 2407.05609 | link |
2024-07-08 | FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Jiedong Zhuang et.al. | 2407.05578 | null |
2024-07-08 | An accurate detection is not all you need to combat label noise in web-noisy datasets | Paul Albert et.al. | 2407.05528 | null |
2024-07-07 | Leveraging Topological Guidance for Improved Knowledge Distillation | Eun Som Jeon et.al. | 2407.05316 | link |
2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | null |
2024-07-05 | AMD: Automatic Multi-step Distillation of Large-scale Vision Models | Cheng Han et.al. | 2407.04208 | null |
2024-07-04 | LeDNet: Localization-enabled Deep Neural Network for Multi-Label Radiography Image Classification | Lalit Pant et.al. | 2407.03931 | null |
2024-07-04 | DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | Saifullah Saifullah et.al. | 2407.03830 | null |
2024-07-04 | reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis | Kai Norman Clasen et.al. | 2407.03653 | link |
2024-07-04 | Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota et.al. | 2407.03623 | null |
2024-07-04 | Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification | Xuerong Zhang et.al. | 2407.03596 | null |
2024-07-04 | DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Wenhui Zhu et.al. | 2407.03575 | link |
2024-07-03 | A multicategory jet image classification framework using deep neural network | Jairo Orozco Sandoval et.al. | 2407.03524 | null |
2024-07-03 | Model Guidance via Explanations Turns Image Classifiers into Segmentation Models | Xiaoyan Yu et.al. | 2407.03009 | null |
2024-07-03 | ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation | Yipin Guo et.al. | 2407.02881 | null |
2024-07-03 | Fine-Grained Scene Image Classification with Modality-Agnostic Adapter | Yiqun Wang et.al. | 2407.02769 | link |
2024-07-03 | ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers | Yanfeng Jiang et.al. | 2407.02763 | null |
2024-07-02 | Spectral Graph Reasoning Network for Hyperspectral Image Classification | Huiling Wang et.al. | 2407.02647 | null |
2024-07-01 | CGRclust: Chaos Game Representation for Twin Contrastive Clustering of Unlabelled DNA Sequences | Fatemeh Alipour et.al. | 2407.02538 | link |
2024-07-02 | Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts | Chunlan Ma et.al. | 2407.02320 | null |
2024-07-03 | Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis | Sufen Ren et.al. | 2407.02261 | null |
2024-07-02 | Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification | Shulei Qiu et.al. | 2407.02123 | null |
2024-07-01 | Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies | Sebastian A. Cruz Romero et.al. | 2407.01705 | null |
2024-07-02 | xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Tianrun Chen et.al. | 2407.01530 | link |
2024-07-01 | Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Balaji VS et.al. | 2407.01435 | null |
2024-07-01 | Semantic Compositions Enhance Vision-Language Contrastive Learning | Maxwell Aladago et.al. | 2407.01408 | null |
2024-07-01 | GalLoP: Learning Global and Local Prompts for Vision-Language Models | Marc Lafon et.al. | 2407.01400 | null |
2024-07-01 | Protecting Privacy in Classifiers by Token Manipulation | Re'em Harel et.al. | 2407.01334 | null |
2024-07-01 | Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks | Roberto Alcover-Couso et.al. | 2407.01327 | null |
2024-06-28 | Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes | Dmitry Demidov et.al. | 2406.19814 | link |
2024-06-27 | Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads | Ali Khaleghi Rahimian et.al. | 2406.19391 | link |
2024-06-27 | Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation | Yushun Tang et.al. | 2406.19341 | null |
2024-06-27 | Spiking Convolutional Neural Networks for Text Classification | Changze Lv et.al. | 2406.19230 | link |
2024-06-27 | Adaptive Stochastic Weight Averaging | Caglar Demir et.al. | 2406.19092 | link |
2024-06-27 | FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity | Zhaobin Sun et.al. | 2406.18995 | link |
2024-06-26 | Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated | Jiazhou Ji et.al. | 2406.18259 | null |
2024-06-26 | ViT-1.58b: Mobile Vision Transformers in the 1-bit Era | Zhengqing Yuan et.al. | 2406.18051 | null |
2024-06-25 | Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation | Tushar Prasanna Swaminathan et.al. | 2406.17749 | link |
2024-06-25 | Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Arijit Sehanobish et.al. | 2406.17740 | null |
2024-06-25 | BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Zeinab Sherkatghanad et.al. | 2406.17640 | link |
2024-06-26 | Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Sedigheh Eslami et.al. | 2406.17639 | null |
2024-06-25 | Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels | Nicholas Pangakis et.al. | 2406.17633 | null |
2024-06-25 | Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Huiyao Chen et.al. | 2406.17534 | link |
2024-06-25 | TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | Joshua Niemeijer et.al. | 2406.17473 | null |
2024-06-25 | Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | Jintao Yan et.al. | 2406.17470 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | null |
2024-06-25 | Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Peng Huang et.al. | 2406.17338 | null |
2024-06-24 | Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings | Andrea Posada et.al. | 2406.16611 | link |
2024-06-24 | Improving robustness to corruptions with multiplicative weight perturbations | Trung Trinh et.al. | 2406.16540 | null |
2024-06-24 | UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | Alvaro Lopez Pellicer et.al. | 2406.16501 | null |
2024-06-24 | Improving Quaternion Neural Networks with Quaternionic Activation Functions | Johannes Pöppelbaum et.al. | 2406.16481 | null |
2024-06-24 | Learning in Wilson-Cowan model for metapopulation | Raffaele Marino et.al. | 2406.16453 | link |
2024-06-24 | Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Sai Ganesh et.al. | 2406.16383 | null |
2024-06-24 | Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia et.al. | 2406.16293 | null |
2024-06-23 | Jacobian Descent for Multi-Objective Optimization | Pierre Quinton et.al. | 2406.16232 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | Alvaro Lopez Pellcier et.al. | 2406.15921 | null |
2024-06-21 | Retrieval Augmented Zero-Shot Text Classification | Tassallah Abdullahi et.al. | 2406.15241 | null |
2024-06-21 | DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Yingying Fang et.al. | 2406.15182 | null |
2024-06-21 | This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Kerol Djoumessi et.al. | 2406.15168 | link |
2024-06-21 | Hierarchical thematic classification of major conference proceedings | Arsentii Kuzmin et.al. | 2406.14983 | null |
2024-06-21 | Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Minjong Cheon et.al. | 2406.14916 | link |
2024-06-21 | MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning | Jiali Cheng et.al. | 2406.14796 | null |
2024-06-20 | Depth |
Parker Seegmiller et.al. | 2406.14695 | null |
2024-06-20 | Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning | Niccolò Marini et.al. | 2406.14351 | null |
2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | null |
2024-06-20 | Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization | Tanapat Ratchatorn et.al. | 2406.14329 | null |
2024-06-20 | Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | Mohamed Fadhlallah Guerri et.al. | 2406.14120 | null |
2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | null |
2024-06-20 | Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | Tim Tsz-Kit Lau et.al. | 2406.13936 | null |
2024-06-19 | WATT: Weight Average Test-Time Adaption of CLIP | David Osowiechi et.al. | 2406.13875 | link |
2024-06-19 | CNN Based Flank Predictor for Quadruped Animal Species | Vanessa Suessle et.al. | 2406.13588 | null |
2024-06-19 | Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations | Manjunath Mulimani et.al. | 2406.13386 | null |
2024-06-18 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | link |
2024-06-18 | Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Nikolas Koutsoubis et.al. | 2406.12815 | link |
2024-06-18 | Online Anchor-based Training for Image Classification Tasks | Maria Tzelepi et.al. | 2406.12662 | null |
2024-06-18 | Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation | Branislav Pecher et.al. | 2406.12471 | null |
2024-06-18 | GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Haoze Wu et.al. | 2406.12375 | null |
2024-06-18 | What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering | Federico Errica et.al. | 2406.12334 | null |
2024-06-18 | Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification | Zehui Liao et.al. | 2406.12293 | null |
2024-06-18 | Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics | Hyojin Kim et.al. | 2406.12258 | null |
2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | null |
2024-06-17 | Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation | Hamidreza Rouzegar et.al. | 2406.12114 | link |
2024-06-17 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Lei Zhu et.al. | 2406.11837 | link |
2024-06-17 | PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification | Magdalena Trędowicz et.al. | 2406.11443 | null |
2024-06-17 | Cross-domain Open-world Discovery | Shuo Wen et.al. | 2406.11422 | link |
2024-06-17 | BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models | Xuefeng Hu et.al. | 2406.11309 | null |
2024-06-17 | An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers | Ashim Gupta et.al. | 2406.11307 | null |
2024-06-17 | Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng et.al. | 2406.11115 | null |
2024-06-16 | Fine-grained Classes and How to Find Them | Matej Grcić et.al. | 2406.11070 | link |
2024-06-16 | Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality | Liwei Che et.al. | 2406.11048 | null |
2024-06-16 | Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval | Rohan Chavan et.al. | 2406.11029 | link |
2024-06-16 | Universal Cross-Lingual Text Classification | Riya Savant et.al. | 2406.11028 | null |
2024-06-14 | UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner | Dongchao Yang et.al. | 2406.10056 | null |
2024-06-14 | Comparison of fine-tuning strategies for transfer learning in medical image classification | Ana Davila et.al. | 2406.10050 | null |
2024-06-14 | Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last | Guy Hacohen et.al. | 2406.09935 | null |
2024-06-13 | MirrorCheck: Efficient Adversarial Defense for Vision-Language Models | Samar Fares et.al. | 2406.09250 | null |
2024-06-13 | Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder et.al. | 2406.09206 | null |
2024-06-13 | Large-Scale Evaluation of Open-Set Image Classification Techniques | Halil Bisgin et.al. | 2406.09112 | link |
2024-06-13 | LaCoOT: Layer Collapse through Optimal Transport | Victor Quétu et.al. | 2406.08933 | null |
2024-06-13 | The Penalized Inverse Probability Measure for Conformal Classification | Paul Melki et.al. | 2406.08884 | null |
2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | link |
2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | null |
2024-06-12 | Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification | Martin Juan José Bucher et.al. | 2406.08660 | null |
2024-06-12 | Intelligent Multi-View Test Time Augmentation | Efe Ozturk et.al. | 2406.08593 | null |
2024-06-12 | Transformation-Dependent Adversarial Attacks | Yaoteng Tan et.al. | 2406.08443 | null |
2024-06-12 | AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | Yitao Xu et.al. | 2406.08298 | null |
2024-06-12 | DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jordy Van Landeghem et.al. | 2406.08226 | null |
2024-06-12 | Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor | Yongjie Si et.al. | 2406.08122 | null |
2024-06-12 | Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network | Yanxiong Li et.al. | 2406.08119 | null |
2024-06-12 | A |
Lixian Zhang et.al. | 2406.08079 | null |
2024-06-12 | Adversarial Evasion Attack Efficiency against Large Language Models | João Vitorino et.al. | 2406.08050 | null |
2024-06-12 | Accurate Explanation Model for Image Classifiers using Class Association Embedding | Ruitao Xie et.al. | 2406.07961 | link |
2024-06-12 | Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection | Jie Feng et.al. | 2406.07949 | null |
2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | link |
2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456 | link |
2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels | Shuqi Zhu et.al. | 2406.07151 | link |
2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | null |
2024-06-11 | DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification | Jiamu Sheng et.al. | 2406.07050 | null |
2024-06-11 | Fairness-Aware Meta-Learning via Nash Bargaining | Yi Zeng et.al. | 2406.07029 | null |
2024-06-11 | Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models | Zhenyi Lu et.al. | 2406.07001 | link |
2024-06-11 | Scaling up masked audio encoder learning for general audio classification | Heinrich Dinkel et.al. | 2406.06992 | null |
2024-06-10 | Multi-Objective Neural Architecture Search for In-Memory Computing | Md Hasibul Amin et.al. | 2406.06746 | null |
2024-06-10 | Robust Latent Representation Tuning for Image-text Classification | Hao Sun et.al. | 2406.06048 | null |
2024-06-09 | Contrastive Learning from Synthetic Audio Doppelgangers | Manuel Cherep et.al. | 2406.05923 | null |
2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | link |
2024-06-09 | Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification | Yuxin Hong et.al. | 2406.05677 | null |
2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | link |
2024-06-08 | Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification | Yunhe Gao et.al. | 2406.05596 | null |
2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
2024-06-07 | A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification | Christian Giannetti et.al. | 2406.05096 | null |
2024-06-07 | Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations | Benjamin Fresz et.al. | 2406.05068 | link |
2024-06-07 | REP: Resource-Efficient Prompting for On-device Continual Learning | Sungho Jeon et.al. | 2406.04772 | null |
2024-06-07 | AICoderEval: Improving AI Domain Code Generation of Large Language Models | Yinghui Xia et.al. | 2406.04712 | null |
2024-06-07 | Cooperative Meta-Learning with Gradient Augmentation | Jongyun Shin et.al. | 2406.04639 | link |
2024-06-06 | OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference | Dujian Ding et.al. | 2406.04508 | null |
2024-06-06 | Can Language Models Use Forecasting Strategies? | Sarah Pratt et.al. | 2406.04446 | null |
2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | link |
2024-06-07 | BEADs: Bias Evaluation Across Domains | Shaina Raza et.al. | 2406.04220 | null |
2024-06-06 | What Do Language Models Learn in Context? The Structured Task Hypothesis | Jiaoda Li et.al. | 2406.04216 | null |
2024-06-06 | Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness | Lars Hillebrand et.al. | 2406.04156 | link |
2024-06-07 | ReDistill: Residual Encoded Distillation for Peak Memory Reduction | Fang Chen et.al. | 2406.03744 | null |
2024-06-06 | LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification | Chun Liu et.al. | 2406.03725 | link |
2024-06-05 | Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review | Sonia Bbouzidi et.al. | 2406.03478 | null |
2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | null |
2024-06-05 | Audio Mamba: Bidirectional State Space Model for Audio Representation Learning | Mehmet Hamza Erol et.al. | 2406.03344 | link |
2024-06-05 | FusionBench: A Comprehensive Benchmark of Deep Model Fusion | Anke Tang et.al. | 2406.03280 | null |
2024-06-05 | VWise: A novel benchmark for evaluating scene classification for vehicular applications | Pedro Azevedo et.al. | 2406.03273 | null |
2024-06-05 | Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Erik Landolsi et.al. | 2406.03146 | link |
2024-06-05 | Exploiting LMM-based knowledge for image classification tasks | Maria Tzelepi et.al. | 2406.03071 | null |
2024-06-04 | Randomized Geometric Algebra Methods for Convex Neural Networks | Yifei Wang et.al. | 2406.02806 | null |
2024-06-04 | DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | Chi-Jui Chang et.al. | 2406.02468 | null |
2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
2024-06-04 | Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification | Kwokho Ng et.al. | 2406.02229 | null |
2024-06-03 | Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) | Zane Durante et.al. | 2406.01662 | link |
2024-06-03 | CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations | Franz Motzkus et.al. | 2406.01649 | null |
2024-06-03 | Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | Yuncong Zuo et.al. | 2406.01439 | null |
2024-06-03 | Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | Firas Khader et.al. | 2406.01314 | null |
2024-06-03 | Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE | Jiaxu Liu et.al. | 2406.01282 | null |
2024-06-04 | MultiMax: Sparse and Multi-Modal Attention Learning | Yuxuan Zhou et.al. | 2406.01189 | link |
2024-06-03 | Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling | Wrick Talukdar et.al. | 2406.01096 | null |
2024-05-31 | You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | Zhen Qin et.al. | 2405.21022 | null |
2024-05-31 | Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study | Pallavi Mitra et.al. | 2405.20876 | null |
2024-05-31 | Improving Generalization and Convergence by Enhancing Implicit Regularization | Mingze Wang et.al. | 2405.20763 | null |
2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | null |
2024-05-31 | Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space | Yukai Zhang et.al. | 2405.20685 | null |
2024-05-31 | GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification | Hansang Lee et.al. | 2405.20650 | null |
2024-05-31 | ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos | Krishanu Maity et.al. | 2405.20628 | null |
2024-05-30 | Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation | Louis L. Chen et.al. | 2405.20531 | null |
2024-05-30 | DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Haoxing Chen et.al. | 2405.19707 | link |
2024-05-30 | A Novel Approach for Automated Design Information Mining from Issue Logs | Jiuang Zhao et.al. | 2405.19623 | null |
2024-05-29 | I Bet You Did Not Mean That: Testing Semantic Importance via Betting | Jacopo Teneggi et.al. | 2405.19146 | link |
2024-05-29 | Verifiably Robust Conformal Prediction | Linus Jeary et.al. | 2405.18942 | null |
2024-05-29 | Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks | Futa Waseda et.al. | 2405.18770 | null |
2024-05-29 | GIST: Greedy Independent Set Thresholding for Diverse Data Summarization | Matthew Fahrbach et.al. | 2405.18754 | null |
2024-05-29 | LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification | Renyi Qu et.al. | 2405.18672 | null |
2024-05-28 | Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap | Abrar Fahim et.al. | 2405.18570 | null |
2024-05-28 | Why are Visually-Grounded Language Models Bad at Image Classification? | Yuhui Zhang et.al. | 2405.18415 | link |
2024-05-28 | MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution | Wenzhuo Liu et.al. | 2405.18240 | null |
2024-05-28 | Confidence-aware multi-modality learning for eye disease screening | Ke Zou et.al. | 2405.18167 | link |
2024-05-28 | 4-bit Shampoo for Memory-Efficient Network Training | Sike Wang et.al. | 2405.18144 | null |
2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | null |
2024-05-27 | WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average | Louis Fournier et.al. | 2405.17517 | null |
2024-05-27 | Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators | Yunian Pan et.al. | 2405.17370 | null |
2024-05-27 | On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao et.al. | 2405.17264 | null |
2024-05-27 | Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Shujun Yang et.al. | 2405.17110 | link |
2024-05-26 | Demystify Mamba in Vision: A Linear Attention Perspective | Dongchen Han et.al. | 2405.16605 | null |
2024-05-26 | AdaFisher: Adaptive Second Order Optimization via Fisher Information | Damien Martins Gomes et.al. | 2405.16397 | null |
2024-05-25 | ModelLock: Locking Your Model With a Spell | Yifeng Gao et.al. | 2405.16285 | null |
2024-05-25 | Accelerating Transformers with Spectrum-Preserving Token Merging | Hoai-Chau Tran et.al. | 2405.16148 | null |
2024-05-25 | Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | Mingli Zhu et.al. | 2405.16134 | null |
2024-05-24 | Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images | Yiran Luo et.al. | 2405.15961 | null |
2024-05-24 | A Neurosymbolic Framework for Bias Correction in CNNs | Parth Padalkar et.al. | 2405.15886 | null |
2024-05-24 | What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | Abdelrahman Abdelhamed et.al. | 2405.15668 | null |
2024-05-24 | Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning | Wenhan Chang et.al. | 2405.15662 | null |
2024-05-24 | Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables | James Hinns et.al. | 2405.15661 | null |
2024-05-24 | Harnessing Increased Client Participation with Cohort-Parallel Federated Learning | Akash Dhasade et.al. | 2405.15644 | null |
2024-05-24 | Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2405.15405 | null |
2024-05-24 | CLIP model is an Efficient Online Lifelong Learner | Leyuan Wang et.al. | 2405.15155 | null |
2024-05-24 | OptLLM: Optimal Assignment of Queries to Large Language Models | Yueyue Liu et.al. | 2405.15130 | null |
2024-05-23 | A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-time Adaptation for Vision-Language Models | Mario Döbler et.al. | 2405.14977 | link |
2024-05-23 | Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron | Can Cui1 et.al. | 2405.14851 | null |
2024-05-23 | Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property | Yuya Yoshikawa et.al. | 2405.14522 | null |
2024-05-23 | SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification | Zuoyong Li et.al. | 2405.14506 | null |
2024-05-23 | Scalable Visual State Space Model with Fractal Scanning | Lv Tang et.al. | 2405.14480 | null |
2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | null |
2024-05-23 | Boosting Robustness by Clipping Gradients in Distributed Learning | Youssef Allouah et.al. | 2405.14432 | null |
2024-05-23 | Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators | Changze Lv et.al. | 2405.14362 | null |
2024-05-23 | Simple Hamiltonian dynamics is a powerful quantum processing resource | Akitada Sakurai et.al. | 2405.14245 | null |
2024-05-23 | ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks | T. Y. S. S Santosh et.al. | 2405.14211 | null |
2024-05-22 | Just rotate it! Uncertainty estimation in closed-source models via multiple queries | Konstantinos Pitas et.al. | 2405.13864 | null |
2024-05-21 | Decentralized Federated Learning Over Imperfect Communication Channels | Weicai Li et.al. | 2405.12894 | null |
2024-05-21 | Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | Omar Hamed et.al. | 2405.12705 | null |
2024-05-21 | Exploration of Masked and Causal Language Modelling for Text Generation | Nicolo Micheletti et.al. | 2405.12630 | null |
2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
2024-05-20 | Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models | Nida Nasir et.al. | 2405.12126 | null |
2024-05-20 | Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Weilian Zhou et.al. | 2405.12003 | link |
2024-05-20 | A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers | Tom Roth et.al. | 2405.11904 | null |
2024-05-21 | A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Eduard Poesina et.al. | 2405.11877 | link |
2024-05-20 | SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Siavash Shams et.al. | 2405.11831 | link |
2024-05-20 | Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques | Siva Rajesh Kasa et.al. | 2405.11775 | null |
2024-05-19 | SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Jialong Guo et.al. | 2405.11582 | link |
2024-05-19 | Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Manan Shah et.al. | 2405.11574 | link |
2024-05-19 | An Invisible Backdoor Attack Based On Semantic Feature | Yangming Chen et.al. | 2405.11551 | null |
2024-05-19 | Verification technology for finger vein biometric | George Kumi Kyeremeh et.al. | 2405.11540 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | Jie Zhu et.al. | 2405.10542 | link |
2024-05-17 | Smart Expert System: Large Language Models as Text Classifiers | Zhiqiang Wang et.al. | 2405.10523 | link |
2024-05-16 | Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge | Florian Schmid et.al. | 2405.10018 | null |
2024-05-16 | ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset | Johannes Rückert et.al. | 2405.10004 | link |
2024-05-15 | Improving Label Error Detection and Elimination with Uncertainty Quantification | Johannes Jakubik et.al. | 2405.09602 | null |
2024-05-15 | Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck | Hongru Li et.al. | 2405.09514 | null |
2024-05-15 | Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy | Feng Wang et.al. | 2405.09014 | link |
2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
2024-05-14 | Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling | Gregory Holste et.al. | 2405.08780 | null |
2024-05-14 | FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings | Nancy Hada et.al. | 2405.08776 | null |
2024-05-14 | The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | Carmela Calabrese et.al. | 2405.08695 | null |
2024-05-14 | Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis | Qingpeng Kong et.al. | 2405.08681 | link |
2024-05-14 | Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning | Alain Riou et.al. | 2405.08679 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | null |
2024-05-13 | Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp | Rachel Hong et.al. | 2405.08209 | link |
2024-05-14 | MambaOut: Do We Really Need Mamba for Vision? | Weihao Yu et.al. | 2405.07992 | link |
2024-05-13 | Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics | Haoyang Zheng et.al. | 2405.07839 | link |
2024-05-13 | Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent | Michael Kohler et.al. | 2405.07619 | null |
2024-05-13 | On-device Online Learning and Semantic Management of TinyML Systems | Haoyu Ren et.al. | 2405.07601 | null |
2024-05-13 | GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation | Andrey V. Galichin et.al. | 2405.07562 | null |
2024-05-13 | Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents | Juri Grosjean et.al. | 2405.07513 | null |
2024-05-13 | MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks | Haijiang Tian et.al. | 2405.07411 | null |
2024-05-12 | Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Fatema Tuj Johora Faria et.al. | 2405.07338 | null |
2024-05-12 | Differentiable Model Scaling using Differentiable Topk | Kai Liu et.al. | 2405.07194 | null |
2024-05-11 | A framework of text-dependent speaker verification for chinese numerical string corpus | Litong Zheng et.al. | 2405.07029 | null |
2024-05-10 | Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | Yaoqin Ye et.al. | 2405.06468 | null |
2024-05-10 | Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data | Rongyu Zhang et.al. | 2405.06413 | null |
2024-05-10 | SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora | Faisal Qarah et.al. | 2405.06239 | null |
2024-05-09 | Deep Multi-Task Learning for Malware Image Classification | Ahmed Bensaoud et.al. | 2405.05906 | null |
2024-05-09 | Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing | Matthew Squires et.al. | 2405.05795 | null |
2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base | Shuling Li et.al. | 2405.05738 | null |
2024-05-09 | Using Machine Translation to Augment Multilingual Classification | Adam King et.al. | 2405.05478 | null |
2024-05-08 | AFEN: Respiratory Disease Classification using Ensemble Learning | Rahul Nadkarni et.al. | 2405.05467 | null |
2024-05-08 | XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Peiqin Lin et.al. | 2405.05116 | link |
2024-05-08 | Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution | Shuo Shao et.al. | 2405.04825 | null |
2024-05-07 | Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification | Mukaffi Bin Moin et.al. | 2405.04610 | link |
2024-05-07 | Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs | Antonio Bikić et.al. | 2405.04386 | null |
2024-05-07 | Semi-Supervised Disease Classification based on Limited Medical Image Data | Yan Zhang et.al. | 2405.04295 | null |
2024-05-07 | DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects | Da Fu et.al. | 2405.04093 | null |
2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
2024-05-07 | VMambaCC: A Visual State Space Model for Crowd Counting | Hao-Yuan Ma et.al. | 2405.03978 | null |
2024-05-06 | On Adversarial Examples for Text Classification by Perturbing Latent Representations | Korn Sooksatra et.al. | 2405.03789 | null |
2024-05-06 | CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | Sankalp Sinha et.al. | 2405.03660 | null |
2024-05-06 | Deep Space Separable Distillation for Lightweight Acoustic Scene Classification | ShuQi Ye et.al. | 2405.03567 | null |
2024-05-06 | Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing | Han Liu et.al. | 2405.03565 | null |
2024-05-06 | A Lightweight Neural Architecture Search Model for Medical Image Classification | Lunchen Xie et.al. | 2405.03462 | null |
2024-05-06 | Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification | Matteo Bianchi et.al. | 2405.03301 | null |
2024-05-06 | TED: Accelerate Model Training by Internal Generalization | Jinying Xiao et.al. | 2405.03228 | null |
2024-05-06 | Advancing Multimodal Medical Capabilities of Gemini | Lin Yang et.al. | 2405.03162 | null |
2024-05-05 | A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) | Lingyao Li et.al. | 2405.03066 | null |
2024-05-05 | Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | Ziqi Gao et.al. | 2405.03003 | null |
2024-05-04 | MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi et.al. | 2405.02771 | null |
2024-05-03 | Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification | Siqi Yin et.al. | 2405.02155 | null |
2024-05-03 | The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification | Minh Duc Bui et.al. | 2405.02010 | null |
2024-05-03 | Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts | Felicia Riethmüller et.al. | 2405.01904 | null |
2024-05-02 | PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions | Xun Jiao et.al. | 2405.01741 | null |
2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | link |
2024-05-02 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et.al. | 2405.01699 | null |
2024-05-02 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey | Rokas Gipiškis et.al. | 2405.01636 | null |
2024-05-02 | Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models | Nishad Singhi et.al. | 2405.01531 | null |
2024-05-03 | Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks | Mikkel Jordahn et.al. | 2405.01196 | null |
2024-05-02 | Uncertainty-aware self-training with expectation maximization basis transformation | Zijia Wang et.al. | 2405.01175 | null |
2024-05-02 | Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2405.01095 | null |
2024-05-02 | Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation | Tianyi Chen et.al. | 2405.01041 | null |
2024-05-02 | Benchmarking Representations for Speech, Music, and Acoustic Events | Moreno La Quatra et.al. | 2405.00934 | link |
2024-05-01 | Digital-analog quantum convolutional neural networks for image classification | Anton Simen et.al. | 2405.00548 | null |
2024-05-03 | BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine | Mingchen Li et.al. | 2405.00465 | null |
2024-05-01 | Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol | Konstantinos Apostolidis et.al. | 2405.00384 | null |
2024-05-01 | Data Augmentation Policy Search for Long-Term Forecasting | Liran Nochumsohn et.al. | 2405.00319 | null |
2024-04-30 | Let's Focus: Focused Backdoor Attack against Federated Transfer Learning | Marco Arazzi et.al. | 2404.19420 | null |
2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
2024-04-30 | Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair | Jeonghoon Park et.al. | 2404.19250 | null |
2024-04-29 | Spectral-Spatial Mamba for Hyperspectral Image Classification | Lingbo Huang et.al. | 2404.18401 | null |
2024-04-28 | TextGram: Towards a better domain-adaptive pretraining | Sharayu Hiwarkhedkar et.al. | 2404.18228 | null |
2024-04-28 | L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Saloni Mittal et.al. | 2404.18216 | link |
2024-04-28 | S |
Guanchun Wang et.al. | 2404.18213 | null |
2024-04-27 | Implicit Generative Prior for Bayesian Neural Networks | Yijia Liu et.al. | 2404.18008 | link |
2024-04-27 | Towards Privacy-Preserving Audio Classification Systems | Bhawana Chhaglani et.al. | 2404.18002 | null |
2024-04-27 | A Method of Moments Embedding Constraint and its Application to Semi-Supervised Learning | Michael Majurski et.al. | 2404.17978 | null |
2024-04-27 | Spatial, Temporal, and Geometric Fusion for Remote Sensing Images | Hessah Albanwan et.al. | 2404.17851 | null |
2024-04-27 | Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification | Chao Yi et.al. | 2404.17753 | link |
2024-04-26 | SPLICE -- Streamlining Digital Pathology Image Processing | Areej Alsaafin et.al. | 2404.17704 | null |
2024-04-26 | SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes | Georgia Baltsou et.al. | 2404.17255 | null |
2024-04-25 | Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer | Jianyu Zheng et.al. | 2404.16627 | link |
2024-04-25 | IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks | Zitong Huang et.al. | 2404.16331 | null |
2024-04-25 | Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis | Akshatha Mohan et.al. | 2404.16268 | link |
2024-04-24 | MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models | Grace Guo et.al. | 2404.16174 | null |
2024-04-24 | MoDE: CLIP Data Experts via Clustering | Jiawei Ma et.al. | 2404.16030 | link |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-24 | Vision Transformer-based Adversarial Domain Adaptation | Yahan Li et.al. | 2404.15817 | link |
2024-04-24 | Rethinking Model Prototyping through the MedMNIST+ Dataset Collection | Sebastian Doerrich et.al. | 2404.15786 | null |
2024-04-24 | Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning | Zuheng Kang et.al. | 2404.15704 | null |
2024-04-24 | Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification | Liang Qu et.al. | 2404.15585 | null |
2024-04-23 | An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | Yangchen Pan et.al. | 2404.15518 | null |
2024-04-23 | Deep multi-prototype capsule networks | Saeid Abbassi et.al. | 2404.15445 | null |
2024-04-23 | A review of deep learning-based information fusion techniques for multimodal medical image classification | Yihao Li et.al. | 2404.15022 | null |
2024-04-23 | Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case | Muhammad Asif Auyb et.al. | 2404.14977 | null |
2024-04-23 | Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14955 | link |
2024-04-23 | Pyramid Hierarchical Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14945 | link |
2024-04-23 | Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14944 | link |
2024-04-23 | CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models | Teodor Chiaburu et.al. | 2404.14830 | link |
2024-04-22 | WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models | Ronald Xie et.al. | 2404.14567 | null |
2024-04-22 | CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective | Wencheng Zhu et.al. | 2404.14109 | null |
2024-04-21 | EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder | Hasanul Mahmud et.al. | 2404.13770 | null |
2024-04-21 | PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure | Feiqi Cao et.al. | 2404.13645 | link |
2024-04-21 | I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning | Songlin Dong et.al. | 2404.13576 | null |
2024-04-21 | IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models | Tao Feng et.al. | 2404.13504 | null |
2024-04-20 | Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing | Yuang Liu et.al. | 2404.13434 | null |
2024-04-20 | Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge | Khuyagbaatar Batsuren et.al. | 2404.13292 | link |
2024-04-20 | 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification | Shyam Varahagiri et.al. | 2404.13252 | link |
2024-04-19 | On-board classification of underwater images using hybrid classical-quantum CNN based method | Sreeraj Rajan Warrier et.al. | 2404.13130 | null |
2024-04-19 | Next Generation Loss Function for Image Classification | Shakhnaz Akhmedova et.al. | 2404.12948 | null |
2024-04-19 | A Hybrid Generative and Discriminative PointNet on Unordered Point Sets | Yang Ye et.al. | 2404.12925 | null |
2024-04-19 | Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment | Danqing Ma et.al. | 2404.12634 | null |
2024-04-18 | When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Asaf Yehudai et.al. | 2404.12365 | null |
2024-04-18 | Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Jin Gao et.al. | 2404.12210 | link |
2024-04-18 | Concept Induction using LLMs: a user experiment for assessment | Adrita Barua et.al. | 2404.11875 | null |
2024-04-17 | Pretraining Billion-scale Geospatial Foundational Models on Frontier | Aristeidis Tsaris et.al. | 2404.11706 | null |
2024-04-17 | AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Meng Jiang et.al. | 2404.11449 | null |
2024-04-17 | Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured | Hanlin Mo et.al. | 2404.11309 | null |
2024-04-17 | A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene | Wenbo Zhang et.al. | 2404.11249 | null |
2024-04-17 | A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation | Bin Zhang et.al. | 2404.11132 | null |
2024-04-17 | Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification | Pierre Lepagnol et.al. | 2404.11122 | null |
2024-04-18 | Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification | Mohammad Shiri et.al. | 2404.11052 | null |
2024-04-17 | InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification | Qi Han et.al. | 2404.11003 | link |
2024-04-16 | Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng et.al. | 2404.10877 | null |
2024-04-16 | Vocabulary-free Image Classification and Semantic Segmentation | Alessandro Conti et.al. | 2404.10864 | link |
2024-04-16 | Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification Tasks | Mohsen Hami et.al. | 2404.10664 | null |
2024-04-16 | Tree Bandits for Generative Bayes | Sean O'Hagan et.al. | 2404.10436 | null |
2024-04-16 | AudioProtoPNet: An interpretable deep learning model for bird sound classification | René Heinrich et.al. | 2404.10420 | null |
2024-04-16 | Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport | Eduardo Fernandes Montesuma et.al. | 2404.10261 | null |
2024-04-15 | Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection | Lisang Zhou et.al. | 2404.10026 | null |
2024-04-15 | Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models | Hyeonggeun Yun et.al. | 2404.09828 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-15 | Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model | Masahito Toba et.al. | 2404.09585 | null |
2024-04-14 | Breast Cancer Image Classification Method Based on Deep Transfer Learning | Weimin Wang et.al. | 2404.09226 | null |
2024-04-14 | Coreset Selection for Object Detection | Hojun Lee et.al. | 2404.09161 | null |
2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | null |
2024-04-13 | Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification | Denis Huseljic et.al. | 2404.08981 | link |
2024-04-13 | PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification | Zhenwei Wang et.al. | 2404.08915 | null |
2024-04-12 | VertAttack: Taking advantage of Text Classifiers' horizontal vision | Jonathan Rusert et.al. | 2404.08538 | null |
2024-04-12 | SpectralMamba: Efficient Mamba for Hyperspectral Image Classification | Jing Yao et.al. | 2404.08489 | null |
2024-04-12 | OTTER: Improving Zero-Shot Classification via Optimal Transport | Changho Shin et.al. | 2404.08461 | null |
2024-04-12 | A Survey of Neural Network Robustness Assessment in Image Recognition | Jie Wang et.al. | 2404.08285 | null |
2024-04-12 | Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example | MingXuan Xiao et.al. | 2404.08279 | null |
2024-04-11 | HGRN2: Gated Linear RNNs with State Expansion | Zhen Qin et.al. | 2404.07904 | link |
2024-04-11 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification | Ricardo Pereira et.al. | 2404.07739 | null |
2024-04-11 | Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification | Lucas Dedieu et.al. | 2404.07605 | link |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-11 | Interactive Prompt Debugging with Sequence Salience | Ian Tenney et.al. | 2404.07498 | null |
2024-04-11 | Privacy preserving layer partitioning for Deep Neural Network models | Kishore Rajasekar et.al. | 2404.07437 | null |
2024-04-11 | CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models | Sheng Wang et.al. | 2404.07424 | null |
2024-04-11 | Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling | Sourajit Saha et.al. | 2404.07410 | null |
2024-04-10 | Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations | Ofir Shifman et.al. | 2404.07153 | null |
2024-04-10 | Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization | Michael Kohler et.al. | 2404.07128 | null |
2024-04-10 | Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach | Anam Hashmi et.al. | 2404.06941 | null |
2024-04-10 | Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark | Marina Ceccon et.al. | 2404.06859 | null |
2024-04-10 | Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution | Brandon Morgan et.al. | 2404.06679 | null |
2024-04-09 | Variational Stochastic Gradient Descent for Deep Neural Networks | Haotian Chen et.al. | 2404.06549 | link |
2024-04-09 | On adversarial training and the 1 Nearest Neighbor classifier | Amir Hagai et.al. | 2404.06313 | link |
2024-04-09 | Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models | David Kurzendörfer et.al. | 2404.06309 | link |
2024-04-09 | Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Ming-Kun Xie et.al. | 2404.06287 | null |
2024-04-09 | Quantum Circuit |
Yuka Hashimoto et.al. | 2404.06218 | null |
2024-04-09 | VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection | Li-Ming Zhan et.al. | 2404.06217 | link |
2024-04-09 | Symmetry-guided gradient descent for quantum neural networks | Kaiming Bian et.al. | 2404.06108 | null |
2024-04-10 | Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures | Ching-Kai Lin et.al. | 2404.06080 | null |
2024-04-08 | Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images | Michael Deutges et.al. | 2404.05584 | null |
2024-04-08 | On the Convergence of Continual Learning with Adaptive Methods | Seungyub Han et.al. | 2404.05555 | null |
2024-04-08 | Multi-Task Learning for Features Extraction in Financial Annual Reports | Syrielle Montariol et.al. | 2404.05281 | link |
2024-04-08 | Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy | Giang Nguyen et.al. | 2404.05238 | null |
2024-04-08 | iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection | Nan Zhou et.al. | 2404.05207 | null |
2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159 | null |
2024-04-07 | PairAug: What Can Augmented Image-Text Pairs Do for Radiology? | Yutong Xie et.al. | 2404.04960 | link |
2024-04-07 | GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets | Dongjing Shan et.al. | 2404.04924 | null |
2024-04-06 | Focused Active Learning for Histopathological Image Classification | Arne Schmidt et.al. | 2404.04663 | null |
2024-04-06 | Trustless Audits without Revealing Data or Models | Suppakit Waiwitlikhit et.al. | 2404.04500 | null |
2024-04-05 | Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism | Trilokesh Ranjan Sarkar et.al. | 2404.04245 | null |
2024-04-05 | Noisy Label Processing for Classification: A Survey | Mengting Li et.al. | 2404.04159 | null |
2024-04-05 | Learning Correlation Structures for Vision Transformers | Manjin Kim et.al. | 2404.03924 | null |
2024-04-05 | LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification | Judy X Yang et.al. | 2404.03883 | null |
2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
2024-04-04 | Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks | Lei Zhang et.al. | 2404.03340 | null |
2024-04-04 | Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning | Andrei Semenov et.al. | 2404.03323 | link |
2024-04-04 | FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification | Xu Wang et.al. | 2404.03225 | null |
2024-04-03 | Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales | Lucas E. Resck et.al. | 2404.03098 | link |
2024-04-03 | Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds | Kamalika Chaudhuri et.al. | 2404.02866 | link |
2024-04-03 | FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Ziyang Wang et.al. | 2404.02772 | link |
2024-04-03 | Adversarial Attacks and Dimensionality in Text Classifiers | Nandish Chattopadhyay et.al. | 2404.02660 | null |
2024-04-04 | Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging | Keqiang Fan et.al. | 2404.02656 | null |
2024-04-03 | Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations | Emilio Villa-Cueva et.al. | 2404.02452 | link |
2024-04-03 | A Novel Approach to Breast Cancer Histopathological Image Classification Using Cross-Colour Space Feature Fusion and Quantum-Classical Stack Ensemble Method | Sambit Mallick et.al. | 2404.02447 | null |
2024-04-03 | Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data | Parth Patwa et.al. | 2404.02422 | null |
2024-04-02 | Smooth Deep Saliency | Rudolf Herdt et.al. | 2404.02282 | null |
2024-04-02 | Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models | Matthew Kowal et.al. | 2404.02233 | null |
2024-04-02 | ImageNot: A contrast with ImageNet preserves model rankings | Olawale Salaudeen et.al. | 2404.02112 | null |
2024-04-02 | Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows | Grace Guo et.al. | 2404.02081 | null |
2024-04-02 | Ukrainian Texts Classification: Exploration of Cross-lingual Knowledge Transfer Approaches | Daryna Dementieva et.al. | 2404.02043 | null |
2024-04-02 | CAM-Based Methods Can See through Walls | Magamed Taimeskhanov et.al. | 2404.01964 | link |
2024-04-02 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | null |
2024-04-02 | A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification | Quanwei Liu et.al. | 2404.01673 | null |
2024-04-01 | Can Biases in ImageNet Models Explain Generalization? | Paul Gavrikov et.al. | 2404.01509 | link |
2024-04-01 | Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification | Zuyu Xu et.al. | 2404.01359 | null |
2024-04-01 | Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Boran Han et.al. | 2404.01260 | link |
2024-04-01 | Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models | Amir Faghihi et.al. | 2404.01160 | null |
2024-03-29 | Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations | Jaisidh Singh et.al. | 2403.20312 | link |
2024-03-29 | MCNet: A crowd denstity estimation network based on integrating multiscale attention module | Qiang Guo et.al. | 2403.20173 | null |
2024-03-29 | Segmentation, Classification and Interpretation of Breast Cancer Medical Images using Human-in-the-Loop Machine Learning | David Vázquez-Lema et.al. | 2403.20112 | null |
2024-03-29 | Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion | Juhwan Choi et.al. | 2403.20015 | null |
2024-03-29 | Diverse Feature Learning by Self-distillation and Reset | Sejik Park et.al. | 2403.19941 | null |
2024-03-29 | Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification | Jianfeng Cai et.al. | 2403.19902 | link |
2024-03-28 | X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization | Anna Kukleva et.al. | 2403.19811 | link |
2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
2024-03-28 | The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation | Ozgu Goksu et.al. | 2403.19579 | null |
2024-03-28 | Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach | Wei Dong et.al. | 2403.19067 | link |
2024-03-27 | Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data | Yuting Guo et.al. | 2403.19031 | null |
2024-03-27 | Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning | Soumyendu Sarkar et.al. | 2403.18985 | null |
2024-03-27 | The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision | Andreas Müller et.al. | 2403.18587 | link |
2024-03-27 | Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks | Tian Ye et.al. | 2403.18318 | null |
2024-03-27 | Multi-scale Unified Network for Image Classification | Wenzhuo Liu et.al. | 2403.18294 | null |
2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921 | link |
2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | null |
2024-03-26 | PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Chenhongyi Yang et.al. | 2403.17695 | link |
2024-03-26 | Language Models for Text Classification: Is In-Context Learning Enough? | Aleksandra Edwards et.al. | 2403.17661 | null |
2024-03-26 | Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image Classification | Eva Pachetti et.al. | 2403.17530 | null |
2024-03-26 | HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification | He Zhu et.al. | 2403.17307 | link |
2024-03-25 | Histogram Layers for Neural Engineered Features | Joshua Peeples et.al. | 2403.17176 | link |
2024-03-25 | Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships | Rangel Daroya et.al. | 2403.17173 | link |
2024-03-25 | CipherFormer: Efficient Transformer Private Inference with Low Round Complexity | Weize Wang et.al. | 2403.16860 | null |
2024-03-25 | Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer | Dominik Müller et.al. | 2403.16695 | null |
2024-03-25 | DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks | Dominik Müller et.al. | 2403.16678 | link |
2024-03-25 | LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification | Liu Junhua et.al. | 2403.16504 | null |
2024-03-24 | On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition | Igor Sokolov et.al. | 2403.16230 | null |
2024-03-24 | Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis | Shaojie Li et.al. | 2403.16212 | null |
2024-03-24 | Multi-Task Learning with Multi-Task Optimization | Lu Bai et.al. | 2403.16162 | null |
2024-03-24 | CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data | Shreya Sharma et.al. | 2403.15974 | link |
2024-03-23 | A Deep Learning Architectures for Kidney Disease Classification | Muhammad Shoaib Farooq et.al. | 2403.15895 | null |
2024-03-23 | VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding | Phong Nguyen-Thuan Do et.al. | 2403.15882 | null |
2024-03-23 | VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Lanfeng Zhong et.al. | 2403.15836 | null |
2024-03-22 | Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Sofia Casarin et.al. | 2403.15194 | null |
2024-03-22 | Image Classification with Rotation-Invariant Variational Quantum Circuits | Paul San Sebastian et.al. | 2403.15031 | null |
2024-03-22 | Extracting Human Attention through Crowdsourced Patch Labeling | Minsuk Chang et.al. | 2403.15013 | null |
2024-03-22 | Clean-image Backdoor Attacks | Dazhong Rong et.al. | 2403.15010 | null |
2024-03-22 | ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding | Novendra Setyawan et.al. | 2403.15004 | null |
2024-03-22 | MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection | Sadiya Sayara Chowdhury Puspo et.al. | 2403.14989 | null |
2024-03-21 | Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention | Ethan N. Evans et.al. | 2403.14753 | null |
2024-03-21 | Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images | Tom Burgert et.al. | 2403.14547 | null |
2024-03-21 | Multi-Level Explanations for Generative Language Models | Lucas Monteiro Paes et.al. | 2403.14459 | null |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-21 | LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | Masato Fujitake et.al. | 2403.14252 | null |
2024-03-21 | Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations | Xun Lin et.al. | 2403.14250 | null |
2024-03-21 | Improving Image Classification Accuracy through Complementary Intra-Class and Inter-Class Mixup | Ye Xu et.al. | 2403.14137 | link |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797 | null |
2024-03-20 | Leveraging feature communication in federated learning for remote sensing image classification | Anh-Kiet Duong et.al. | 2403.13575 | null |
2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | link |
2024-03-20 | Building Optimal Neural Architectures using Interpretable Knowledge | Keith G. Mills et.al. | 2403.13293 | link |
2024-03-19 | LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images | Jing Zhang et.al. | 2403.13171 | null |
2024-03-19 | Improved EATFormer: A Vision Transformer for Medical Image Classification | Yulong Shisu et.al. | 2403.13167 | null |
2024-03-19 | SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification | Yuexi Du et.al. | 2403.13148 | link |
2024-03-19 | Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs | Raphael Norman-Tenazas et.al. | 2403.13105 | null |
2024-03-19 | Investigating Text Shortening Strategy in BERT: Truncation vs Summarization | Mirza Alim Mutasodirin et.al. | 2403.12799 | link |
2024-03-18 | Posterior Uncertainty Quantification in Neural Networks using Data Augmentation | Luhuan Wu et.al. | 2403.12729 | null |
2024-03-19 | SEVEN: Pruning Transformer Model by Reserving Sentinels | Jinying Xiao et.al. | 2403.12688 | link |
2024-03-19 | Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service | Mirza Alim Mutasodirin et.al. | 2403.12563 | null |
2024-03-19 | Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification | Yi Lin et.al. | 2403.12537 | null |
2024-03-19 | CrossTune: Black-Box Few-Shot Classification with Label Enhancement | Danqing Luo et.al. | 2403.12468 | null |
2024-03-18 | Generalizing deep learning models for medical image classification | Matta Sarah et.al. | 2403.12167 | null |
2024-03-19 | Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | K. P. Santoso et.al. | 2403.12009 | null |
2024-03-18 | High-energy physics image classification: A Survey of Jet Applications | Hamza Kheddar et.al. | 2403.11934 | null |
2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675 | null |
2024-03-18 | Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2403.11530 | link |
2024-03-18 | Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting | Mingkui Tan et.al. | 2403.11491 | null |
2024-03-17 | Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability | Haiyang Shi et.al. | 2403.11331 | null |
2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297 | null |
2024-03-17 | Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation | Silvia Corbara et.al. | 2403.11265 | null |
2024-03-17 | Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification | Shahabedin Nabavi et.al. | 2403.11226 | null |
2024-03-16 | Forward Learning of Graph Neural Networks | Namyong Park et.al. | 2403.11004 | null |
2024-03-16 | Understanding Robustness of Visual State Space Models for Image Classification | Chengbin Du et.al. | 2403.10935 | null |
2024-03-16 | Automatic location detection based on deep learning | Anjali Karangiya et.al. | 2403.10912 | null |
2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635 | link |
2024-03-14 | XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization | Yequan Bie et.al. | 2403.09410 | null |
2024-03-14 | ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization | Aleksandr Matsun et.al. | 2403.09400 | null |
2024-03-14 | A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification | Sheng-Yao Wu et.al. | 2403.09318 | null |
2024-03-14 | CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification | Yiming Ma et.al. | 2403.09281 | null |
2024-03-14 | Are Vision Language Models Texture or Shape Biased and Can We Steer Them? | Paul Gavrikov et.al. | 2403.09193 | null |
2024-03-14 | Randomized Principal Component Analysis for Hyperspectral Image Classification | Mustafa Ustuner et.al. | 2403.09117 | null |
2024-03-14 | CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification | Hyunkyung Han et.al. | 2403.09108 | link |
2024-03-14 | The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Qinyu Zhao et.al. | 2403.09037 | link |
2024-03-13 | PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning | Qifeng Zhou et.al. | 2403.08967 | null |
2024-03-13 | DAM: Dynamic Adapter Merging for Continual Video QA Learning | Feng Cheng et.al. | 2403.08755 | link |
2024-03-13 | Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification | Yuxing Han et.al. | 2403.08580 | null |
2024-03-13 | HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers | Francesco Dibitonto et.al. | 2403.08536 | link |
2024-03-13 | Pig aggression classification using CNN, Transformers and Recurrent Networks | Junior Silva Souza et.al. | 2403.08528 | null |
2024-03-13 | Reduced Jeffries-Matusita distance: A Novel Loss Function to Improve Generalization Performance of Deep Classification Models | Mohammad Lashkari et.al. | 2403.08408 | null |
2024-03-13 | Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification | Shuhan Li et.al. | 2403.08407 | null |
2024-03-13 | Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks | Khondoker Murad Hossain et.al. | 2403.08208 | null |
2024-03-13 | Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks | Fuzhi Wu et.al. | 2403.08157 | link |
2024-03-12 | Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection | Tharindu Kumarage et.al. | 2403.08035 | null |
2024-03-13 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721 | link |
2024-03-12 | FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification | Yijin Huang et.al. | 2403.07576 | null |
2024-03-12 | Backdoor Attack with Mode Mixture Latent Modification | Hongwei Zhang et.al. | 2403.07463 | null |
2024-03-12 | In-context learning enables multimodal large language models to classify cancer pathology images | Dyke Ferber et.al. | 2403.07407 | null |
2024-03-12 | Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning | Mark D. McDonnell et.al. | 2403.07356 | null |
2024-03-12 | How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance | Hongkang Li et.al. | 2403.07310 | null |
2024-03-12 | A Bayesian Approach to OOD Robustness in Image Classification | Prakhar Kaushik et.al. | 2403.07277 | null |
2024-03-11 | LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations | Mohammad Alkhalefi et.al. | 2403.06813 | null |
2024-03-11 | Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification | Shuai Li et.al. | 2403.06798 | null |
2024-03-11 | Leveraging Internal Representations of Model for Magnetic Image Classification | Adarsh N L et.al. | 2403.06797 | null |
2024-03-11 | Shortcut Learning in Medical Image Segmentation | Manxi Lin et.al. | 2403.06748 | null |
2024-03-11 | Active Generation for Image Classification | Tao Huang et.al. | 2403.06517 | null |
2024-03-11 | Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu et.al. | 2403.06414 | null |
2024-03-11 | 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification | Manish Chandra et.al. | 2403.06402 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Bayesian Random Semantic Data Augmentation for Medical Image Classification | Yaoyao Zhu et.al. | 2403.06138 | link |
2024-03-10 | Universal Debiased Editing for Fair Medical Image Classification | Ruinan Jin et.al. | 2403.06104 | null |
2024-03-08 | Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets | Lorenzo Brigato et.al. | 2403.05532 | null |
2024-03-08 | Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation | Yu Han et.al. | 2403.05388 | null |
2024-03-08 | The Impact of Quantization on the Robustness of Transformer-based Text Classifiers | Seyed Parsa Neshaei et.al. | 2403.05365 | null |
2024-03-08 | Multiple Instance Learning with random sampling for Whole Slide Image Classification | H. Keshvarikhojasteh et.al. | 2403.05351 | null |
2024-03-08 | Learning Expressive And Generalizable Motion Features For Face Forgery Detection | Jingyi Zhang et.al. | 2403.05172 | null |
2024-03-08 | Defending Against Unforeseen Failure Modes with Latent Adversarial Training | Stephen Casper et.al. | 2403.05030 | link |
2024-03-07 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-07 | T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers | Mariano V. Ntrougkas et.al. | 2403.04523 | null |
2024-03-07 | Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging | Dovile Juodelyte et.al. | 2403.04484 | link |
2024-03-07 | Advancing Biomedical Text Mining with Community Challenges | Hui Zong et.al. | 2403.04261 | null |
2024-03-07 | Scalable On-Chip Optical Linear Processing Unit Using a Single Thin-Film Lithium Niobate Ring Modulator | Zhaoang Deng et.al. | 2403.04216 | null |
2024-03-07 | Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models | Evelyn Mannix et.al. | 2403.04125 | null |
2024-03-07 | Privacy-preserving Fine-tuning of Large Language Models through Flatness | Tiejin Chen et.al. | 2403.04124 | null |
2024-03-06 | MedMamba: Vision Mamba for Medical Image Classification | Yubiao Yue et.al. | 2403.03849 | link |
2024-03-06 | On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder | Tingxu Han et.al. | 2403.03846 | link |
2024-03-06 | RADIA -- Radio Advertisement Detection with Intelligent Analytics | Jorge Álvarez et.al. | 2403.03538 | null |
2024-03-06 | Inverse-Free Fast Natural Gradient Descent Method for Deep Learning | Xinwei Ou et.al. | 2403.03473 | null |
2024-03-06 | Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN | Biswadeep Chakraborty et.al. | 2403.03409 | null |
2024-03-05 | RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules | Miaomiao Li et.al. | 2403.02932 | link |
2024-03-05 | Demonstrating Mutual Reinforcement Effect through Information Flow | Chengguang Gan et.al. | 2403.02902 | null |
2024-03-05 | Quantum Mixed-State Self-Attention Network | Fu Chen et.al. | 2403.02871 | null |
2024-03-05 | SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix | Gayathri C et.al. | 2403.02833 | null |
2024-03-05 | SGD with Partial Hessian for Deep Neural Networks Optimization | Ying Sun et.al. | 2403.02681 | link |
2024-03-05 | G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth | Juan Zou et.al. | 2403.02667 | null |
2024-03-05 | Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad | Sayantan Choudhury et.al. | 2403.02648 | link |
2024-03-05 | Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use | Imad Eddine Toubal et.al. | 2403.02626 | null |
2024-03-04 | When do Convolutional Neural Networks Stop Learning? | Sahan Ahmad et.al. | 2403.02473 | link |
2024-03-04 | NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function | Abdullah Nazhat Abdullah et.al. | 2403.02411 | link |
2024-03-02 | Can a Confident Prior Replace a Cold Posterior? | Martin Marek et.al. | 2403.01272 | link |
2024-03-02 | Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery | Pedro H. V. Valois et.al. | 2403.01183 | null |
2024-03-02 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu et.al. | 2403.01156 | null |
2024-03-02 | ELA: Efficient Local Attention for Deep Convolutional Neural Networks | Wei Xu et.al. | 2403.01123 | null |
2024-03-01 | Margin Discrepancy-based Adversarial Training for Multi-Domain Text Classification | Yuan Wu et.al. | 2403.00888 | null |
2024-03-01 | Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment | Margherita Martorana et.al. | 2403.00884 | null |
2024-03-01 | SURE: SUrvey REcipes for building reliable and robust deep networks | Yuting Li et.al. | 2403.00543 | link |
2024-03-01 | Invariant Test-Time Adaptation for Vision-Language Model Generalization | Huan Ma et.al. | 2403.00376 | null |
2024-02-29 | TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision | Yunyi Zhang et.al. | 2403.00165 | null |
2024-02-29 | Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance | Huakun Shen et.al. | 2402.19401 | null |
2024-02-29 | Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification | Delfina Sol Martinez Pandiani et.al. | 2402.19339 | null |
2024-02-29 | Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Hao Li et.al. | 2402.19326 | null |
2024-02-29 | Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation | Fahimeh Hosseini Noohdani et.al. | 2402.18919 | null |
2024-02-29 | Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification | Zihan Wang et.al. | 2402.18825 | link |
2024-02-28 | Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance | Indu Panigrahi et.al. | 2402.18742 | link |
2024-02-28 | Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains | Hafiz Tiomoko Ali et.al. | 2402.18614 | null |
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-29 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | Santosh Thoduka et.al. | 2402.18319 | null |
2024-02-28 | Classes Are Not Equal: An Empirical Study on Image Recognition Fairness | Jiequan Cui et.al. | 2402.18133 | null |
2024-02-27 | Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Yiwei Lu et.al. | 2402.17710 | null |
2024-02-27 | SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Mohammed Q. Alkhatib et.al. | 2402.17672 | link |
2024-02-27 | Predict the Next Word: | Evgenia Ilia et.al. | 2402.17527 | null |
2024-02-27 | Scaling Supervised Local Learning with Augmented Auxiliary Networks | Chenxiang Ma et.al. | 2402.17318 | link |
2024-02-26 | Offline Writer Identification Using Convolutional Neural Network Activation Features | Vincent Christlein et.al. | 2402.17029 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework | Dahang Wan et.al. | 2506.14696 | null |
2025-06-17 | VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning | Md. Adnanul Islam et.al. | 2506.14629 | null |
2025-06-17 | GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments | Farha Abdul Wasay et.al. | 2506.14513 | null |
2025-06-17 | Comparison of Two Methods for Stationary Incident Detection Based on Background Image | Deepak Ghimire et.al. | 2506.14256 | null |
2025-06-16 | A Point Cloud Completion Approach for the Grasping of Partially Occluded Objects and Its Applications in Robotic Strawberry Harvesting | Ali Abouzeid et.al. | 2506.14066 | null |
2025-06-16 | FindMeIfYouCan: Bringing Open Set metrics to |
Daniel Montoya et.al. | 2506.14008 | null |
2025-06-16 | How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection | Kaiyuan Tan et.al. | 2506.13722 | null |
2025-06-17 | Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos | Dipayan Biswas et.al. | 2506.13657 | link |
2025-06-16 | UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data | Vasiliki Balaska et.al. | 2506.13505 | null |
2025-06-16 | Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection | Shenqi Wang et.al. | 2506.13440 | null |
2025-06-16 | Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots | Jaehong Oh et.al. | 2506.13149 | null |
2025-06-15 | MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection | Yuxiang Wang et.al. | 2506.12697 | null |
2025-06-14 | UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers | Yuantao Wang et.al. | 2506.12324 | null |
2025-06-14 | MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction | Worasit Sangjan et.al. | 2506.12295 | link |
2025-06-13 | Vision-based Lifting of 2D Object Detections for Automated Driving | Hendrik Königshof et.al. | 2506.11839 | null |
2025-06-13 | Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds | Filippo Bragato et.al. | 2506.11804 | null |
2025-06-13 | GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers | Guang Liang et.al. | 2506.11784 | null |
2025-06-13 | On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving | Pedram MohajerAnsari et.al. | 2506.11472 | null |
2025-06-12 | Teaching in adverse scenes: a statistically feedback-driven threshold and mask adjustment teacher-student framework for object detection in UAV images under adverse scenes | Hongyu Chen et.al. | 2506.11175 | null |
2025-06-12 | Discrete Lorenz Attractors in 3D Sinusoidal Maps | Sishu Shankar Muni et.al. | 2506.10788 | null |
2025-06-12 | Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement | Yuqi Shen et.al. | 2506.10712 | null |
2025-06-12 | Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Xinyuan Liu et.al. | 2506.10601 | link |
2025-06-12 | Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun Wang et.al. | 2506.10573 | null |
2025-06-12 | FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion | Tianpei Zhang et.al. | 2506.10366 | link |
2025-06-11 | DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos | Rajeev Yasarla et.al. | 2506.10242 | null |
2025-06-11 | CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects | Tao Liu et.al. | 2506.09897 | null |
2025-06-11 | 3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection | Yi Zhang et.al. | 2506.09541 | null |
2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
2025-06-10 | Efficient Edge Deployment of Quantized YOLOv4-Tiny for Aerial Emergency Object Detection on Raspberry Pi 5 | Sindhu Boddu et.al. | 2506.09300 | null |
2025-06-10 | Lightweight Object Detection Using Quantized YOLOv4-Tiny for Emergency Response in Aerial Imagery | Sindhu Boddu et.al. | 2506.09299 | null |
2025-06-10 | WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras | Yangjie Cui et.al. | 2506.09098 | null |
2025-06-11 | Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Xuanchi Ren et.al. | 2506.09042 | null |
2025-06-10 | ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Amirreza Rouhi et.al. | 2506.08968 | null |
2025-06-10 | Data Augmentation For Small Object using Fast AutoAugment | DaeEun Yoon et.al. | 2506.08956 | null |
2025-06-11 | Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting | Keyi Liu et.al. | 2506.08777 | null |
2025-06-09 | CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing | Zubin Bhuyan et.al. | 2506.07885 | null |
2025-06-09 | SAM2Auto: Auto Annotation Using FLASH | Arash Rocky et.al. | 2506.07850 | null |
2025-06-09 | Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods | Beining Xu et.al. | 2506.07779 | null |
2025-06-09 | SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding | Xuemei Chen et.al. | 2506.07737 | null |
2025-06-09 | Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study | Xiaomeng Zhu et.al. | 2506.07539 | null |
2025-06-09 | SpatialLM: Training Large Language Models for Structured Indoor Modeling | Yongsen Mao et.al. | 2506.07491 | null |
2025-06-09 | Happiness Finder: Exploring the Role of AI in Enhancing Well-Being During Four-Leaf Clover Searches | Anna Yokokubo et.al. | 2506.07393 | null |
2025-06-09 | Multiple Object Stitching for Unsupervised Representation Learning | Chengchao Shen et.al. | 2506.07364 | link |
2025-06-09 | CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms | Satvik Praveen et.al. | 2506.07357 | null |
2025-06-08 | UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning | Weiqi Yan et.al. | 2506.07087 | null |
2025-06-06 | Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection | Yu Li et.al. | 2506.05872 | null |
2025-06-06 | Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | Fanhu Zeng et.al. | 2506.05709 | null |
2025-06-06 | Integer Binary-Range Alignment Neuron for Spiking Neural Networks | Binghao Ye et.al. | 2506.05679 | null |
2025-06-05 | CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media | Tianyi Huang et.al. | 2506.05107 | null |
2025-06-05 | Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training | Aneesh Deogan et.al. | 2506.05092 | null |
2025-06-06 | Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets | Mikhail Kennerley et.al. | 2506.04737 | null |
2025-06-05 | Gen-n-Val: Agentic Image Data Generation and Validation | Jing-En Huang et.al. | 2506.04676 | null |
2025-06-05 | VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection | Wuyang Li et.al. | 2506.04623 | null |
2025-06-04 | FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices | Shizhong Han et.al. | 2506.04499 | null |
2025-06-04 | Neural Object Detection for 4D STEM: High-Throughput Sub-Pixel Electron Diffraction Pattern Recognition | Arda Genc et.al. | 2506.04477 | null |
2025-06-04 | Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Boyong He et.al. | 2506.04211 | link |
2025-06-04 | FSHNet: Fully Sparse Hybrid Network for 3D Object Detection | Shuai Liu et.al. | 2506.03714 | null |
2025-06-04 | How PARTs assemble into wholes: Learning the relative composition of images | Melika Ayoughi et.al. | 2506.03682 | null |
2025-06-05 | MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection | Xiaochun Lei et.al. | 2506.03654 | null |
2025-06-04 | DiagNet: Detecting Objects using Diagonal Constraints on Adjacency Matrix of Graph Neural Network | Chong Hyun Lee et.al. | 2506.03571 | null |
2025-06-03 | SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports | Dheeraj Khanna et.al. | 2506.03335 | null |
2025-06-03 | Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Weiqing Xiao et.al. | 2506.03134 | null |
2025-06-03 | HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring | Zhixiong Su et.al. | 2506.02959 | null |
2025-06-03 | Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection | Yechi Ma et.al. | 2506.02914 | null |
2025-06-03 | A Dynamic Transformer Network for Vehicle Detection | Chunwei Tian et.al. | 2506.02765 | null |
2025-06-03 | Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning | Negin Baghbanzadeh et.al. | 2506.02738 | null |
2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
2025-06-03 | Sight Guide: A Wearable Assistive Perception and Navigation System for the Vision Assistance Race in the Cybathlon 2024 | Patrick Pfreundschuh et.al. | 2506.02676 | null |
2025-06-03 | Probabilistic Online Event Downsampling | Andreu Girbau-Xalabarder et.al. | 2506.02547 | null |
2025-06-03 | Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning | Kunyu Wang et.al. | 2506.02462 | null |
2025-06-03 | Auto-Labeling Data for Object Detection | Brent A. Griffin et.al. | 2506.02359 | null |
2025-05-30 | Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors | Andrea Pedrotti et.al. | 2505.24523 | null |
2025-05-30 | Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing | Anasse Boutayeb et.al. | 2505.24489 | null |
2025-05-30 | Leadership Assessment in Pediatric Intensive Care Unit Team Training | Liangyang Ouyang et.al. | 2505.24389 | null |
2025-05-30 | D2AF: A Dual-Driven Annotation and Filtering Framework for Visual Grounding | Yichi Zhang et.al. | 2505.24372 | null |
2025-05-29 | Conformal Object Detection by Sequential Risk Control | Léo Andéol et.al. | 2505.24038 | null |
2025-05-29 | Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | Justin Lazarow et.al. | 2505.23756 | null |
2025-05-29 | Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Qiang Wang et.al. | 2505.23744 | null |
2025-05-29 | FMG-Det: Foundation Model Guided Robust Object Detection | Darryl Hannan et.al. | 2505.23726 | null |
2025-05-29 | CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection | Woojin Shin et.al. | 2505.23317 | null |
2025-05-30 | WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver Assistance Systems | Hao Wu et.al. | 2505.23201 | null |
2025-05-29 | Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images | Sungjune Park et.al. | 2505.23193 | null |
2025-05-29 | DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes | Sungjune Park et.al. | 2505.23179 | null |
2025-05-29 | The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector | Aixuan Li et.al. | 2505.22499 | null |
2025-05-28 | VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond | Noora Al-Emadi et.al. | 2505.22353 | link |
2025-05-28 | Task-Driven Implicit Representations for Automated Design of LiDAR Systems | Nikhil Behari et.al. | 2505.22344 | null |
2025-05-29 | YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | Mingzhuang Wang et.al. | 2505.22250 | null |
2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
2025-05-28 | Learning A Robust RGB-Thermal Detector for Extreme Modality Imbalance | Chao Tian et.al. | 2505.22154 | null |
2025-05-28 | Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming | Menghui Zhang et.al. | 2505.22011 | null |
2025-05-28 | Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection | Guiping Cao et.al. | 2505.21868 | null |
2025-05-27 | Object Concepts Emerge from Motion | Haoqian Liang et.al. | 2505.21635 | null |
2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
2025-05-27 | Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations | Yue Li Du et.al. | 2505.21454 | null |
2025-05-27 | YOLO-SPCI: Enhancing Remote Sensing Object Detection via Selective-Perspective-Class Integration | Xinyuan Wang et.al. | 2505.21370 | null |
2025-05-27 | Assured Autonomy with Neuro-Symbolic Perception | R. Spencer Hallyburton et.al. | 2505.21322 | null |
2025-05-27 | Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing | Dehao Wang et.al. | 2505.21049 | null |
2025-05-27 | Facial Attribute Based Text Guided Face Anonymization | Mustafa İzzet Muştu et.al. | 2505.21002 | null |
2025-05-27 | YOLO-FireAD: Efficient Fire Detection via Attention-Guided Inverted Residual Learning and Dual-Pooling Feature Preservation | Weichao Pan et.al. | 2505.20884 | null |
2025-05-27 | Open-Det: An Efficient Learning Framework for Open-Ended Detection | Guiping Cao et.al. | 2505.20639 | null |
2025-05-27 | Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | Peter Robicheaux et.al. | 2505.20612 | null |
2025-05-26 | From Data to Modeling: Fully Open-vocabulary Scene Graph Generation | Zuyao Chen et.al. | 2505.20106 | null |
2025-05-26 | Target Tracking via LiDAR-RADAR Sensor Fusion for Autonomous Racing | Marcello Cellina et.al. | 2505.20043 | null |
2025-05-26 | Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement | Afrah Shaahid et.al. | 2505.19895 | null |
2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
2025-05-26 | Neural nanophotonic object detector with ultra-wide field-of-view | Ji Chen et.al. | 2505.19379 | null |
2025-05-25 | What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study | Bhanuka Gamage et.al. | 2505.19325 | null |
2025-05-25 | VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion | Zhiwei Lin et.al. | 2505.18986 | null |
2025-05-24 | Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling | Hojun Son et.al. | 2505.18446 | null |
2025-05-23 | Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms | Gefei Shen et.al. | 2505.18302 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732 | null |
2025-05-23 | Adaptive Semantic Token Communication for Transformer-based Edge Inference | Alessio Devoto et.al. | 2505.17604 | null |
2025-05-23 | Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras | Masataka Kobayashi et.al. | 2505.17582 | null |
2025-05-23 | OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics | Jiangning Zhu et.al. | 2505.17473 | null |
2025-05-23 | Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds | Hao Jing et.al. | 2505.17442 | null |
2025-05-23 | Optimizing YOLOv8 for Parking Space Detection: Comparative Analysis of Custom YOLOv8 Architecture | Apar Pokhrel et.al. | 2505.17364 | null |
2025-05-22 | Extending Dataset Pruning to Object Detection: A Variance-based Approach | Ryota Yagi et.al. | 2505.17245 | null |
2025-05-22 | Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining | Shangquan Sun et.al. | 2505.16811 | null |
2025-05-22 | Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP | Alya Zouzou et.al. | 2505.16740 | link |
2025-05-22 | CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving | Huitong Yang et.al. | 2505.16524 | null |
2025-05-22 | MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection | Yichen Li et.al. | 2505.16442 | null |
2025-05-22 | AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | Yuanhao Huang et.al. | 2505.16402 | link |
2025-05-22 | Self-Classification Enhancement and Correction for Weakly Supervised Object Detection | Yufei Yin et.al. | 2505.16294 | null |
2025-05-21 | MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | Cheng Yifan et.al. | 2505.15772 | null |
2025-05-21 | The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection | Tianjiao Cao et.al. | 2505.15649 | link |
2025-05-21 | SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Iuliia Kotseruba et.al. | 2505.15628 | link |
2025-05-21 | Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network | Chang Liu et.al. | 2505.15518 | null |
2025-05-21 | Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches | Nudrat Habib et.al. | 2505.15422 | null |
2025-05-21 | RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | Naman Patel et.al. | 2505.15373 | null |
2025-05-21 | AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection | Jiatao Li et.al. | 2505.15261 | null |
2025-05-21 | Multispectral Detection Transformer with Infrared-Centric Sensor Fusion | Seongmin Hwang et.al. | 2505.15137 | null |
2025-05-20 | Colors Matter: AI-Driven Exploration of Human Feature Colors | Rama Alyoubi et.al. | 2505.14931 | link |
2025-05-20 | Language Models Optimized to Fool Detectors Still Have a Distinct Style (And How to Change It) | Rafael Rivera Soto et.al. | 2505.14608 | null |
2025-05-20 | SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | Yuyang Dong et.al. | 2505.14381 | null |
2025-05-20 | FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning | Minh Ngoc Ta et.al. | 2505.14271 | null |
2025-05-20 | Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Bin-Bin Gao et.al. | 2505.14239 | null |
2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
2025-05-20 | Automated Quality Evaluation of Cervical Cytopathology Whole Slide Images Based on Content Analysis | Lanlan Kang et.al. | 2505.13875 | null |
2025-05-20 | Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving | Jingzheng Li et.al. | 2505.13872 | null |
2025-05-20 | Domain Gating Ensemble Networks for AI-Generated Text Detection | Arihant Tripathi et.al. | 2505.13855 | null |
2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | null |
2025-05-19 | Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Xiao Wang et.al. | 2505.12908 | link |
2025-05-19 | Rethinking Features-Fused-Pyramid-Neck for Object Detection | Hulin Li et.al. | 2505.12820 | link |
2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
2025-05-19 | LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking | Martha Teiko Teye et.al. | 2505.12753 | null |
2025-05-19 | VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection | Aditya Taparia et.al. | 2505.12715 | null |
2025-05-18 | LM |
Xu Zheng et.al. | 2505.12507 | null |
2025-05-17 | EarthSynth: Generating Informative Earth Observation with Diffusion Models | Jiancheng Pan et.al. | 2505.12108 | null |
2025-05-17 | Experimental Study on Automatically Assembling Custom Catering Packages With a 3-DOF Delta Robot Using Deep Learning Methods | Reihaneh Yourdkhani et.al. | 2505.11879 | null |
2025-05-16 | Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study | Rana Poureskandar et.al. | 2505.11424 | null |
2025-05-16 | MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection | Shrutarv Awasthi et.al. | 2505.11282 | null |
2025-05-16 | M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | Chao Wang et.al. | 2505.10931 | null |
2025-05-16 | A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation | Jinke Li et.al. | 2505.10825 | null |
2025-05-15 | StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Daniel A. P. Oliveira et.al. | 2505.10292 | link |
2025-05-15 | Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | Prashant P. Shinde et.al. | 2505.10192 | null |
2025-05-15 | Application of YOLOv8 in monocular downward multiple Car Target detection | Shijie Lyu et.al. | 2505.10016 | null |
2025-05-14 | EdgeAI Drone for Autonomous Construction Site Demonstrator | Emre Girgin et.al. | 2505.09837 | link |
2025-05-14 | WhatsAI: Transforming Meta Ray-Bans into an Extensible Generative AI Platform for Accessibility | Nasif Zaman et.al. | 2505.09823 | null |
2025-05-14 | MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | Xiangyuan Peng et.al. | 2505.09422 | null |
2025-05-14 | A drone that learns to efficiently find objects in agricultural fields: from simulation to the real world | Rick van Essen et.al. | 2505.09278 | null |
2025-05-14 | DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection | Jianlin Sun et.al. | 2505.09168 | link |
2025-05-14 | Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | Lucas Choi et.al. | 2505.09139 | null |
2025-05-14 | Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance | Guoying Liang et.al. | 2505.09123 | null |
2025-05-13 | Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores | Hyunsik Na et.al. | 2505.08835 | null |
2025-05-13 | Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness | Reihaneh Mirjalili et.al. | 2505.08627 | null |
2025-05-14 | Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections | Xiao Ni et.al. | 2505.08568 | link |
2025-05-13 | MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | Saqi Hussain Kalan et.al. | 2505.08388 | null |
2025-05-13 | HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective | Yu Zhang et.al. | 2505.08231 | link |
2025-05-13 | Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | Unai Gurbindo et.al. | 2505.08228 | null |
2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
2025-05-12 | LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | Jiangling Zhang et.al. | 2505.07734 | null |
2025-05-12 | Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | Qi Xu et.al. | 2505.07715 | null |
2025-05-12 | Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs | Kamil Jeziorek et.al. | 2505.07556 | null |
2025-05-12 | Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies | Efe Bozkir et.al. | 2505.07552 | null |
2025-05-12 | DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection | Mingqian Ji et.al. | 2505.07398 | null |
2025-05-12 | Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection | Hongda Qin et.al. | 2505.07219 | link |
2025-05-11 | Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection | Zhengyang Lu et.al. | 2505.07040 | null |
2025-05-11 | VALISENS: A Validated Innovative Multi-Sensor System for Cooperative Automated Driving | Lei Wan et.al. | 2505.06980 | null |
2025-05-10 | M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Morui Zhu et.al. | 2505.06746 | null |
2025-05-10 | Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search | XiaoTong Gu et.al. | 2505.06694 | null |
2025-05-09 | Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | Anupkumar Bochare et.al. | 2505.06113 | null |
2025-05-09 | Artificial intelligence pioneers the double-strangeness factory | Yan He et.al. | 2505.05802 | null |
2025-05-09 | Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection | Zhangchi Hu et.al. | 2505.05741 | null |
2025-05-09 | DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | Ho-Joong Kim et.al. | 2505.05711 | link |
2025-05-08 | PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model | Zhang Zhang et.al. | 2505.05397 | null |
2025-05-08 | PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting | Elad Feldman et.al. | 2505.05183 | null |
2025-05-08 | Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction | Xiaowei Zhu et.al. | 2505.05084 | null |
2025-05-08 | FG-CLIP: Fine-Grained Visual and Textual Alignment | Chunyu Xie et.al. | 2505.05071 | null |
2025-05-08 | A Simple Detector with Frame Dynamics is a Strong Tracker | Chenxu Peng et.al. | 2505.04917 | null |
2025-05-08 | Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model | Navin Ranjan et.al. | 2505.04861 | null |
2025-05-07 | Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | Songsong Duan et.al. | 2505.04758 | null |
2025-05-07 | Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer | Sainath Dey et.al. | 2505.04740 | null |
2025-05-08 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-07 | Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration | Asma Baobaid et.al. | 2505.04524 | null |
2025-05-07 | Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition | Asma Baobaid et.al. | 2505.04502 | null |
2025-05-07 | DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2505.04410 | null |
2025-05-06 | LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs | Xinyuan Zhang et.al. | 2505.03460 | null |
2025-05-06 | Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks | Sun Haoxuan et.al. | 2505.03435 | null |
2025-05-06 | From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection | Guoting Wei et.al. | 2505.03334 | null |
2025-05-06 | VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis | Xinyuan Yan et.al. | 2505.03132 | null |
2025-05-05 | Sim2Real Transfer for Vision-Based Grasp Verification | Pau Amargant et.al. | 2505.03046 | link |
2025-05-05 | DPNet: Dynamic Pooling Network for Tiny Object Detection | Luqi Gong et.al. | 2505.02797 | null |
2025-05-05 | RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet | Eliraz Orfaig et.al. | 2505.02586 | null |
2025-05-05 | Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation | Hubert Padusinski et.al. | 2505.02476 | null |
2025-05-04 | Robust AI-Generated Face Detection with Imbalanced Data | Yamini Sri Krubha et.al. | 2505.02182 | link |
2025-05-04 | Transforming faces into video stories -- VideoFace2.0 | Branko Brkljač et.al. | 2505.02060 | null |
2025-05-03 | DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks | Ali Al-Bustami et.al. | 2505.01893 | link |
2025-05-03 | OODTE: A Differential Testing Engine for the ONNX Optimizer | Nikolaos Louloudakis et.al. | 2505.01892 | null |
2025-05-03 | CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | Vladimir Frants et.al. | 2505.01882 | null |
2025-05-03 | DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Haoteng Li et.al. | 2505.01857 | null |
2025-05-03 | Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability | Wenxuan Zhang et.al. | 2505.01650 | null |
2025-05-02 | Efficient Vision-based Vehicle Speed Estimation | Andrej Macko et.al. | 2505.01203 | null |
2025-05-02 | CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Boyuan Meng et.al. | 2505.00938 | null |
2025-05-01 | Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | Woong-Chan Byun et.al. | 2505.00757 | null |
2025-05-03 | Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | Muyi Bao et.al. | 2505.00630 | null |
2025-05-01 | Visual Trajectory Prediction of Vessels for Inland Navigation | Alexander Puzicha et.al. | 2505.00599 | null |
2025-05-01 | Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets | Mathis Morales et.al. | 2505.00584 | null |
2025-05-01 | X-ray illicit object detection using hybrid CNN-transformer neural network architectures | Jorgen Cani et.al. | 2505.00564 | null |
2025-05-01 | A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic | Muhammad Imran Zaman et.al. | 2505.00534 | null |
2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et.al. | 2505.00511 | null |
2025-05-01 | HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection | Esteban Rivera et.al. | 2505.00507 | null |
2025-05-01 | Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution | Luigi Sigillo et.al. | 2505.00334 | null |
2025-04-30 | V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving | Jannik Lübberstedt et.al. | 2505.00156 | null |
2025-04-30 | LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Marc Glocker et.al. | 2504.21716 | null |
2025-04-30 | Visual Text Processing: A Comprehensive Review and Unified Evaluation | Yan Shu et.al. | 2504.21682 | null |
2025-04-29 | T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection | Manikanta Varaganti et.al. | 2504.21231 | null |
2025-04-29 | FLIM-based Salient Object Detection Networks with Adaptive Decoders | Gilson Junior Soares et.al. | 2504.20872 | null |
2025-04-29 | A Survey on Event-based Optical Marker Systems | Nafiseh Jabbari Tofighi et.al. | 2504.20736 | null |
2025-04-29 | Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection | Siwei Wang et.al. | 2504.20602 | null |
2025-04-29 | Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection | Jianhong Han et.al. | 2504.20498 | null |
2025-04-28 | More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV | Kai Ye et.al. | 2504.20032 | null |
2025-04-28 | Lossy Source Coding with Focal Loss | Alex Dytso et.al. | 2504.19913 | null |
2025-04-28 | Neural network task specialization via domain constraining | Roman Malashin et.al. | 2504.19592 | null |
2025-04-28 | GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability | Sehyeong Jo et.al. | 2504.19414 | null |
2025-04-27 | Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation | Rayson Laroca et.al. | 2504.19347 | null |
2025-04-27 | ODExAI: A Comprehensive Object Detection Explainable AI Evaluation | Loc Phuc Truong Nguyen et.al. | 2504.19249 | null |
2025-04-27 | Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction | Xiaoran Xu et.al. | 2504.19086 | null |
2025-04-26 | Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving | Gharbi Khamis Alshammari et.al. | 2504.18939 | null |
2025-04-25 | Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection | Brian K. S. Isaac-Medina et.al. | 2504.18746 | null |
2025-04-25 | A Review of 3D Object Detection with Vision-Language Models | Ranjan Sapkota et.al. | 2504.18738 | null |
2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
2025-04-25 | Iterative Event-based Motion Segmentation by Variational Contrast Maximization | Ryo Yamaki et.al. | 2504.18447 | null |
2025-04-25 | A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Carlo Sgaravatti et.al. | 2504.18419 | null |
2025-04-25 | A comprehensive review of classifier probability calibration metrics | Richard Oliver Lane et.al. | 2504.18278 | null |
2025-04-25 | LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring | Raul David Dominguez Sanchez et.al. | 2504.18203 | null |
2025-04-25 | Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition | Yin Tang et.al. | 2504.18201 | null |
2025-04-25 | E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Jiahao Zhang et.al. | 2504.18158 | null |
2025-04-25 | MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View | Liugang Lu et.al. | 2504.18136 | null |
2025-04-25 | Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization | Jiayi Chen et.al. | 2504.18057 | null |
2025-04-25 | Direct sampling method to retrieve small objects from two-dimensional limited-aperture scattered field data | Won-Kwang Park et.al. | 2504.18036 | null |
2025-04-24 | DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks | Yinqi Li et.al. | 2504.17253 | link |
2025-04-24 | Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation | Phillip Y. Lee et.al. | 2504.17207 | null |
2025-04-24 | AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models | Mohammad Zarei et.al. | 2504.17179 | null |
2025-04-23 | Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection | Jens Petersen et.al. | 2504.17076 | null |
2025-04-23 | Gaussian Splatting is an Effective Data Generator for 3D Object Detection | Farhad G. Zanjani et.al. | 2504.16740 | null |
2025-04-23 | EHGCN: Hierarchical Euclidean-Hyperbolic Fusion via Motion-Aware GCN for Hybrid Event Stream Perception | Haosheng Chen et.al. | 2504.16616 | null |
2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | null |
2025-04-23 | Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection | Md Fahimuzzman Sohan et.al. | 2504.16404 | null |
2025-04-23 | Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection | Linhua Kong et.al. | 2504.16368 | null |
2025-04-22 | Vision Controlled Orthotic Hand Exoskeleton | Connor Blais et.al. | 2504.16319 | null |
2025-04-22 | Physical Intelligence et.al. | 2504.16054 | null | |
2025-04-22 | SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Manjunath D et.al. | 2504.15728 | null |
2025-04-22 | You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection | Jun Dong et.al. | 2504.15694 | null |
2025-04-22 | A Vision-Enabled Prosthetic Hand for Children with Upper Limb Disabilities | Md Abdul Baset Sarker et.al. | 2504.15654 | null |
2025-04-21 | Context Aware Grounded Teacher for Source Free Object Detection | Tajamul Ashraf et.al. | 2504.15404 | null |
2025-04-21 | SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam | Tue Vo et.al. | 2504.15252 | null |
2025-04-21 | An Efficient Aerial Image Detection with Variable Receptive Fields | Liu Wenbin et.al. | 2504.15165 | null |
2025-04-19 | Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization | Nazia Aslam et.al. | 2504.14301 | null |
2025-04-19 | Visual Consensus Prompting for Co-Salient Object Detection | Jie Wang et.al. | 2504.14254 | link |
2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
2025-04-18 | Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Yushen He et.al. | 2504.13647 | link |
2025-04-18 | DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection | Yang Zhang et.al. | 2504.13638 | null |
2025-04-18 | HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection | YangChen Zeng et.al. | 2504.13469 | null |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-17 | VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture | Long Li et.al. | 2504.13365 | null |
2025-04-17 | SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Yasin Almalioglu et.al. | 2504.13310 | null |
2025-04-17 | Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes | Andreas Lau Hansen et.al. | 2504.13297 | null |
2025-04-17 | RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity | Ranjan Sapkota et.al. | 2504.13099 | null |
2025-04-17 | Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving | Shumin Wang et.al. | 2504.12709 | null |
2025-04-18 | RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding | Hang Ji et.al. | 2504.12643 | null |
2025-04-16 | Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline | Joanne Lin et.al. | 2504.12169 | null |
2025-04-16 | RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning | Yuan Luo et.al. | 2504.12167 | null |
2025-04-16 | pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild | Jonas Myhre Schiøtt et.al. | 2504.12045 | null |
2025-04-16 | A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions | Rahima Khanam et.al. | 2504.11995 | null |
2025-04-16 | Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection | Qishun Wang et.al. | 2504.11779 | null |
2025-04-15 | Multi-level Cellular Automata for FLIM networks | Felipe Crispim Salvagnini et.al. | 2504.11406 | null |
2025-04-15 | OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution | Lucio La Cava et.al. | 2504.11369 | null |
2025-04-15 | CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection | Jincheng Kang et.al. | 2504.11305 | null |
2025-04-15 | TSAL: Few-shot Text Segmentation Based on Attribute Learning | Chenming Li et.al. | 2504.11164 | null |
2025-04-15 | Flyweight FLIM Networks for Salient Object Detection in Biomedical Images | Leonardo M. Joao et.al. | 2504.11112 | null |
2025-04-15 | S |
Yu Lin et.al. | 2504.11111 | null |
2025-04-15 | DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Hyejin Lee et.al. | 2504.11019 | null |
2025-04-16 | GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Eunsoo Im et.al. | 2504.11014 | null |
2025-04-15 | CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors | Jiahuan Long et.al. | 2504.10888 | null |
2025-04-15 | Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task | Aviral Chharia et.al. | 2504.10880 | null |
2025-04-14 | DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing | Jinyue Zhang et.al. | 2504.10278 | null |
2025-04-14 | Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection | Songze Li et.al. | 2504.10214 | null |
2025-04-14 | WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | Nguyen Ngoc Dat et.al. | 2504.10165 | null |
2025-04-14 | COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts | Jiansheng Li et.al. | 2504.10158 | null |
2025-04-14 | SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting | Dongliang Luo et.al. | 2504.09966 | null |
2025-04-14 | Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware | Muhammad Fasih Tariq et.al. | 2504.09900 | null |
2025-04-14 | Density-based Object Detection in Crowded Scenes | Chenyang Zhao et.al. | 2504.09819 | null |
2025-04-13 | Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Yao Yuan et.al. | 2504.09666 | link |
2025-04-13 | Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds | Yanze Jiang et.al. | 2504.09506 | null |
2025-04-13 | Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Yongchao Feng et.al. | 2504.09480 | null |
2025-04-11 | TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing | Neil Reichlin et.al. | 2504.08655 | null |
2025-04-11 | Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Jialu Li et.al. | 2504.08641 | null |
2025-04-10 | Enhanced Cooperative Perception Through Asynchronous Vehicle to Infrastructure Framework with Delay Mitigation for Connected and Automated Vehicles | Nithish Kumar Saravanan et.al. | 2504.08172 | null |
2025-04-10 | Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection | Meilun Zhou et.al. | 2504.08054 | null |
2025-04-10 | Detect Anything 3D in the Wild | Hanxue Zhang et.al. | 2504.07958 | null |
2025-04-11 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | null |
2025-04-10 | P2Object: Single Point Supervised Object Detection and Instance Segmentation | Pengfei Chen et.al. | 2504.07813 | null |
2025-04-10 | Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement | Daniel Torres et.al. | 2504.07810 | null |
2025-04-10 | Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network | Peng Jia et.al. | 2504.07777 | null |
2025-04-10 | Prediction of Usage Probabilities of Shopping-Mall Corridors Using Heterogeneous Graph Neural Networks | Malik M Barakathullah et.al. | 2504.07645 | null |
2025-04-10 | VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model | Haozhan Shen et.al. | 2504.07615 | link |
2025-04-10 | RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions | Youngwan Jin et.al. | 2504.07603 | null |
2025-04-10 | WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer | Huilin Yin et.al. | 2504.07441 | null |
2025-04-10 | Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction | Qingchao Jiang et.al. | 2504.07382 | link |
2025-04-09 | Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Ruoyu Chen et.al. | 2504.07060 | null |
2025-04-09 | UAV Position Estimation using a LiDAR-based 3D Object Detection Method | Uthman Olawoye et.al. | 2504.07028 | null |
2025-04-09 | Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility | Yuze Jiang et.al. | 2504.06772 | null |
2025-04-09 | Domain-Conditioned Scene Graphs for State-Grounded Task Planning | Jonas Herzog et.al. | 2504.06661 | null |
2025-04-09 | Visually Similar Pair Alignment for Robust Cross-Domain Object Detection | Onkar Krishna et.al. | 2504.06607 | null |
2025-04-08 | From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction | Vladimir Golovkin et.al. | 2504.06357 | null |
2025-04-08 | Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images | Hicham Talaoubrid et.al. | 2504.06330 | null |
2025-04-08 | Security Analysis of Thumbnail-Preserving Image Encryption and a New Framework | Dong Xie et.al. | 2504.06083 | null |
2025-04-08 | Balancing long- and short-term dynamics for the modeling of saliency in videos | Theodor Wulff et.al. | 2504.05913 | null |
2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null |
2025-04-08 | Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation | Xiangyu Zheng et.al. | 2504.05904 | null |
2025-04-08 | KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection | Xingyuan Li et.al. | 2504.05878 | null |
2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
2025-04-08 | Event-based Civil Infrastructure Visual Defect Detection: ev-CIVIL Dataset and Benchmark | Udayanga G. W. K. N. Gamage et.al. | 2504.05679 | null |
2025-04-08 | POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud | Yining Shi et.al. | 2504.05649 | null |
2025-04-08 | AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes | Zhenteng Li et.al. | 2504.05601 | null |
2025-04-07 | SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection | Bonan Ding et.al. | 2504.05170 | null |
2025-04-07 | Inland Waterway Object Detection in Multi-environment: Dataset and Approach | Shanshan Wang et.al. | 2504.04835 | null |
2025-04-07 | Playing Non-Embedded Card-Based Games with Reinforcement Learning | Tianyang Wu et.al. | 2504.04783 | null |
2025-04-07 | Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding | Zahir Alsulaimawi et.al. | 2504.04772 | null |
2025-04-07 | Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection | Zhenxing Ming et.al. | 2504.04732 | null |
2025-04-06 | Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Jiancheng Pan et.al. | 2504.04517 | link |
2025-04-06 | eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems | Shuolong Chen et.al. | 2504.04451 | link |
2025-04-05 | Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications | Brayan Monroy et.al. | 2504.04228 | null |
2025-04-05 | An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 | Farbod Younesi et.al. | 2504.04217 | null |
2025-04-05 | Learning about the Physical World through Analytic Concepts | Jianhua Sun et.al. | 2504.04170 | null |
2025-04-04 | VISTA-OCR: Towards generative and interactive end to end OCR models | Laziz Hamdi et.al. | 2504.03621 | null |
2025-04-04 | PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector | Kaidong Li et.al. | 2504.03563 | null |
2025-04-04 | ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving | Sheng Yang et.al. | 2504.03438 | null |
2025-04-04 | Infrared bubble recognition in the Milky Way and beyond using deep learning | Shimpei Nishimoto et.al. | 2504.03367 | null |
2025-04-04 | Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion | Zeyang Zheng et.al. | 2504.03171 | null |
2025-04-04 | Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning | Lucas Choi et.al. | 2504.03168 | null |
2025-04-03 | Attention-Aware Multi-View Pedestrian Tracking | Reef Alturki et.al. | 2504.03047 | null |
2025-04-03 | LiDAR-based Object Detection with Real-time Voice Specifications | Anurag Kulkarni et.al. | 2504.02920 | null |
2025-04-03 | BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation | Van Nguyen Nguyen et.al. | 2504.02812 | null |
2025-04-03 | Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Andrei Dumitriu et.al. | 2504.02558 | null |
2025-04-03 | Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Xiaofeng Han et.al. | 2504.02477 | null |
2025-04-03 | CornerPoint3D: Look at the Nearest Corner Instead of the Center | Ruixiao Zhang et.al. | 2504.02464 | null |
2025-04-03 | Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline | Peifu Liu et.al. | 2504.02416 | null |
2025-04-03 | SemiISP/SemiIE: Semi-Supervised Image Signal Processor and Image Enhancement Leveraging One-to-Many Mapping sRGB-to-RAW | Masakazu Yoshimura et.al. | 2504.02345 | null |
2025-04-03 | Improving Harmful Text Detection with Joint Retrieval and External Knowledge | Zidong Yu et.al. | 2504.02310 | null |
2025-04-03 | LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection | YiMing Yu et.al. | 2504.02280 | null |
2025-04-02 | Cat-Eye Inspired Active-Passive-Composite Aperture-Shared Sub-Terahertz Meta-Imager for Non-Interactive Concealed Object Detection | Mingshuang Hu et.al. | 2504.01473 | null |
2025-04-02 | CFMD: Dynamic Cross-layer Feature Fusion for Salient Object Detection | Jin Lian et.al. | 2504.01326 | null |
2025-04-01 | Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | Rachmad Vidya Wicaksana Putra et.al. | 2504.00957 | null |
2025-04-01 | NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds | Mahan Rafidashti et.al. | 2504.00859 | null |
2025-04-01 | AttentiveGRU: Recurrent Spatio-Temporal Modeling for Advanced Radar-Based BEV Object Detection | Loveneet Saini et.al. | 2504.00559 | null |
2025-04-01 | Archival Faces: Detection of Faces in Digitized Historical Documents | Marek Vaško et.al. | 2504.00558 | null |
2025-04-01 | High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update | Xinrun Xu et.al. | 2504.00526 | null |
2025-04-01 | Intrinsic-feature-guided 3D Object Detection | Wanjing Zhang et.al. | 2504.00382 | null |
2025-04-01 | CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection | Xin Zhang et.al. | 2504.00375 | null |
2025-03-31 | Towards Precise Action Spotting: Addressing Temporal Misalignment in Labels with Dynamic Label Assignment | Masato Tamura et.al. | 2504.00149 | null |
2025-03-31 | SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection | Chenyang Li et.al. | 2503.24389 | link |
2025-03-31 | MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing | Karim Radouane et.al. | 2503.24219 | link |
2025-03-31 | Spectral-Adaptive Modulation Networks for Visual Perception | Guhnoo Yun et.al. | 2503.23947 | null |
2025-03-31 | Reliable Traffic Monitoring Using Low-Cost Doppler Radar Units | Mishay Naidoo et.al. | 2503.23926 | null |
2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | link |
2025-03-30 | Beyond Detection: Designing AI-Resilient Assessments with Automated Feedback Tool to Foster Critical Thinking | Muhammad Sajjad Akbar et.al. | 2503.23622 | null |
2025-03-30 | Re-Aligning Language to Visual Objects with an Agentic Workflow | Yuming Chen et.al. | 2503.23508 | null |
2025-03-30 | EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Hongxiang Jiang et.al. | 2503.23330 | null |
2025-03-29 | Context in object detection: a systematic literature review | Mahtab Jamali et.al. | 2503.23249 | null |
2025-03-29 | Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection | Marc-Antoine Lavoie et.al. | 2503.23220 | null |
2025-03-28 | AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization | Martin Kišš et.al. | 2503.22526 | null |
2025-03-28 | Data Quality Matters: Quantifying Image Quality Impact on Machine Learning Performance | Christian Steinhauser et.al. | 2503.22375 | null |
2025-03-28 | ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection | Nandakishor M et.al. | 2503.22363 | null |
2025-03-28 | SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection | Shrikant Malviya et.al. | 2503.22338 | link |
2025-03-28 | Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data | Juwei Guan et.al. | 2503.22180 | null |
2025-03-28 | A Survey on Remote Sensing Foundation Models: From Vision to Multimodality | Ziyue Huang et.al. | 2503.22081 | null |
2025-03-27 | AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification | Earl Ranario et.al. | 2503.22019 | null |
2025-03-27 | FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search | Tony Tran et.al. | 2503.21999 | null |
2025-03-27 | Exponentially Weighted Instance-Aware Repeat Factor Sampling for Long-Tailed Object Detection Model Training in Unmanned Aerial Vehicles Surveillance Scenarios | Taufiq Ahmed et.al. | 2503.21893 | null |
2025-03-27 | Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Yun Zhu et.al. | 2503.21099 | link |
2025-03-26 | SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments | Tanmoy Dam et.al. | 2503.20614 | link |
2025-03-26 | Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications | Mahya Nikouei et.al. | 2503.20516 | null |
2025-03-25 | Gemini Robotics: Bringing AI into the Physical World | Gemini Robotics Team et.al. | 2503.20020 | null |
2025-03-25 | Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception | Luke Chen et.al. | 2503.20011 | null |
2025-03-25 | Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models | Ilias Stogiannidis et.al. | 2503.19707 | null |
2025-03-25 | BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction | Jan Kohút et.al. | 2503.19658 | null |
2025-03-25 | Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer | Deepti Madurai Muthu et.al. | 2503.19606 | null |
2025-03-25 | MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection | Jee Won Lee et.al. | 2503.19330 | null |
2025-03-25 | Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines | Junle Liu et.al. | 2503.19278 | null |
2025-03-24 | Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery | Sara Al-Emadi et.al. | 2503.19202 | null |
2025-03-24 | Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach | Jakob Abeßer et.al. | 2503.19161 | null |
2025-03-24 | Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control | Tohid Kargar Tasooji et.al. | 2503.19135 | null |
2025-03-24 | Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection | Moussa Kassem Sbeyti et.al. | 2503.18903 | null |
2025-03-24 | LGI-DETR: Local-Global Interaction for UAV Object Detection | Zifa Chen et.al. | 2503.18785 | null |
2025-03-25 | Frequency Dynamic Convolution for Dense Image Prediction | Linwei Chen et.al. | 2503.18783 | null |
2025-03-24 | CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection | Zhichao Sun et.al. | 2503.18430 | null |
2025-03-24 | Vision-Guided Loco-Manipulation with a Snake Robot | Adarsh Salagame et.al. | 2503.18308 | null |
2025-03-23 | Extended Visibility of Autonomous Vehicles via Optimized Cooperative Perception under Imperfect Communication | Ahmad Sarlak et.al. | 2503.18192 | null |
2025-03-22 | MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability | Paul Hill et.al. | 2503.17700 | null |
2025-03-22 | Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving | Yanan Ma et.al. | 2503.17697 | null |
2025-03-21 | Should we pre-train a decoder in contrastive learning for dense prediction tasks? | Sébastien Quetin et.al. | 2503.17526 | null |
2025-03-21 | Event-Based Crossing Dataset (EBCD) | Joey Mulé et.al. | 2503.17499 | null |
2025-03-21 | An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection | Louis Y. Kim et.al. | 2503.17285 | null |
2025-03-21 | Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection | Duanrui Yu et.al. | 2503.17175 | null |
2025-03-21 | Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-21 | R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception | Jonas Mirlach et.al. | 2503.17122 | null |
2025-03-21 | Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes | Davide Antonio Mura et.al. | 2503.17107 | null |
2025-03-21 | R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model | Boyuan Zheng et.al. | 2503.17097 | null |
2025-03-21 | Superpowering Open-Vocabulary Object Detectors for X-ray Vision | Pablo Garcia-Fernandez et.al. | 2503.17071 | null |
2025-03-21 | Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos | Yuang Feng et.al. | 2503.17050 | null |
2025-03-21 | Salient Object Detection in Traffic Scene through the TSOD10K Dataset | Yu Qiu et.al. | 2503.16910 | null |
2025-03-21 | Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision | Maoji Zheng et.al. | 2503.16811 | null |
2025-03-20 | RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles | Dawood Wasif et.al. | 2503.16251 | null |
2025-03-20 | MapGlue: Multimodal Remote Sensing Image Matching | Peihao Wu et.al. | 2503.16185 | null |
2025-03-20 | Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection | Jiangyi Wang et.al. | 2503.16125 | null |
2025-03-20 | Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution | Wanshu Fan et.al. | 2503.16056 | null |
2025-03-19 | A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition | Ritabrata Chakraborty et.al. | 2503.15639 | null |
2025-03-19 | DCA: Dividing and Conquering Amnesia in Incremental Object Detection | Aoting Zhang et.al. | 2503.15295 | null |
2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | null |
2025-03-19 | GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Zechuan Li et.al. | 2503.15211 | null |
2025-03-19 | UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework | Yang Li et.al. | 2503.15161 | null |
2025-03-19 | An Investigation of Beam Density on LiDAR Object Detection Performance | Christoph Griesbacher et.al. | 2503.15087 | null |
2025-03-19 | SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection | Haoyi Li et.al. | 2503.15044 | null |
2025-03-19 | Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark | Ying Liu et.al. | 2503.14862 | null |
2025-03-19 | State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Chuxin Wang et.al. | 2503.14493 | null |
2025-03-18 | Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images | Nobuhiko Wakai et.al. | 2503.14228 | null |
2025-03-18 | A Revisit to the Decoder for Camouflaged Object Detection | Seung Woo Ko et.al. | 2503.14035 | null |
2025-03-18 | Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator | Xi Shen et.al. | 2503.14034 | null |
2025-03-18 | LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection | Wei Lu et.al. | 2503.14012 | null |
2025-03-18 | FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene | Lili Yang et.al. | 2503.13951 | null |
2025-03-18 | Is Discretization Fusion All You Need for Collaborative Perception? | Kang Yang et.al. | 2503.13946 | null |
2025-03-18 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Barza Nisar et.al. | 2503.13914 | null |
2025-03-18 | HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object Detection | Yuhao Qiu et.al. | 2503.13906 | null |
2025-03-18 | TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection | Qiang Qi et.al. | 2503.13903 | null |
2025-03-17 | Beyond RGB: Adaptive Parallel Processing for RAW Object Detection | Shani Gamrian et.al. | 2503.13163 | null |
2025-03-17 | Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa | Babangida Sani et.al. | 2503.13101 | null |
2025-03-17 | SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Yunshuang Yuan et.al. | 2503.12982 | null |
2025-03-17 | Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation | Zhuoqun Su et.al. | 2503.12914 | null |
2025-03-16 | Point Cloud Based Scene Segmentation: A Survey | Dan Halperin et.al. | 2503.12595 | null |
2025-03-16 | GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Zilun Zhang et.al. | 2503.12490 | null |
2025-03-16 | Deepfake Detection with Optimized Hybrid Model: EAR Biometric Descriptor via Improved RCNN | Ruchika Sharma et.al. | 2503.12381 | null |
2025-03-15 | An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation | Aziz Amari et.al. | 2503.12267 | null |
2025-03-15 | Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing | Boyu Chen et.al. | 2503.12249 | null |
2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
2025-03-14 | FLASHμ: Fast Localizing And Sizing of Holographic Microparticles | Ayush Paliwal et.al. | 2503.11538 | null |
2025-03-14 | Falcon: A Remote Sensing Vision-Language Foundation Model | Kelu Yao et.al. | 2503.11070 | null |
2025-03-14 | FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection | Ming Deng et.al. | 2503.11030 | null |
2025-03-14 | Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime | Gian Antariksa et.al. | 2503.11008 | null |
2025-03-14 | Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection | Chuhan Zhang et.al. | 2503.11005 | null |
2025-03-14 | Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume | Reef Alturki et.al. | 2503.10982 | null |
2025-03-13 | The Power of One: A Single Example is All it Takes for Segmentation in VLMs | Mir Rayat Imtiaz Hossain et.al. | 2503.10779 | null |
2025-03-13 | HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer | Zhang Zhang et.al. | 2503.10777 | null |
2025-03-13 | Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection | Chaoqun Wang et.al. | 2503.10579 | null |
2025-03-13 | RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Yuwen Du et.al. | 2503.10410 | link |
2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | link |
2025-03-13 | Object detection characteristics in a learning factory environment using YOLOv8 | Toni Schneidereit et.al. | 2503.10356 | null |
2025-03-13 | TARS: Traffic-Aware Radar Scene Flow Estimation | Jialong Wu et.al. | 2503.10210 | null |
2025-03-13 | A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection | Shenghao Fu et.al. | 2503.10152 | link |
2025-03-13 | Deep Learning-Based Direct Leaf Area Estimation using Two RGBD Datasets for Model Development | Namal Jayasuriya et.al. | 2503.10129 | null |
2025-03-13 | Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection | Zihao Zhang et.al. | 2503.09968 | null |
2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
2025-03-12 | How good are deep learning methods for automated road safety analysis using video data? An experimental study | Qingwu Liu et.al. | 2503.09807 | null |
2025-03-12 | Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Katharina Prasse et.al. | 2503.09361 | null |
2025-03-12 | Fully-Synthetic Training for Visual Quality Inspection in Automotive Production | Christoph Huber et.al. | 2503.09354 | null |
2025-03-12 | DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection | Chiara Cappellino et.al. | 2503.09271 | null |
2025-03-12 | Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection | Qipeng Mei et.al. | 2503.09187 | null |
2025-03-12 | RFUAV: A Benchmark Dataset for Unmanned Aerial Vehicle Detection and Identification | Rui Shi et.al. | 2503.09033 | null |
2025-03-12 | Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | Xuzhong Hu et.al. | 2503.08992 | null |
2025-03-11 | GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection | Dušan Malić et.al. | 2503.08639 | null |
2025-03-11 | Referring to Any Person | Qing Jiang et.al. | 2503.08507 | null |
2025-03-11 | SuperCap: Multi-resolution Superpixel-based Image Captioning | Henry Senior et.al. | 2503.08496 | null |
2025-03-13 | Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Qiming Xia et.al. | 2503.08421 | null |
2025-03-11 | Embodied Crowd Counting | Runling Long et.al. | 2503.08367 | null |
2025-03-11 | Physics-based AI methodology for Material Parameter Extraction from Optical Data | M. Koumans et.al. | 2503.08183 | null |
2025-03-11 | Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | Fei Wang et.al. | 2503.08144 | null |
2025-03-11 | Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Lizhen Xu et.al. | 2503.08101 | link |
2025-03-11 | SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | Hyeongseok Son et.al. | 2503.08092 | null |
2025-03-11 | Simulating Automotive Radar with Lidar and Camera Inputs | Peili Song et.al. | 2503.08068 | null |
2025-03-10 | YOLOE: Real-Time Seeing Anything | Ao Wang et.al. | 2503.07465 | link |
2025-03-10 | HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection | Qizhi Zheng et.al. | 2503.07371 | null |
2025-03-10 | Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | Weicheng He et.al. | 2503.07330 | null |
2025-03-10 | Semantic Communications with Computer Vision Sensing for Edge Video Transmission | Yubo Peng et.al. | 2503.07252 | null |
2025-03-10 | MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction | Hung Q. Vo et.al. | 2503.07157 | null |
2025-03-10 | A Light Perspective for 3D Object Detection | Marcelo Eduardo Pederiva et.al. | 2503.07133 | null |
2025-03-10 | SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements | Haiyang Xie et.al. | 2503.07101 | null |
2025-03-10 | RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations | Ruidan Xing et.al. | 2503.07085 | null |
2025-03-10 | Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | Dong-Hee Paek et.al. | 2503.07029 | null |
2025-03-10 | Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | Wentao Wu et.al. | 2503.06948 | null |
2025-03-06 | Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems | Jooyoung Lee et.al. | 2503.04945 | null |
2025-03-06 | Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | Soumyadeep Ro et.al. | 2503.04918 | null |
2025-03-06 | Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation | David T. Hoffmann et.al. | 2503.04718 | null |
2025-03-06 | DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO | Aditya Prashant Naidu et.al. | 2503.04698 | null |
2025-03-06 | Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection | Riccardo De Monte et.al. | 2503.04688 | null |
2025-03-06 | ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Yu-Hsi Chen et.al. | 2503.04500 | null |
2025-03-06 | A lightweight model FDM-YOLO for small target improvement based on YOLOv8 | Xuerui Zhang et.al. | 2503.04452 | null |
2025-03-06 | Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks | Lukáš Gajdošech et.al. | 2503.04308 | null |
2025-03-06 | CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection | Chupeng Liu et.al. | 2503.04154 | null |
2025-03-06 | Robust Computer-Vision based Construction Site Detection for Assistive-Technology Applications | Junchi Feng et.al. | 2503.04139 | null |
2025-03-06 | Fractional Correspondence Framework in Detection Transformer | Masoumeh Zareapoor et.al. | 2503.04107 | null |
2025-03-05 | DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Zhao Yang et.al. | 2503.03689 | null |
2025-03-05 | 4D Radar Ground Truth Augmentation with LiDAR-to-4D Radar Data Synthesis | Woo-Jin Jung et.al. | 2503.03637 | null |
2025-03-05 | Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders | Kristian Kuznetsov et.al. | 2503.03601 | null |
2025-03-05 | Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case | Milin Patel et.al. | 2503.03548 | link |
2025-03-05 | AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates | Adhish Anitha Vilasan et.al. | 2503.03395 | null |
2025-03-05 | MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images | Nimra Dilawar et.al. | 2503.03370 | null |
2025-03-05 | Automated Attendee Recognition System for Large-Scale Social Events or Conference Gathering | Dhruv Motwani et.al. | 2503.03330 | null |
2025-03-05 | BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation | Hiep Truong Cong et.al. | 2503.03280 | null |
2025-03-05 | Find Matching Faces Based On Face Parameters | Setu A. Bhatt et.al. | 2503.03204 | null |
2025-03-04 | Revolutionizing Traffic Management with AI-Powered Machine Vision: A Step Toward Smart Cities | Seyed Hossein Hosseini DolatAbadi et.al. | 2503.02967 | null |
2025-03-04 | Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds? | Miao Zhang et.al. | 2503.02687 | null |
2025-03-04 | Exploring Model Quantization in GenAI-based Image Inpainting and Detection of Arable Plants | Sourav Modak et.al. | 2503.02420 | null |
2025-03-04 | Robust detection of overlapping bioacoustic sound events | Louis Mahon et.al. | 2503.02389 | null |
2025-03-04 | YOLO-PRO: Enhancing Instance-Specific Object Detection with Full-Channel Global Self-Attention | Lin Huang et.al. | 2503.02348 | null |
2025-03-04 | SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images | Gargi Panda et.al. | 2503.02270 | null |
2025-03-03 | Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection | Boyong He et.al. | 2503.02101 | null |
2025-03-03 | Uncertainty Representation in a SOTIF-Related Use Case with Dempster-Shafer Theory for LiDAR Sensor-Based Object Detection | Milin Patel et.al. | 2503.02087 | link |
2025-03-03 | Visual-RFT: Visual Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2503.01785 | link |
2025-03-03 | Enhancing Object Detection Accuracy in Underwater Sonar Images through Deep Learning-based Denoising | Ziyu Wang et.al. | 2503.01655 | null |
2025-03-03 | Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR | Muhammad Musab Ansari et.al. | 2503.01601 | null |
2025-02-28 | The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection | Rishi Mukherjee et.al. | 2502.20651 | null |
2025-02-28 | RTGen: Real-Time Generative Detection Transformer | Chi Ruan et.al. | 2502.20622 | null |
2025-02-28 | LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation | Zhefan Xu et.al. | 2502.20607 | null |
2025-02-27 | Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Mohamed Abdelsamad et.al. | 2502.20316 | null |
2025-02-27 | OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Meng Lou et.al. | 2502.20087 | link |
2025-02-27 | Night-Voyager: Consistent and Efficient Nocturnal Vision-Aided State Estimation in Object Maps | Tianxiao Gao et.al. | 2502.20054 | null |
2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | null |
2025-02-27 | BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Xin Ye et.al. | 2502.19694 | null |
2025-02-26 | Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Hoonhee Cho et.al. | 2502.19630 | null |
2025-02-26 | Is Your Paper Being Reviewed by an LLM? A New Benchmark Dataset and Approach for Detecting AI Text in Peer Review | Sungduk Yu et.al. | 2502.19614 | null |
2025-02-23 | Rewards-based image analysis in microscopy | Kamyar Barakati et.al. | 2502.18522 | null |
2025-02-25 | Multi-Perspective Data Augmentation for Few-shot Object Detection | Anh-Khoa Nguyen Vu et.al. | 2502.18195 | null |
2025-02-25 | Progressive Local Alignment for Medical Multimodal Pre-training | Huimin Yan et.al. | 2502.18047 | null |
2025-02-25 | Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads | Istiaq Ahmed Fahad et.al. | 2502.17843 | null |
2025-02-24 | Semi-Supervised Weed Detection in Vegetable Fields: In-domain and Cross-domain Experiments | Boyang Deng et.al. | 2502.17673 | null |
2025-02-24 | Experimental validation of UAV search and detection system in real wilderness environment | Stella Dumenčić et.al. | 2502.17372 | null |
2025-02-24 | LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR | Xinxin Feng et.al. | 2502.17039 | null |
2025-02-24 | Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models | Avinash Trivedi et.al. | 2502.16857 | null |
2025-02-23 | Geometry-Aware 3D Salient Object Detection Network | Chen Wang et.al. | 2502.16488 | null |
2025-02-26 | MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering | Caixiong Li et.al. | 2502.16486 | null |
2025-02-23 | Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment | Zeyu Shangguan et.al. | 2502.16469 | null |
2025-02-23 | Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Devanish N. Kamtam et.al. | 2502.16459 | null |
2025-02-22 | FeatSharp: Your Vision Model Features, Sharper | Mike Ranzinger et.al. | 2502.16025 | null |
2025-02-21 | Generative AI Framework for 3D Object Generation in Augmented Reality | Majid Behravan et.al. | 2502.15869 | null |
2025-02-21 | Machine-generated text detection prevents language model collapse | George Drayson et.al. | 2502.15654 | null |
2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null |
2025-02-21 | Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Jiangyong Yu et.al. | 2502.15488 | null |
2025-02-21 | PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments | Yueting Liu et.al. | 2502.15342 | null |
2025-02-20 | Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Richard Marcus et.al. | 2502.15076 | null |
2025-02-20 | YOLOv12: A Breakdown of the Key Architectural Features | Mujadded Al Rabbani Alif et.al. | 2502.14740 | null |
2025-02-20 | LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | Weiyi Xiong et.al. | 2502.14503 | null |
2025-02-20 | ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Tianyou Jiang et.al. | 2502.14314 | null |
2025-02-19 | PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection | Rui Zhao et.al. | 2502.14063 | link |
2025-02-19 | Image compositing is all you need for data augmentation | Ang Jia Ning Shermaine et.al. | 2502.13936 | null |
2025-02-19 | MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection | Shuyong Gao et.al. | 2502.13859 | null |
2025-02-19 | An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice | Wanke Xia et.al. | 2502.13764 | null |
2025-02-18 | Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation | Noel Ngu et.al. | 2502.13289 | null |
2025-02-18 | RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection | Jingtong Yue et.al. | 2502.13071 | null |
2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
2025-02-18 | Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training | Yuanfan Li et.al. | 2502.12734 | null |
2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
2025-02-18 | Who Writes What: Unveiling the Impact of Author Roles on AI-generated Text Detection | Jiatao Li et.al. | 2502.12611 | null |
2025-02-18 | Gaseous Object Detection | Kailai Zhou et.al. | 2502.12415 | null |
2025-02-17 | AI-generated Text Detection with a GLTR-based Approach | Lucía Yan Wu et.al. | 2502.12064 | null |
2025-02-17 | Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection | Tessa Pulli et.al. | 2502.12027 | null |
2025-02-17 | ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability | Ryuto Koike et.al. | 2502.11336 | null |
2025-02-16 | DAViMNet: SSMs-Based Domain Adaptive Object Detection | A. Enes Doruk et.al. | 2502.11178 | null |
2025-02-15 | CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | Qizhen Lan et.al. | 2502.10683 | null |
2025-02-14 | Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Wenxuan Guo et.al. | 2502.10392 | null |
2025-02-14 | Object Detection and Tracking | Md Pranto et.al. | 2502.10310 | null |
2025-02-14 | Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study | Yin-Chih Chelsea Wang et.al. | 2502.10277 | null |
2025-02-13 | Instance Segmentation of Scene Sketches Using Natural Image Priors | Mia Tang et.al. | 2502.09608 | null |
2025-02-13 | Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection | Yi Yu et.al. | 2502.09471 | link |
2025-02-13 | Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection | Yan Zhang et.al. | 2502.09311 | null |
2025-02-13 | Billet Number Recognition Based on Test-Time Adaptation | Yuan Wei et.al. | 2502.09026 | null |
2025-02-12 | Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection | Ziyue Yang et.al. | 2502.08373 | link |
2025-02-12 | Modification and Generated-Text Detection: Achieving Dual Detection Capabilities for the Outputs of LLM by Watermark | Yuhang Cai et.al. | 2502.08332 | null |
2025-02-12 | Plantation Monitoring Using Drone Images: A Dataset and Performance Review | Yashwanth Karumanchi et.al. | 2502.08233 | null |
2025-02-12 | Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation | Xiang Chen et.al. | 2502.08221 | null |
2025-02-13 | SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Zhiming Ma et.al. | 2502.08168 | null |
2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
2025-02-11 | Visual-based spatial audio generation system for multi-speaker environments | Xiaojing Liu et.al. | 2502.07538 | null |
2025-02-11 | Quantitative Analysis of Objects in Prisoner Artworks | Thea Christoffersen et.al. | 2502.07440 | null |
2025-02-11 | Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Novendra Setyawan et.al. | 2502.07417 | null |
2025-02-11 | Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems | Ai Chen et.al. | 2502.07351 | link |
2025-02-11 | SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer | Wenxi Li et.al. | 2502.07216 | null |
2025-02-11 | Dense Object Detection Based on De-homogenized Queries | Yueming Huang et.al. | 2502.07194 | null |
2025-02-11 | Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m | Zhenyue Wang et.al. | 2502.07175 | null |
2025-02-11 | A Survey on Mamba Architecture for Vision Applications | Fady Ibrahim et.al. | 2502.07161 | null |
2025-02-10 | Multimodal Search on a Line | Jared Coleman et.al. | 2502.07000 | null |
2025-02-10 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices | Akhila Matathammal et.al. | 2502.06493 | null |
2025-02-10 | PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts | Badri Vishal Kasuba et.al. | 2502.06172 | null |
2025-02-10 | Enhancing Document Key Information Localization Through Data Augmentation | Yue Dai et.al. | 2502.06132 | null |
2025-02-10 | Improved YOLOv5s model for key components detection of power transmission lines | Chen Chen et.al. | 2502.06127 | null |
2025-02-10 | A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar | Seung-Hyun Song et.al. | 2502.06114 | null |
2025-02-09 | Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery | Yuhui Zeng et.al. | 2502.05843 | null |
2025-02-08 | Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector | Qirui Wu et.al. | 2502.05540 | null |
2025-02-07 | Invizo: Arabic Handwritten Document Optical Character Recognition Solution | Alhossien Waly et.al. | 2502.05277 | null |
2025-02-07 | LP-DETR: Layer-wise Progressive Relations for Object Detection | Zhengjian Kang et.al. | 2502.05147 | null |
2025-02-07 | Counting Fish with Temporal Representations of Sonar Video | Kai Van Brunt et.al. | 2502.05129 | null |
2025-02-07 | DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection | Mingxuan Yan et.al. | 2502.04804 | null |
2025-02-07 | MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection | Zhiqiang Yang et.al. | 2502.04656 | null |
2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
2025-02-06 | An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras | Md. Jahin Alam et.al. | 2502.04566 | null |
2025-02-06 | Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection | Minseok Jung et.al. | 2502.04528 | null |
2025-02-06 | OneTrack-M: A multitask approach to transformer-based MOT models | Luiz C. S. de Araujo et.al. | 2502.04478 | null |
2025-02-07 | Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances | Yi Yu et.al. | 2502.04268 | null |
2025-02-06 | An object detection approach for lane change and overtake detection from motion profiles | Andrea Benericetti et.al. | 2502.04244 | null |
2025-02-06 | YOLOv4: A Breakthrough in Real-Time Object Detection | Athulya Sundaresan Geetha et.al. | 2502.04161 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-06 | Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount | Yanbiao Ma et.al. | 2502.03852 | null |
2025-02-06 | Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance | Zhenwei He et.al. | 2502.03835 | null |
2025-02-06 | UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection | Xi Song et.al. | 2502.03761 | null |
2025-02-06 | RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology | Nhat-Tan Do et.al. | 2502.03760 | null |
2025-02-05 | An Empirical Study of Methods for Small Object Detection from Satellite Imagery | Xiaohui Yuan et.al. | 2502.03674 | null |
2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | null |
2025-02-05 | RoboGrasp: A Universal Grasping Policy for Robust Robotic Control | Yiqi Huang et.al. | 2502.03072 | null |
2025-02-05 | Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features | Keiichiro Yamamura et.al. | 2502.02895 | null |
2025-02-05 | RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images | Lei Yang et.al. | 2502.02850 | null |
2025-02-04 | Learning the RoPEs: Better 2D and 3D Position Encodings with STRING | Connor Schenck et.al. | 2502.02562 | null |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | null |
2025-02-04 | From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection | Ashutosh Kumar et.al. | 2502.02027 | null |
2025-02-04 | Memory Efficient Transformer Adapter for Dense Predictions | Dong Zhang et.al. | 2502.01962 | null |
2025-02-04 | INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy | Nastaran Darabi et.al. | 2502.01896 | null |
2025-02-04 | SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Goodarz Mehr et.al. | 2502.01894 | null |
2025-02-03 | Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection | Reza Sadeghian et.al. | 2502.01856 | null |
2025-02-03 | GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection | Jeffri Murrugarra-LLerena et.al. | 2502.01565 | null |
2025-02-03 | Human Body Restoration with One-Step Diffusion Model and A New Benchmark | Jue Gong et.al. | 2502.01411 | null |
2025-01-31 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches | Ying Zang et.al. | 2501.19329 | null |
2025-01-31 | Beyond checkmate: exploring the creative chokepoints in AI text | Nafis Irtiza Tripto et.al. | 2501.19301 | link |
2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | null |
2025-01-31 | Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings | Ahmed K. Kadhim et.al. | 2501.18998 | null |
2025-01-31 | Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques | Samitha Vidhanaarachchi et.al. | 2501.18835 | null |
2025-01-30 | Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios | David El-Chai Ben-Ezra et.al. | 2501.18788 | null |
2025-01-30 | Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms | Abhinav Pratap et.al. | 2501.18444 | null |
2025-01-29 | Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks | Donghwa Kang et.al. | 2501.18412 | null |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-02-03 | Efficient Feature Fusion for UAV Object Detection | Xudong Wang et.al. | 2501.17983 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-28 | Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC | Tyler Wheeler et.al. | 2501.17892 | null |
2025-01-29 | Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms | Burak Ulaş et.al. | 2501.17538 | null |
2025-01-30 | Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection | Alicia Allmendinger et.al. | 2501.17387 | null |
2025-01-28 | DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications | Muhammad Shahbaz et.al. | 2501.17076 | null |
2025-01-28 | Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | Akash Kumar et.al. | 2501.17053 | null |
2025-01-28 | Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications | Chandrasekar Sridhar et.al. | 2501.17028 | null |
2025-01-28 | Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection | Xiangyu Gao et.al. | 2501.16981 | null |
2025-01-28 | B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning | Nikolaos Kaparinos et.al. | 2501.16917 | null |
2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
2025-01-28 | DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection | MD Sadik Hossain Shanto et.al. | 2501.16704 | null |
2025-01-27 | Efficient Object Detection of Marine Debris using Pruned YOLO Model | Abi Aryaza et.al. | 2501.16571 | null |
2025-01-27 | Object Detection for Medical Image Analysis: Insights from the RT-DETR Model | Weijie He et.al. | 2501.16469 | null |
2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
2025-01-27 | Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room | Santiago Cepeda et.al. | 2501.15994 | null |
2025-01-26 | Classifying Deepfakes Using Swin Transformers | Aprille J. Xi et.al. | 2501.15656 | null |
2025-01-26 | A Privacy Enhancing Technique to Evade Detection by Street Video Cameras Without Using Adversarial Accessories | Jacob Shams et.al. | 2501.15653 | null |
2025-01-26 | Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection | Zengran Wang et.al. | 2501.15449 | null |
2025-01-26 | FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control | Zhijian Hao et.al. | 2501.15426 | null |
2025-01-26 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing |