MMStar-Benchmark / MMStar (118 stars, Python, updated Apr 17, 2024)
Evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models".
Topics: evaluation, multimodality, multimodal-learning, visual-question-answering, multimodal, large-language-models, llm, llms, large-vision-language-model, large-vision-language-models, large-multimodal-models, lvlms, lvlm

BillChan226 / HALC (49 stars, Python, updated May 14, 2024)
[ICML 2024] Official implementation of "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding".
Topics: hallucinations, large-language-models, lvlms

Benchmark-Dysca / Dysca (0 stars, Python, updated Jun 9, 2024)
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs.
Topics: benchmark, lvlms