Skip to content

sepermetric/seper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SePer: Official Implementation

Arxiv python 3.11 transformer 4.40 license Apache-2.0


TL;DR: SePer is an accurate / fast / free-of-API metric to measure document quality via information gain.


This repository contains the official implementation of our ICLR 2025 Spotlight🎉 paper:

SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction

Authors: Lu Dai, Yijie Xu, Jinhui Ye, Hao Liu, Hui Xiong

Link: OpenReview

Project Page: SePer Project Page

Overview

SePer introduces a novel framework to evaluate retrieval utility by analyzing semantic perplexity and semantic perplexity reduction. This approach provides a more fine-grained measurement of retrieval utility, offering insights beyond traditional evaluation metrics.

Below is a illustration of SePer's ability to perform fine-grained evaluation: illustration

Installation

Below is the steps for environment preparation:

conda create -n seper python=3.11
conda activate seper
pip install torch
pip install -r requirements.txt

Quick Start

We provide a jupyter notebook example.ipynb to demonstrate the minimal use of seper. You can quickly go through this notebook.

Benchmark For Retrievers

The benchmark for retrievers is available at here.

Citation

If you find our work useful, please cite our paper:

@inproceedings{dai2025seper,
  title={SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction},
  author={Dai, Lu and Xu, Yijie and Ye, Jinhui and Liu, Hao and Xiong, Hui},
  booktitle={International Conference on Learning Representations (ICLR)},
  year={2025}
}

About

SePer is an accurate / fast / free-of-API metric to measure document quality via information gain

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •