This repository contains the code for the experiments in Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding by Yuu Jinnai and Kaito Ariu.
The code has been tested on Ubuntu 20.04 with Python 3.8 and CUDA 11.0 (Docker image nvidia/cuda:11.0.3-cudnn8-devel-ubuntu20.04). It is provided mostly as is, with little refactoring.
git clone [email protected]:CyberAgentAILab/adaptive-mbr
cd adaptive-mbr
pip install -r requirements.txt
The code runs in two steps:
1. sample.sh samples candidates.
2. run_mbr.sh computes the MBR output from the sampled candidates.
./experiments/sample.sh -d [DATASET] -s [NUMBER OF SAMPLES]
./experiments/run_mbr.sh -d [DATASET] -s [NUMBER OF SAMPLES] -a [ALGORITHM]
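Both algorithm options are ways of speeding up Monte Carlo MBR decoding, which selects the candidate with the highest average utility against the sampled pseudo-references. For reference, below is a minimal sketch of the exhaustive baseline objective, assuming a sentence-level BLEU utility from sacrebleu; the utility function, candidates, and pseudo-references actually used by the scripts may differ, so treat this as an illustration rather than the repository's implementation.

```python
# Minimal sketch of exhaustive Monte Carlo MBR decoding (illustration only).
# Assumes sentence-level BLEU from sacrebleu as the utility function; the
# algorithms in this repository aim to reach the same decision with far
# fewer utility evaluations.
from sacrebleu.metrics import BLEU

def mbr_decode(candidates, pseudo_references):
    """Return the candidate with the highest mean utility over the pseudo-references."""
    bleu = BLEU(effective_order=True)  # effective_order avoids degenerate scores on short sentences
    best_candidate, best_score = None, float("-inf")
    for hyp in candidates:
        # Monte Carlo estimate of the expected utility of `hyp`.
        score = sum(bleu.sentence_score(hyp, [ref]).score for ref in pseudo_references)
        score /= len(pseudo_references)
        if score > best_score:
            best_candidate, best_score = hyp, score
    return best_candidate

# The sampled set is typically reused as both candidates and pseudo-references.
samples = ["this is a test .", "this is test .", "that is a test ."]
print(mbr_decode(samples, samples))
```

The exhaustive version needs a number of utility calls quadratic in the number of samples; reducing that cost is what the adaptive and pruning algorithms below are for.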
- Use sacrebleu to prepare the benchmark dataset.
mkdir -p ./dataset/wmt21
sacrebleu -t wmt21 -l en-de --echo src > ./dataset/wmt21/wmt21.en-de.en
sacrebleu -t wmt21 -l en-de --echo ref > ./dataset/wmt21/wmt21.en-de.de
- Sample candidates
./experiments/sample.sh -d wmt21.en-de
- Run adaptive MBR (a rough sketch of the general idea appears after this example)
./experiments/run_mbr.sh -d wmt21.en-de -a approx
- Run confidence-based pruning (CBP; also sketched after this example)
./experiments/run_mbr.sh -d wmt21.en-de -a pruning
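The approx option (adaptive MBR) avoids filling in the full candidate-by-pseudo-reference utility matrix. A rough, hypothetical sketch of the general idea: treat selection as best-candidate identification under a fixed budget of utility evaluations and repeatedly halve the candidate pool, so that surviving candidates accumulate estimates over more shared pseudo-references. The halving schedule, budget handling, and utility function here are placeholder assumptions, not the repository's implementation; see the paper for the actual algorithm and its guarantees.

```python
import math
import random

def adaptive_mbr_sketch(candidates, pseudo_references, utility, budget):
    """Sequential-halving-style sketch: spend `budget` utility evaluations over
    roughly log2(|candidates|) rounds, halving the candidate pool each round."""
    pool = list(candidates)
    refs = list(pseudo_references)
    random.shuffle(refs)
    num_rounds = max(1, math.ceil(math.log2(len(pool))))
    totals = {c: 0.0 for c in pool}  # cumulative utility per surviving candidate
    used = 0                         # pseudo-references consumed so far
    for _ in range(num_rounds):
        if len(pool) == 1:
            break
        # Share this round's budget equally among the surviving candidates.
        per_candidate = max(1, budget // (num_rounds * len(pool)))
        new_refs = refs[used:used + per_candidate]
        used += len(new_refs)
        # All survivors are scored on the same pseudo-references, so their
        # cumulative totals stay directly comparable.
        for c in pool:
            totals[c] += sum(utility(c, ref) for ref in new_refs)
        # Keep the better-scoring half of the pool.
        pool.sort(key=lambda c: totals[c], reverse=True)
        pool = pool[: max(1, len(pool) // 2)]
    return pool[0]
```

With a dummy utility such as lambda h, r: -abs(len(h) - len(r)) this runs as is; in practice the utility would be the sentence-level metric used for MBR.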
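The pruning option runs confidence-based pruning, a baseline from prior work: pseudo-references are consumed in increments and, after each increment, candidates that are unlikely (under bootstrap resampling of the utilities observed so far) to end up as the MBR output are discarded. The sketch below is again a hypothetical illustration; the increment schedule, confidence threshold, and bootstrap size are assumed values, not the repository's settings.

```python
import random

def cbp_sketch(candidates, pseudo_references, utility,
               schedule=(8, 16, 32, 64), alpha=0.99, n_bootstrap=200):
    """Confidence-based-pruning-style sketch: score candidates on a growing
    sample of pseudo-references and prune those whose bootstrap probability
    of being the best falls below 1 - alpha."""
    pool = list(candidates)
    refs = list(pseudo_references)
    random.shuffle(refs)
    scores = {c: [] for c in pool}  # per-reference utilities observed so far
    used = 0
    for target in schedule:
        new_refs = refs[used:target]
        used = max(used, target)
        for c in pool:
            scores[c].extend(utility(c, ref) for ref in new_refs)
        n = len(scores[pool[0]])
        if len(pool) == 1 or n == 0:
            break
        # Bootstrap over the indices of the pseudo-references seen so far.
        wins = {c: 0 for c in pool}
        for _ in range(n_bootstrap):
            idx = [random.randrange(n) for _ in range(n)]
            best = max(pool, key=lambda c: sum(scores[c][i] for i in idx))
            wins[best] += 1
        # Keep candidates that still have a plausible chance of being the MBR output.
        keep = [c for c in pool if wins[c] / n_bootstrap >= 1 - alpha]
        pool = keep if keep else [max(pool, key=lambda c: wins[c])]
    # Final decision: highest mean utility among the survivors.
    return max(pool, key=lambda c: sum(scores[c]) / len(scores[c]))
```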
BibTeX:
@inproceedings{jinnai-ariu-2024-hyperparameter,
title = "Hyperparameter-Free Approach for Faster Minimum {B}ayes Risk Decoding",
author = "Jinnai, Yuu and
Ariu, Kaito",
editor = "Ku, Lun-Wei and
Martins, Andre and
Srikumar, Vivek",
booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
month = aug,
year = "2024",
address = "Bangkok, Thailand and virtual meeting",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.findings-acl.505",
pages = "8547--8566",
}
For any questions, feel free to raise an issue or contact me at [email protected].
The MS COCO dataset is licensed under the Creative Commons Attribution 4.0 License.