Hybrid autoregressive transducer #1271

desh2608 · 2023-09-24T20:48:38Z

I was wondering if there are any existing recipes for the HAT model. It is a straightforward change by modeling the blank distribution as a Bernoulli distribution, and was shown to be useful to integrate external LMs, among other things.

Has anyone tried it in icefall, especially with the pruned loss?

csukuangfj · 2023-09-24T23:14:34Z

I was wondering if there are any existing recipes for the HAT model. It is a straightforward change by modeling the blank distribution as a Bernoulli distribution, and was shown to be useful to integrate external LMs, among other things.

Has anyone tried it in icefall, especially with the pruned loss?

We have not tried that. Would be great if you can add that.

desh2608 · 2023-09-27T13:33:04Z

@csukuangfj Do you have advice on what would be a good evaluation setup for using HAT to integrate external LMs? For example, how did you evaluate the LODR methods?

desh2608 · 2023-09-27T13:34:03Z

For a POC, I was just training a model on LibriSpeech, and was planning to use an external RNNLM. But Dan pointed out that LibriSpeech may not be the best test-bed for these experiments.

csukuangfj · 2023-09-27T14:28:54Z

@marcoyang1998

Could you have a look?

marcoyang1998 · 2023-09-27T14:36:26Z

You may try cross-domain evaluation scenarios, e.g. decoding the LibriSpeech model on the Gigaspeech test sets using an RNNLM trained on the Gigaspeech transcripts. I believe I tested LODR in this scenario and it yielded better results than using only shallow fusion.

desh2608 linked a pull request Oct 5, 2023 that will close this issue

Adding ILM beam search and decoding #1291

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hybrid autoregressive transducer #1271

Hybrid autoregressive transducer #1271

desh2608 commented Sep 24, 2023

csukuangfj commented Sep 24, 2023

desh2608 commented Sep 27, 2023

desh2608 commented Sep 27, 2023

csukuangfj commented Sep 27, 2023

marcoyang1998 commented Sep 27, 2023

Hybrid autoregressive transducer #1271

Hybrid autoregressive transducer #1271

Comments

desh2608 commented Sep 24, 2023

csukuangfj commented Sep 24, 2023

desh2608 commented Sep 27, 2023

desh2608 commented Sep 27, 2023

csukuangfj commented Sep 27, 2023

marcoyang1998 commented Sep 27, 2023