This is the CMU Speech Recognition course project.
This repository contains the official implementation of Mamba in E-Branchformer. The code builds on https://github.com/Tonyyouyou/Mamba-in-Speech, which accompanies the paper "Mamba in Speech: Towards an Alternative to Self-Attention". Thanks also to ESPnet for the code framework.
To build the ConBiMamba model, you first need to install Mamba. The causal-conv1d component is essential for Mamba and requires CUDA 11.8 or higher. Install both packages as follows (a quick sanity check follows the steps):
- Install the `causal-conv1d` package: `pip install "causal-conv1d>=1.2.0"`
- Install the `mamba-ssm` package: `pip install mamba-ssm`
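Once both packages are installed, a minimal sanity check like the following should run (the hyperparameter values are arbitrary, for illustration only; Mamba's kernels require a CUDA GPU):

```python
# Minimal sanity check that mamba-ssm is usable.
import torch
from mamba_ssm import Mamba

layer = Mamba(d_model=256, d_state=16, d_conv=4, expand=2).cuda()
x = torch.randn(2, 100, 256, device="cuda")  # (batch, time, d_model)
y = layer(x)                                 # output has the same shape as the input
print(y.shape)                               # torch.Size([2, 100, 256])
```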
The following files implement the ASR task in ESPnet.
This file contains the bidirectional outer Mamba module. It should be placed in `mamba_ssm/modules` (inside the installed `mamba_ssm` package in your environment).
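For reference, here is a minimal sketch of what an external (outer) bidirectional Mamba layer looks like, following the description in Figure 1. The class name `OuterBiMamba`, the sum-based combination of the two directions, and the placement of the activation are illustrative assumptions, not the repository's exact code:

```python
# Illustrative sketch of an outer bidirectional Mamba layer (not the repo's exact code).
import torch
import torch.nn as nn
from mamba_ssm import Mamba

class OuterBiMamba(nn.Module):
    def __init__(self, d_model: int):
        super().__init__()
        self.fwd = Mamba(d_model=d_model)  # processes the sequence left-to-right
        self.bwd = Mamba(d_model=d_model)  # processes the time-reversed sequence
        self.act = nn.SiLU()               # the "Act" block in Figure 1 (placement assumed)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, d_model)
        y_fwd = self.fwd(x)
        y_bwd = torch.flip(self.bwd(torch.flip(x, dims=[1])), dims=[1])  # re-align to forward time
        return self.act(y_fwd + y_bwd)  # combining by sum is an assumption
```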
`e_branchformer_encoder_mamba.py`: the E-Branchformer encoder with self-attention replaced by a Mamba layer (this version uses only `outer_bimamba`). It should be placed in `espnet/espnet2/asr/encoder`; a conceptual sketch of the layer follows.
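Conceptually, each encoder layer swaps the global self-attention branch for the bidirectional Mamba sketched above. A hedged sketch under that assumption (the real layer also includes macaron feed-forward modules and a depthwise-conv merge; `MambaBranchLayer` and the simplified local branch below are illustrative stand-ins):

```python
import torch
import torch.nn as nn

class MambaBranchLayer(nn.Module):
    """Illustrative two-branch layer: the global branch is OuterBiMamba
    (sketched above) instead of multi-head self-attention."""
    def __init__(self, d_model: int, d_ff: int):
        super().__init__()
        self.norm_g = nn.LayerNorm(d_model)
        self.global_branch = OuterBiMamba(d_model)      # replaces self-attention
        self.norm_l = nn.LayerNorm(d_model)
        self.local_branch = nn.Sequential(              # stand-in for cgMLP
            nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
        self.merge = nn.Linear(2 * d_model, d_model)    # concat-and-project merge

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        g = self.global_branch(self.norm_g(x))          # global branch
        l = self.local_branch(self.norm_l(x))           # local branch
        return x + self.merge(torch.cat([g, l], dim=-1))  # residual connection
```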
`e_branchformer_encoder_mamba_parallel.py`: the E-Branchformer encoder with an additional BiMamba branch in parallel (this version uses only `outer_bimamba`). It should be placed in `espnet/espnet2/asr/encoder`; a sketch of the parallel layer follows.
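In the parallel variant (Figure 2), self-attention is kept and a BiMamba branch is added alongside it, with the branch outputs merged. A minimal sketch, assuming a concat-and-project merge and the illustrative names below:

```python
import torch
import torch.nn as nn

class ParallelMambaLayer(nn.Module):
    """Illustrative three-branch layer: self-attention, a cgMLP stand-in,
    and the added OuterBiMamba branch, merged by concatenation."""
    def __init__(self, d_model: int, n_heads: int, d_ff: int):
        super().__init__()
        self.norm_a = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm_l = nn.LayerNorm(d_model)
        self.local = nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(),
                                   nn.Linear(d_ff, d_model))
        self.norm_m = nn.LayerNorm(d_model)
        self.bimamba = OuterBiMamba(d_model)          # the added parallel branch
        self.merge = nn.Linear(3 * d_model, d_model)  # merge the three branches

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        xa = self.norm_a(x)
        a, _ = self.attn(xa, xa, xa)                  # global self-attention branch
        l = self.local(self.norm_l(x))                # local (cgMLP-like) branch
        m = self.bimamba(self.norm_m(x))              # BiMamba branch
        return x + self.merge(torch.cat([a, l, m], dim=-1))
```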
`asr.py`: the ASR task definition, which should be placed in `espnet/espnet2/tasks`; this is where the new encoders are registered (see the sketch below).
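A hedged sketch of the change `asr.py` makes: registering the new encoders in ESPnet's encoder `ClassChoices`. The registered keys and class names below are assumptions; check the shipped `asr.py` for the exact identifiers:

```python
from espnet2.train.class_choices import ClassChoices
from espnet2.asr.encoder.abs_encoder import AbsEncoder
# Class names below are assumed for illustration:
from espnet2.asr.encoder.e_branchformer_encoder_mamba import EBranchformerEncoderMamba
from espnet2.asr.encoder.e_branchformer_encoder_mamba_parallel import EBranchformerEncoderMambaParallel

encoder_choices = ClassChoices(
    name="encoder",
    classes=dict(
        e_branchformer_mamba=EBranchformerEncoderMamba,                    # assumed key
        e_branchformer_mamba_parallel=EBranchformerEncoderMambaParallel,  # assumed key
        # ... plus ESPnet's built-in encoders
    ),
    type_check=AbsEncoder,
    default="rnn",
)
```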
This file is the example training config showing the required parameters for training the Mamba-based E-Branchformer architecture.
This file is the example training config showing the required parameters for training the E-Branchformer Mamba parallel architecture.
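For orientation, the encoder section of such a config looks roughly like the following, shown here as a Python dict mirroring the YAML keys. The registered encoder name and every parameter name and value are assumptions; the shipped config files are the authoritative reference:

```python
# Rough shape of the encoder section of a training config (all values assumed).
train_config = {
    "encoder": "e_branchformer_mamba",  # assumed key registered in asr.py
    "encoder_conf": {
        "output_size": 256,             # encoder width (assumed)
        "num_blocks": 12,               # number of encoder layers (assumed)
        "cgmlp_linear_units": 1024,     # cgMLP hidden size (assumed)
        # Mamba-specific knobs (names assumed):
        "mamba_d_state": 16,
        "mamba_d_conv": 4,
    },
}
```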
Figure 1: External bidirectional Mamba layer. Two standard Mamba layers process the forward and time-reversed input; "Act" denotes the SiLU activation.

Figure 2: E-Branchformer Mamba parallel structure.

For detailed analysis and results, please see our final project report.