Skip to content

Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/1912.00349]

Notifications You must be signed in to change notification settings

keya-desai/Gated-Attention

Repository files navigation

Gated Attention

Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net)

Flow Diagram for the network:

There are two networks in the model:

  1. Backbone Network
  2. Auxiliary Network

Comparison with soft attention network:

Soft Attention gives some attention (low or high) to all the input tokens whereas gated attention network chooses the most important tokens to attend.

Gate Probability and gated attention:

Visualization of probability for gate to be open for input token and the actual gated attention weight.

About

Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/1912.00349]

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published