Skip to content

High training time for a model with QLSTM layer #896

Answered by nickfraser
shashwat1198 asked this question in Q&A
Discussion options

You must be logged in to vote

I haven't had the chance to run your code, but on first glance, I don't see anything obviously wrong with your training script. I don't currently have the bandwidth to analyse your code any further.

I set this as an environment variable. But it did not change the training time! Is there something else that I might be doing wrong?

This is a red flag to me that your network is not being compiled with PyTorch's JIT, with that is mind, I'd say your have 3 options:

  1. Debug your environment to confirm if TorchScript code is being compiled properly (i.e., with PYTORCH_JIT=1, without Brevitas)
  2. Use a "retraining" flow, instead of training from "scratch" (first train a floating point model, and re…

Replies: 1 comment 6 replies

Comment options

You must be logged in to vote
6 replies
@nickfraser
Comment options

@shashwat1198
Comment options

@shashwat1198
Comment options

@nickfraser
Comment options

Answer selected by shashwat1198
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants