
Text generation can use raw_model instead of model #56

Open
sapphire008 opened this issue Jul 10, 2024 · 0 comments

sapphire008 commented Jul 10, 2024

The current script skips the text generation step when the model is compiled with torch.compile. However, if we change model(...) to raw_model(...) in the generation loop, text generation still works even when the model is compiled.

build-nanogpt/train_gpt2.py, lines 459 to 461 at commit 6104ab1:

with torch.no_grad():
    with torch.autocast(device_type=device_type, dtype=torch.bfloat16):
        logits, loss = model(xgen) # (B, T, vocab_size)
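The pattern behind the suggestion can be sketched as follows. This is a minimal, hedged illustration, not the script's actual code: torch.compile is replaced by a stand-in wrapper so the snippet runs without PyTorch, and TinyModel / fake_compile are invented names. The point is only the aliasing: keep an uncompiled handle (raw_model) before compiling, train through the compiled handle, and sample through the raw one.

```python
class TinyModel:
    """Stand-in for the GPT module: returns a fake 'logits' value."""
    def __call__(self, x):
        return x + 1

def fake_compile(module):
    # Stand-in for torch.compile(model): returns a wrapper around the module.
    # The real compiled graph is specialized to training-time shapes, which is
    # why calling it from the sampling loop can retrace or fail.
    def compiled(x):
        return module(x)
    return compiled

model = TinyModel()
raw_model = model            # keep the uncompiled reference BEFORE compiling
model = fake_compile(model)  # the training path uses the compiled wrapper

# Generation path: call raw_model instead of model, so sampling also works
# when the model has been compiled.
logits = raw_model(41)
print(logits)  # -> 42
```

In build-nanogpt itself, raw_model is the unwrapped module the script already keeps around (for DDP), so the change amounts to swapping which handle the sampling loop calls.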
