in video 103 why we used enumerate to loop through train_dataloader but didn't use enumerate to loop through test_dataloader #584
-
I have added 2 comments, each preceded by many question marks, in the code below. We used enumerate to loop through train_dataloader but didn't use enumerate to loop through test_dataloader. Why so? (The code is from the 16:10:30 timeframe, video 103: Training and testing loops for batch data.)

```python
from tqdm.auto import tqdm

# Set the seed and start the timer
torch.manual_seed(42)
train_time_start_on_cpu = timer()

# Set the number of epochs
epochs = 3

# Create training and test loop
for epoch in tqdm(range(epochs)):
    print(f"Epoch: {epoch}\n---------")

    ### Training
    train_loss = 0
    # Add a loop to loop through the training batches
    for batch, (X, y) in enumerate(train_dataloader):  # ??????? why do we use enumerate here?
        model_0.train()
        # 1. Forward pass
        y_pred = model_0(X)
        # 2. Calculate the loss (per batch)
        loss = loss_fn(y_pred, y)
        train_loss += loss
        # 3. Optimizer zero grad
        optimizer.zero_grad()
        # 4. Loss backward
        loss.backward()
        # 5. Optimizer step
        optimizer.step()
        # Print out what's happening
        if batch % 400 == 0:
            print(f"Looked at {batch * len(X)}/{len(train_dataloader.dataset)} samples")

    # Divide total train loss by length of train dataloader
    train_loss /= len(train_dataloader)

    ### Testing
    test_loss, test_acc = 0, 0
    model_0.eval()
    with torch.inference_mode():
        for X_test, y_test in test_dataloader:  # ??????? but not here?
            # 1. Forward pass
            test_pred = model_0(X_test)
            # 2. Calculate the loss
            test_loss += loss_fn(test_pred, y_test)
            # 3. Calculate the accuracy
            test_acc += accuracy_fn(y_true=y_test, y_pred=test_pred.argmax(dim=1))
        # Calculate the avg test loss per batch
        test_loss /= len(test_dataloader)
        # Calculate the avg test accuracy per batch
        test_acc /= len(test_dataloader)

    # Print out what's happening
    print(f"\nTrain loss: {train_loss:.4f} | Test loss: {test_loss:.4f}, Test acc: {test_acc:.4f}")

# Calculate the training time
train_time_end_on_cpu = timer()
total_train_time_model_0 = print_train_time(train_time_start_on_cpu,
                                            train_time_end_on_cpu,
                                            device=str(next(model_0.parameters()).device))
```
-
Enumerate (have a look at the documentation) returns an iterator that yields pairs of elements and their indexes. In the first loop, the index is assigned to `batch`, while the values are put into `X` and `y`. It looks like this is done so that every 400 batches, it prints an update: `if batch % 400 == 0: ...`. There's no need to have access to the batch number in the second loop because there are no progress updates there, so no need for `enumerate`.
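Here's a toy illustration of that pairing (a plain list instead of a DataLoader, but enumerate behaves the same way):

```python
# enumerate wraps any iterable and yields (index, element) pairs
batches = ["batch_a", "batch_b", "batch_c"]

for batch, data in enumerate(batches):
    print(batch, data)

# Prints:
# 0 batch_a
# 1 batch_b
# 2 batch_c
```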
-
Hello, first of all, thank you for the very useful course. I have a follow-up question on this topic. When we iterate through the batches, do we provide the full batch (32 images) at a time to the neural network? How does that work? Since the input size of the network is 28*28, how can it accept a full batch of 32 images? Thank you very much.
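For what it's worth, a minimal sketch of how the batch dimension is handled (assuming a single `nn.Linear` layer as a stand-in for `model_0`): PyTorch layers treat the first dimension of the input as the batch dimension, so a layer built for 28*28 = 784 input features accepts a tensor of shape `[32, 784]` and applies the same weights to each of the 32 rows in one vectorised call.

```python
import torch
from torch import nn

# A simplified stand-in for model_0: one linear layer expecting
# 28*28 = 784 features per sample
layer = nn.Linear(in_features=28 * 28, out_features=10)

# A batch of 32 flattened images: shape [32, 784]
X = torch.randn(32, 28 * 28)

# The same weights are applied to every row of the batch,
# giving one row of 10 outputs per image
y_pred = layer(X)
print(y_pred.shape)  # torch.Size([32, 10])
```

In other words, the "input size" of 28*28 describes each individual sample; the leading batch dimension is handled automatically.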