BCELoss between logits and labels not working

iamneerav · January 26, 2023, 8:09pm

I am using a GPT2 model that outputs logits (before softmax) in the shape (batch_size, num_input_ids, vocab_size) and I need to compare it with the labels that are of shape (batch_size, num_input_ids) to calculate BCELoss. How do I calculate it?

logits = output.logits #--of shape (32, 56, 592)
logits = torch.nn.Softmax()(logits)
labels = labels #---------of shape (32, 56)

torch.nn.BCELoss()(logits, labels)

but the dimensions do not match, so how do I contract logits to labels shape or expand labels to logits shape?

Topic		Replies	Views
Determining size of a logits Beginners	0	20	December 4, 2024
How to label dataset for Causal Language Modeling Beginners	0	497	January 27, 2023
BartForConditionalGeneration "logits" shape is wrong/unexpected 🤗Transformers	4	884	November 11, 2020
GPT-2 shift logits and labels 🤗Transformers	5	5474	May 12, 2023
Labels shape when using model.fit and TFGPT2LMHeadModel 🤗Transformers	0	752	February 1, 2021

BCELoss between logits and labels not working

Related topics