How to debug NaN logits during training

This has been a recurring problem, apparently because I made my own id2labels.json; you can look at it if that adds more context here. I did try "Getting all possible labels by grouping them after each inference" from this question for better context, and the model actually got confused about what to segment.
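For reference, this is the kind of check I have in mind for catching the NaN logits, a minimal sketch rather than my actual code: `model`, `train_dataloader`, `optimizer`, and the `pixel_values`/`labels` batch keys are assumed names for a PyTorch loop with a Hugging Face-style segmentation model.

```python
# Minimal sketch (assumed names, not the actual training script):
# stop at the first step where the logits contain NaN/Inf.
import torch

torch.autograd.set_detect_anomaly(True)  # report which op produced NaN/Inf in backward

for step, batch in enumerate(train_dataloader):
    outputs = model(pixel_values=batch["pixel_values"], labels=batch["labels"])

    if not torch.isfinite(outputs.logits).all():
        print(f"step {step}: NaN/Inf found in logits")
        # labels outside [0, num_labels - 1] are a common cause of exploding loss
        print("label values in this batch:", batch["labels"].unique())
        break

    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```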

I tried making the labels 0 and 1, like what I had today, and probably got a false positive because of the small number of epochs in that first training run. So what is the proper way to build an id2label from a pretrained model's id2label?
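To make the question concrete, this is roughly what I mean by deriving it from the pretrained mapping, a sketch only: the checkpoint name and the "background"/"my_object" labels are placeholders for whatever the task actually uses.

```python
# Sketch: read the pretrained id2label from the checkpoint's config instead of
# hand-writing id2labels.json, then define a custom binary mapping from it.
import json
from transformers import AutoConfig

checkpoint = "nvidia/segformer-b0-finetuned-ade-512-512"  # placeholder checkpoint name

config = AutoConfig.from_pretrained(checkpoint)

# Keys may be ints or strings depending on how the config was saved; normalize to ints.
pretrained_id2label = {int(k): v for k, v in config.id2label.items()}

# Custom binary task: label ids must match the values in the segmentation masks.
id2label = {0: "background", 1: "my_object"}
label2id = {v: k for k, v in id2label.items()}

with open("id2label.json", "w") as f:
    json.dump(id2label, f, indent=2)
```

If I understand the docs correctly, the new mapping would then be passed to `from_pretrained` (`id2label=...`, `label2id=...`, `num_labels=len(id2label)`, `ignore_mismatched_sizes=True`) so the classification head is resized, but please correct me if that is not the intended approach.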

Thanks for the example; I'll look into how to implement that in my code.

Btw, if I just let the training keep running, is the model actually broken, or am I just missing some stats that I could work around using its outputs, e.g. by generating masks with the new model and computing the mIoU manually?
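By "counting the mIoU manually" I mean something like the sketch below, assuming the predicted and ground-truth masks are integer class-id arrays of the same shape (function names are mine, not from any library):

```python
# Sketch: mean IoU over the classes that appear in either mask.
import numpy as np

def mean_iou(pred_mask: np.ndarray, gt_mask: np.ndarray, num_classes: int) -> float:
    ious = []
    for cls in range(num_classes):
        pred = pred_mask == cls
        gt = gt_mask == cls
        union = np.logical_or(pred, gt).sum()
        if union == 0:
            continue  # class absent from both masks; skip it
        intersection = np.logical_and(pred, gt).sum()
        ious.append(intersection / union)
    return float(np.mean(ious)) if ious else float("nan")
```

The predicted mask would come from something like `logits.argmax(dim=1)[0].cpu().numpy()`, possibly after resizing the logits to the ground-truth mask resolution.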
