
label smoothing inf err #109

Open

jerett opened this issue Apr 10, 2023 · 2 comments
@jerett

jerett commented Apr 10, 2023

When running the label smoothing section, I found that the call `crit(x=predict, target=torch.LongTensor([2, 1, 0, 3, 3]))` returns inf.
I think the `predict` variable shouldn't be passed through log, since log(0) is -inf, and as a result the loss plot draws nothing.

[screenshots: the inf loss output and the empty loss plot]

@satya400

Hi jerett - the inputs to KLDivLoss need to be in log space, hence the log(). The -inf appears because the tensor contains zeros, so applying log() to the `predict` tensor is what breaks LabelSmoothing().

Hence I propose using log_softmax() instead of log().

I also raised a pr.

Thanks
Satya
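A minimal sketch of the failure mode being discussed, assuming a KLDivLoss-based criterion as in the label smoothing section (the target values here are hypothetical smoothed probabilities, not taken from the thread):

```python
import torch
import torch.nn as nn

# KLDivLoss expects its input to be log-probabilities. If the predicted
# distribution contains exact zeros, log() yields -inf there and the
# loss becomes inf.
kl = nn.KLDivLoss(reduction="sum")

predict = torch.FloatTensor([[0.0, 0.2, 0.7, 0.1, 0.0]])   # contains zeros
target = torch.FloatTensor([[0.1, 0.1, 0.6, 0.1, 0.1]])    # smoothed target

loss_raw_log = kl(predict.log(), target)
print(loss_raw_log)  # inf, because log(0) = -inf

# Starting from unnormalized logits and applying log_softmax never
# produces -inf, since softmax outputs are strictly positive:
logits = torch.FloatTensor([[0.0, 1.0, 2.0, 0.5, 0.0]])    # hypothetical logits
loss_log_softmax = kl(logits.log_softmax(dim=-1), target)
print(loss_log_softmax)  # finite
```

Note this only applies cleanly when the model produces logits; if `predict` is already a probability distribution, applying log_softmax to it would renormalize it and change its meaning, which is the objection raised in the next comment.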

@alaneuler

Same problem here, but I don't think we should use log_softmax instead of log, because `predict` is already defined as a probability distribution.

Rather, I changed the `predict` tensor to

`predict = torch.FloatTensor([[1e-9, x / d - 1e-9, 1 / d, 1 / d, 1 / d]])`

to avoid the inf.

The result I get is the same as in the example provided.
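A sketch of that epsilon workaround, assuming `x` is the confidence mass and `d = x + 3` as in the penalization example (the concrete values of `x` and the target distribution below are hypothetical):

```python
import torch
import torch.nn as nn

kl = nn.KLDivLoss(reduction="sum")
target = torch.FloatTensor([[0.1, 0.1, 0.6, 0.1, 0.1]])  # hypothetical smoothed target

x = 1.0       # confidence mass, as in the loss(x) example
d = x + 3.0

# Replacing the exact zero with a tiny epsilon (and subtracting it from
# another entry) keeps predict a valid distribution -- the row still sums
# to 1 -- while making log() finite everywhere.
eps = 1e-9
predict = torch.FloatTensor([[eps, x / d - eps, 1 / d, 1 / d, 1 / d]])

loss = kl(predict.log(), target)
print(torch.isfinite(loss))  # tensor(True)
```

The design point here is that the epsilon fix preserves the probabilistic interpretation of `predict`, whereas log_softmax would silently renormalize values that were never meant to be logits.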

3 participants