Pull requests: harvardnlp/annotated-transformer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
use biased estimate of std in layernorm as in the original paper
#119
opened Jan 25, 2024 by
Arunprakash-A
Loading…
ProTip!
Follow long discussions with comments:>50.