
Some doubts about SublayerConnection #100

Open
watersounds opened this issue Sep 19, 2022 · 5 comments

Comments

@watersounds commented Sep 19, 2022

According to what you wrote:
“That is, the output of each sub-layer is $\mathrm{LayerNorm}(x + \mathrm{Sublayer}(x))$, where $\mathrm{Sublayer}(x)$ is the function implemented by the sub-layer itself. We apply dropout (cite) to the output of each sub-layer, before it is added to the sub-layer input and normalized.”
I think the return value should be self.norm(x + self.dropout(sublayer(x))) rather than x + self.dropout(sublayer(self.norm(x))).
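For reference, here is a minimal sketch of the post-norm ordering described in the quoted passage, i.e. LayerNorm(x + Dropout(Sublayer(x))). It is illustrative only: it uses torch.nn.LayerNorm rather than the notebook's own LayerNorm class, and the class name and the feed-forward usage example are made up here, not taken from the notebook.

```python
import torch
import torch.nn as nn


class PostNormSublayerConnection(nn.Module):
    """Post-norm residual block: LayerNorm(x + Dropout(Sublayer(x)))."""

    def __init__(self, size, dropout):
        super().__init__()
        self.norm = nn.LayerNorm(size)
        self.dropout = nn.Dropout(dropout)

    def forward(self, x, sublayer):
        # Apply the sub-layer, dropout its output, add the residual,
        # then normalize the sum -- the order given in the quoted text.
        return self.norm(x + self.dropout(sublayer(x)))


# Usage with a position-wise feed-forward sub-layer (sizes are arbitrary).
d_model = 512
ffn = nn.Sequential(nn.Linear(d_model, 2048), nn.ReLU(), nn.Linear(2048, d_model))
block = PostNormSublayerConnection(d_model, dropout=0.1)
out = block(torch.randn(2, 10, d_model), ffn)  # -> shape (2, 10, 512)
```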

Look forward to your reply.

@StellaAthena

Where do we write x + self.dropout(sublayer(self.norm(x)))? That's not what the passage you quote says.

@Bruising6802 commented Sep 24, 2022

In the_annotated_transformer.py, on line 357. The function's docstring even says that the norm was moved.
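For anyone without the file open, the class in question amounts to something like the sketch below. Only the forward line is the one being discussed in this thread; the class name, the use of nn.LayerNorm in place of the notebook's own LayerNorm, and the __init__ details are stand-ins.

```python
import torch.nn as nn


class PreNormSublayerConnection(nn.Module):
    """Pre-norm residual block, mirroring the notebook's SublayerConnection:
    x + Dropout(Sublayer(LayerNorm(x))).
    The notebook's own docstring notes that the norm was moved to the front.
    """

    def __init__(self, size, dropout):
        super().__init__()
        # nn.LayerNorm stands in for the notebook's hand-written LayerNorm module.
        self.norm = nn.LayerNorm(size)
        self.dropout = nn.Dropout(dropout)

    def forward(self, x, sublayer):
        # Pre-norm: normalize first, then apply the sub-layer, dropout, and the residual add.
        return x + self.dropout(sublayer(self.norm(x)))
```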

@lvXiangwei

I have the same question. The explanation can be found in #92.

@Bruising6802

Maybe it's best to mention this issue in the notebook, since it confuses many readers.

@watersounds (Author) commented Sep 26, 2022

> Where do we write x + self.dropout(sublayer(self.norm(x)))? That's not what the passage you quote says.

return x + self.dropout(sublayer(self.norm(x)))
