How to get the high attention regions of a given sequence. #104

ytye2010 · 2023-05-18T03:31:16Z

I want to extract those high attention regions for some given sequences. I have try #11 to get the last embedding vector for each token.
After take their means, I find some values are negative. Is it right? And how to compare these values? Larger those absolute values, higher attention?
The following is my try for your example in #11 using 6-new-12w-0 downloaded from this github.
sequence = "AATCTA ATCTAG TCTAGC CTAGCA"
output[0][0].mean(1) = [0.0017, -0.0006, -0.0032, -0.0047, 0.0022, -0.0069]
As my understand, the first 0.0017 and last -0.0069 stand for [CLS] and [SEP]. Right?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the high attention regions of a given sequence. #104

How to get the high attention regions of a given sequence. #104

ytye2010 commented May 18, 2023

How to get the high attention regions of a given sequence. #104

How to get the high attention regions of a given sequence. #104

Comments

ytye2010 commented May 18, 2023