why does the variable 'vocab' there have the property - 'vectors'? #100

HandanDiana · 2022-02-15T08:38:29Z

Hi, I'm a bit confused about the variable **'vocab'** there. I'm a beginner in this field, so these may be silly questions, but I hope you can give me some suggestions. Thanks!

First, was the variable 'vocab' generated by the CharVocab.from_data function mentioned in README.md? If so, I couldn't find a property called 'vectors' on it.
What exactly is the type of the variable 'vocab' there? Did I miss any other key points about 'vocab'?

davidegraff · 2022-09-21T21:25:14Z

vectors is an attribute of a OneHotVocabulary, a subclass of CharVocab. It's unnecessary though: the purpose of the class is really just to initialize the embedding layer (nn.Embedding) with the identity matrix (L18). Two notes:

- The weights of the embedding layer can be learned, and there's nothing special about the identity matrix.
- The embedding size does not need to equal the vocab size. Many VAE implementations project the vocabulary into a higher-dimensional embedding space (i.e., d_emb > n_vocab), which typically improves performance.
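
To make this concrete, here is a minimal framework-free sketch, not the repository's actual code. The class names `TinyCharVocab` and `TinyOneHotVocab`, the `embed` helper, and the toy data are all hypothetical stand-ins. It assumes only that an embedding layer looks up rows of a weight matrix by token id, which is what nn.Embedding does with its weights.

```python
import random


class TinyCharVocab:
    """Hypothetical stand-in for a character vocabulary: maps chars to ids."""

    def __init__(self, chars):
        self.c2i = {c: i for i, c in enumerate(sorted(set(chars)))}

    def __len__(self):
        return len(self.c2i)

    @classmethod
    def from_data(cls, data):
        # Collect every character that appears in the input strings.
        return cls(c for s in data for c in s)


class TinyOneHotVocab(TinyCharVocab):
    """Hypothetical subclass that additionally carries a `vectors` attribute."""

    def __init__(self, chars):
        super().__init__(chars)
        n = len(self)
        # Identity matrix: row i is the one-hot vector for token id i.
        self.vectors = [[1.0 if r == c else 0.0 for c in range(n)]
                        for r in range(n)]


def embed(weight, ids):
    """Embedding lookup is just row selection from the weight matrix."""
    return [weight[i] for i in ids]


vocab = TinyOneHotVocab.from_data(["CCO", "CN"])  # toy data, for illustration
ids = [vocab.c2i[c] for c in "CCO"]

# Identity-initialized weights: each token maps to its one-hot vector.
one_hot = embed(vocab.vectors, ids)

# Alternative: a randomly initialized (to-be-learned) weight matrix whose
# width d_emb is chosen independently of the vocab size.
rng = random.Random(0)
d_emb = 8
weight = [[rng.gauss(0.0, 1.0) for _ in range(d_emb)]
          for _ in range(len(vocab))]
dense = embed(weight, ids)
```

With the identity matrix, the lookup just reproduces a one-hot encoding, so a plain vocabulary without `vectors` works equally well once the embedding weights are initialized randomly and trained.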

HandanDiana changed the title ~~why the variable 'vocab' there has the property - 'vectors'?~~ why does the variable 'vocab' there have the property - 'vectors'? Feb 15, 2022
