Skip to content

Pretraining #235

Answered by Linux-cpp-lisp
IHumonen asked this question in Q&A
Aug 1, 2022 · 3 comments · 22 replies
Discussion options

You must be logged in to vote

Hi @IHumonen ,

This is definitely possible, and is one of the main uses of the initialize_from_state model builder (see docstring

def initialize_from_state(config: Config, model: GraphModuleMixin, initialize: bool):
).

Basically:

  1. Copy your config file and give it a new run name for the new training (the finetuning)
  2. Change the dataset and any other trainer settings that you want to update for your finetuning/further training
  3. Add the initialize_from_state model builder to the end of your model_builders list
  4. Add the initial_model_state key to your config and set it to the path to a state dict of saved weights from which…

Replies: 3 comments 22 replies

Comment options

You must be logged in to vote
16 replies
@Linux-cpp-lisp
Comment options

@wwwjjj123yyyy
Comment options

@Linux-cpp-lisp
Comment options

@wwwjjj123yyyy
Comment options

@Mathieu-Istas
Comment options

Answer selected by IHumonen
Comment options

You must be logged in to vote
4 replies
@Linux-cpp-lisp
Comment options

@IZugec
Comment options

@Linux-cpp-lisp
Comment options

@IZugec
Comment options

Comment options

You must be logged in to vote
2 replies
@Linux-cpp-lisp
Comment options

@ken1944
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
8 participants