Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integration of Continual Learning tasks and algorithm (WIP) #45

Draft
wants to merge 171 commits into
base: master
Choose a base branch
from

Conversation

kalifou
Copy link
Collaborator

@kalifou kalifou commented May 21, 2019

This PR is a draft - the code is still in development:

  • add of tasks for CL: circular and square shaped move around a target, reaching a target (fex updates)
  • Updated data-generation: on-policy, grid walker, generative replay
  • CL algo : policy distillation
  • evaluation of catastrophic forgetting

TODO

  • doc & tests for CF
  • doc & tests for Distillation
  • doc & tests for all envs
  • Refactor code: config files, util functions ...
  • fix for multiprocessing of on-policy data generation
    i.e:
    Loading a model without an environment, this model cannot be trained until it has a valid environment. THCudaCheck FAIL file=/pytorch/aten/src/THC/THCGeneral.cpp line=55 error=3 : initialization error Process Process-2: Traceback (most recent call last): File "/home/rene/anaconda3/envs/py35/lib/python3.5/multiprocessing/process.py", line 252, in _bootstrap self.run() File "/home/rene/anaconda3/envs/py35/lib/python3.5/multiprocessing/process.py", line 93, in run self._target(*self._args, **self._kwargs) File "/home/rene/Documents/duplicate/robotics-rl-srl/rl_baselines/utils.py", line 186, in _run env_object=None) File "/home/rene/Documents/duplicate/robotics-rl-srl/state_representation/models.py", line 97, in loadSRLModel split_dimensions=split_dimensions, inverse_model_type=inverse_model_type) File "/home/rene/Documents/duplicate/robotics-rl-srl/state_representation/models.py", line 173, in __init__ self.model = self.model.to(self.device) File "/home/rene/anaconda3/envs/py35/lib/python3.5/site-packages/torch/nn/modules/module.py", line 381, in to return self._apply(convert) File "/home/rene/anaconda3/envs/py35/lib/python3.5/site-packages/torch/nn/modules/module.py", line 187, in _apply module._apply(fn) File "/home/rene/anaconda3/envs/py35/lib/python3.5/site-packages/torch/nn/modules/module.py", line 193, in _apply param.data = fn(param.data) File "/home/rene/anaconda3/envs/py35/lib/python3.5/site-packages/torch/nn/modules/module.py", line 379, in convert return t.to(device, dtype if t.is_floating_point() else None, non_blocking) File "/home/rene/anaconda3/envs/py35/lib/python3.5/site-packages/torch/cuda/__init__.py", line 162, in _lazy_init torch._C._cuda_init() RuntimeError: cuda runtime error (3) : initialization error at /pytorch/aten/src/THC/THCGeneral.cpp:55

Caselles and others added 30 commits March 14, 2019 11:56
kalifou and others added 30 commits June 13, 2019 11:52
…fusioner_issue

reward can be float for circular task and escaping task
…bot_data_fusioner_issue

bug fix for dataset_merger
fail to resample, delte
adding escaping task and modify the `dataset merger` to give the option for preserving the dataset after the merge
delete jupyter notebook created for testing purpose
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants