In questa repository una collezione di tutorial sulle basi del Reinforcement Learning, sviluppati in Python, interamente in italiano.
-
Updated
Jun 3, 2024 - Jupyter Notebook
In questa repository una collezione di tutorial sulle basi del Reinforcement Learning, sviluppati in Python, interamente in italiano.
testing MLP, DQN, PPO, SAC, policy-gradient by snake
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
Several RL-agents are tested on classical environments and benchmarked against their stable-baselines implementation.
A PyTorch-based framework to conduct deep reinforcement learning research in multiple autonomous vehicle simulators
An elegant PyTorch deep reinforcement learning library.
Reinforcement Learning Short Course
This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.
This project provides a comprehensive understanding of reinforcement learning, focusing on Actor Critic Algorithms. It involves exploring the OpenAI Gym library, implementing the A2C algorithm from DeepMind's seminal paper, and enhancing the A2C algorithm for improved performance and stability.
Clean baseline implementation of PPO using an episodic TransformerXL memory
Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
DEEp Reinforcement learning framework
Baseline implementation of recurrent PPO using truncated BPTT
Simple maze solver by reinforcement learning
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
Yet another 2048 in reinforcement learning
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Add a description, image, and links to the policy-gradient topic page so that developers can more easily learn about it.
To associate your repository with the policy-gradient topic, visit your repo's landing page and select "manage topics."