Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
-
Updated
Apr 1, 2021 - Jupyter Notebook
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
PyDiffGame is a Python implementation of a Nash Equilibrium solution to Differential Games, based on a reduction of Game Hamilton-Bellman-Jacobi (GHJB) equations to Game Algebraic and Differential Riccati equations, associated with Multi-Objective Dynamical Control Systems
Find the shortest route using A* algorithm and graphs (Route Planner application)
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".
Solving high dimensional HJB equation using tensor decomposition
Reinforcement Learning applied to Autonomous Networking to issue scheduling and decision to drones.
Foundations Of Intelligent Learning Agents (FILA) Assignments
Q-Learning from scratch in Python
A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spat…
A visualization tool for policy iteration and value iteration
Policy Iteration for Continuous Dynamics
Reinforcement learning
calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)
CSCI-561 AI Assignments.
In this project we use Reinforcement Learning to extract features from an image.
Provides methods to efficiently set up and solve dynamic programming problems of one choice variable using Value Function Iteration and the Endogenous Grid Method.
Dynamic Optimization project working on an economic model
This repository contains a variety of projects related to reinforcement learning, showcasing different approaches to implementing it in various scenarios.
Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow
Add a description, image, and links to the bellman-equation topic page so that developers can more easily learn about it.
To associate your repository with the bellman-equation topic, visit your repo's landing page and select "manage topics."