Modern Reinforcement-learning using Deep Learning | Free Udemy Course

Model types, Algorithms and approaches, Function approximation, Deep reinforcement-learning, Deep Multi-agent Reinforcem

1.58

(8 ratings)

2312 students

Created by:

Nitsan Soffair

Course Language: English
Course Caption: English [Auto]
Course Length: 41:32 (2492 seconds)
Number of Lectures: 23

This course includes:

42 mins of on-demand video
19 additional resources

Hello, I am Nitsan Soffair, a Deep RL researcher at BGU.

In my Deep reinforcement-learning course you will learn the newest state-of-the-art Deep reinforcement-learning knowledge.

You will do the following:

Get state-of-the-art knowledge regarding:
Model types
Algorithms and approaches
Function approximation
Deep reinforcement-learning
Deep Multi-agent Reinforcement-learning

Validate your knowledge by answering short and very short quizzes for each lecture.

Be able to complete the course in ~2 hours.

Syllabus

Model types

Markov decision process (MDP): A discrete-time stochastic control process.
Partially observable Markov decision process (POMDP): A generalization of the MDP in which the agent cannot directly observe the underlying state.
Decentralized partially observable Markov decision process (Dec-POMDP): A generalization of the POMDP to multiple decentralized agents.

Algorithms and approaches

Bellman equations: A necessary condition for optimality in dynamic programming.
Model-free: A model-free algorithm does not use the transition probabilities or the reward function (the model) of the MDP.
Off-policy: An off-policy algorithm uses one policy for learning and a different policy for acting in the environment.
Exploration-exploitation: A trade-off in reinforcement learning between exploring new policies and exploiting existing ones.
Value-iteration: An iterative algorithm that repeatedly applies the Bellman optimality backup.
SARSA: An on-policy algorithm for learning a Markov decision process policy.
Q-learning: A model-free reinforcement-learning algorithm that learns the value of an action in a particular state.

Function approximation

Function approximators: Function approximation asks us to select a function from a well-defined class that closely matches ("approximates") a target function in a task-specific way.
Policy-gradient: Value-based, policy-based, actor-critic, policy-gradient, and softmax policy.
REINFORCE: A policy-gradient algorithm.

Deep reinforcement-learning

Deep Q-Network (DQN): A deep reinforcement-learning algorithm using experience replay and fixed Q-targets.
Deep Recurrent Q-Learning (DRQN): A deep reinforcement-learning algorithm for POMDPs that extends DQN and uses an LSTM.
Optimistic Exploration with Pessimistic Initialization (OPIQ): A deep reinforcement-learning algorithm for MDPs based on DQN.

Deep Multi-agent Reinforcement-learning

Value Decomposition Networks (VDN): A multi-agent deep reinforcement-learning algorithm for Dec-POMDPs.
QMIX: A multi-agent deep reinforcement-learning algorithm for Dec-POMDPs.
QTRAN: A multi-agent deep reinforcement-learning algorithm for Dec-POMDPs.
Weighted QMIX: A multi-agent deep reinforcement-learning algorithm for Dec-POMDPs.

Resources

Wikipedia
David Silver's Reinforcement-learning course

Who this course is for:

Anyone interested in Deep reinforcement-learning
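To make a few of the syllabus terms concrete (model-free, off-policy, exploration-exploitation, Q-learning), here is a minimal sketch of tabular Q-learning. It is not taken from the course: the 5-state chain environment, the function names (step, q_learning, greedy_policy) and the hyperparameter values are all assumptions chosen for illustration.

```python
import random

# Hypothetical toy MDP (an assumption, not from the course):
# states 0..4 on a line, action 0 moves left, action 1 moves right.
# Entering state 4 yields reward 1 and ends the episode.
N_STATES = 5
ACTIONS = (0, 1)
GOAL = N_STATES - 1


def step(state, action):
    """Deterministic transition; moving left from state 0 stays in state 0."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == GOAL
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table from sampled transitions only (model-free)."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Behaviour policy: epsilon-greedy (exploration vs. exploitation).
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(q[state])
                # Break ties at random so early episodes still reach the goal.
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Target policy: greedy over next actions. Because this differs
            # from the behaviour policy above, the update is off-policy.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state according to the Q-table."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(GOAL)]


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

In this sketch the agent never reads the transition rule inside step directly; it only sees sampled (state, action, reward, next state) tuples, which is what "model-free" refers to. With the assumed settings, the learned greedy policy should move right in every non-terminal state.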

Course Content:

3 Lectures | 03:08

7 Lectures | 05:13

3 Lectures | 03:08

3 Lectures | 04:50

4 Lectures | 05:24

3 Lectures | 19:49

