Course outcomes
CO1: Understand the basics of reinforcement learning, its elements, and its limitations.
CO2: Understand finite Markov decision processes.
CO3: Understand temporal-difference learning and its advantages.
CO4: Understand Sarsa, maximization bias, and double learning.
Introduction: Reinforcement Learning, Elements of Reinforcement Learning, Limitations and Scope, An Extended Example: Tic-Tac-Toe. Multi-armed Bandits: A k-armed Bandit Problem, Action-value Methods, The 10-armed Testbed, Incremental Implementation, Tracking a Nonstationary Problem, Optimistic Initial Values, Upper-Confidence-Bound Action Selection, Gradient Bandit Algorithms.
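To ground the action-value methods and incremental implementation listed in this unit, here is a minimal sketch of an epsilon-greedy agent on a k-armed bandit; the arm means, step count, and epsilon value are illustrative assumptions, not prescribed by the syllabus.

```python
import numpy as np

def epsilon_greedy_bandit(true_means, steps=1000, epsilon=0.1, seed=0):
    """Sample-average action-value estimation with epsilon-greedy selection."""
    rng = np.random.default_rng(seed)
    k = len(true_means)
    Q = np.zeros(k)   # estimated action values
    N = np.zeros(k)   # number of times each arm was pulled
    rewards = []
    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the greedy arm.
        if rng.random() < epsilon:
            a = int(rng.integers(k))
        else:
            a = int(np.argmax(Q))
        r = rng.normal(true_means[a], 1.0)  # noisy reward from the chosen arm
        N[a] += 1
        Q[a] += (r - Q[a]) / N[a]           # incremental sample-average update
        rewards.append(r)
    return Q, float(np.mean(rewards))

Q, avg = epsilon_greedy_bandit(true_means=[0.2, 0.8, 0.5])
print("estimated values:", Q.round(2), "average reward:", round(avg, 3))
```

Replacing the sample-average step size 1/N[a] with a constant step size gives the update used for tracking a nonstationary problem, another topic in this unit.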
Finite Markov Decision Processes: The Agent–Environment Interface, Goals and Rewards, Returns and Episodes, Unified Notation for Episodic and Continuing Tasks, Policies and Value Functions, Optimal Policies and Optimal Value Functions, Optimality and Approximation.
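A small worked sketch of the discounted return underlying the value-function definitions in this unit; the reward sequence and discount factor below are made up for illustration.

```python
def discounted_return(rewards, gamma=0.9):
    """Compute G_0 = R_1 + gamma*R_2 + gamma^2*R_3 + ... for one episode."""
    g = 0.0
    for r in reversed(rewards):   # work backwards: G_t = R_{t+1} + gamma * G_{t+1}
        g = r + gamma * g
    return g

# 1.0 + 0.9*0.0 + 0.81*2.0 = 2.62
print(discounted_return([1.0, 0.0, 2.0], gamma=0.9))
```

The backwards recursion G_t = R_{t+1} + gamma * G_{t+1} is the same decomposition that leads to the Bellman equations for policies and value functions.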
Review of Markov Processes and Dynamic Programming.
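The dynamic-programming review pairs naturally with a worked example. Below is a minimal sketch of iterative policy evaluation on a hypothetical two-state Markov reward process; the transition matrix, rewards, and tolerance are assumptions for illustration only.

```python
import numpy as np

def policy_evaluation(P, R, gamma=0.9, tol=1e-8):
    """Iterative policy evaluation for a fixed policy.

    P[s, s'] : state-transition probabilities under the policy
    R[s]     : expected immediate reward in state s under the policy
    Solves V = R + gamma * P @ V by repeated Bellman expectation backups.
    """
    V = np.zeros(len(R))
    while True:
        V_new = R + gamma * P @ V   # one sweep of Bellman backups
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new

# Hypothetical 2-state chain: state 0 usually stays put, state 1 usually returns to 0.
P = np.array([[0.9, 0.1],
              [0.8, 0.2]])
R = np.array([1.0, 0.0])
print(policy_evaluation(P, R).round(3))
```

Because gamma < 1 makes the backup a contraction, the sweep converges to the unique fixed point V = R + gamma * P @ V.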
Temporal-Difference Learning: TD Prediction, Advantages of TD Prediction Methods, Optimality of TD(0), Sarsa: On-policy TD Control, Q-learning: Off-policy TD Control, Expected Sarsa, Maximization Bias and Double Learning.
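As a concrete instance of the off-policy TD control method in this unit, here is a minimal sketch of tabular Q-learning on a hypothetical one-dimensional corridor; the environment, rewards, and hyperparameters are illustrative assumptions.

```python
import numpy as np

def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States 0..n_states-1; actions 0 = left, 1 = right.
    Reaching the rightmost state yields reward 1 and ends the episode.
    """
    rng = np.random.default_rng(seed)
    Q = np.zeros((n_states, 2))
    for _ in range(episodes):
        s = 0
        while s < n_states - 1:
            # epsilon-greedy behaviour policy
            a = int(rng.integers(2)) if rng.random() < epsilon else int(np.argmax(Q[s]))
            s_next = max(s - 1, 0) if a == 0 else s + 1
            r = 1.0 if s_next == n_states - 1 else 0.0
            # off-policy target: bootstrap from the greedy action in s_next
            # (terminal Q-values are never updated, so they stay 0)
            Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
            s = s_next
    return Q

print(q_learning().round(2))
```

Using the actually selected next action in the target instead of the max gives Sarsa; averaging the target over the policy gives Expected Sarsa; and maintaining two independent tables whose roles alternate in the max gives double Q-learning, which addresses the maximization bias listed at the end of this unit.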