Back close

A novel Q-learning algorithm with function approximation for constrained Markov decision processes

Publication Type : Conference Paper

Publisher : Allerton

Source : Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on (2012)

Keywords : Approximation algorithms, average cost control, constrained Markov decision processes, Constrained MDP, constrained routing problem, Decision theory, function approximation, inequality constraints, Lagrange multiplier method, learning (artificial intelligence), Markov processes, Minimization, multi-stage stochastic shortest path problem, multistage queueing network, multitimescale Q-learning algorithm, network theory (graphs), Parameter estimation, parameter update, policy parameter, Q-learning with linear function approximation, Q-value parameter, Queueing theory, Reinforcement learning, routing, Vectors, zinc

Campus : Bengaluru

School : School of Engineering

Department : Computer Science

Year : 2012

Abstract :

Cite this Research Publication : L. K. and Bhatnagar, S., “A novel Q-learning algorithm with function approximation for constrained Markov decision processes”, in Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on, 2012.

Admissions Apply Now