Q-Learning|A concept on AnyLearn

English

a guide for that

Concept

Q-Learning 0

Q-Learning is a model-free reinforcement learning algorithm used to find the optimal action-selection policy for a given finite Markov decision process. It learns the quality of actions, represented as a Q-value, which indicates the expected utility of taking a given action in a specific state and following the optimal policy thereafter.

Concepts

Reinforcement Learning

Markov Decision Process

Q-Value

Optimal Policy

Exploration Vs Exploitation

Temporal Difference Learning

Relevant Degrees

Computer Science and Data Processing 78%

Probability and Statistics 22%