Abstract
This paper introduces a method for learning finite-horizon optimal control for systems with control-affine dynamics without using the model of the system dynamics. We approximate the time- and state-dependent optimal control policy using model-free relearning of simple linear control policies. Assuming persistent excitation, we prove the convergence and optimality of the proposed learning method and demonstrate its use through a numerical example.
| Original language | English |
|---|---|
| Article number | 106161 |
| Journal | Systems and Control Letters |
| Volume | 203 |
| DOIs | |
| Publication status | Published - Sept 2025 |
Keywords
- Control-affine systems
- Learning
- Model-free optimal control