Abstract: Temporal difference (TD) learning is a fundamental technique in reinforcement learning that updates value function estimates for states or state-action pairs using a TD target. This target ...
Abstract: This paper investigates the robust control problem of Markov jump linear systems (MJLSs) with unknown transition probabilities (TPs). While existing temporal difference learning (TDL) ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results