Abstract: Temporal difference (TD) learning is a fundamental technique in reinforcement learning that updates value function estimates for states or state-action pairs using a TD target. This target ...
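Illustrative sketch: the abstract is cut off before it defines the TD target, so the snippet below assumes the standard one-step tabular TD(0) form, target = r + gamma * V(s'), which may differ from the target the paper studies. The function name `td0_update`, the dictionary value table, and the default `alpha` and `gamma` are hypothetical choices made only for illustration.

```python
def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=0.9, terminal=False):
    """Apply one tabular TD(0) update in place and return the TD error.

    Assumption: one-step TD target r + gamma * V(s'); this is not taken
    from the abstract, which is truncated before the target is defined.
    """
    # Bootstrap from the current estimate of the next state,
    # or from zero if the transition ended the episode.
    bootstrap = 0.0 if terminal else values[next_state]
    td_target = reward + gamma * bootstrap
    # TD error: gap between the target and the current estimate.
    td_error = td_target - values[state]
    # Move the estimate a fraction alpha toward the target.
    values[state] += alpha * td_error
    return td_error


# Hypothetical two-state example.
values = {"s0": 0.0, "s1": 1.0}
delta = td0_update(values, "s0", reward=0.5, next_state="s1")
```

The same update applies to state-action pairs if the table is keyed by `(state, action)` instead of by state alone.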
Abstract: This paper investigates the robust control problem of Markov jump linear systems (MJLSs) with unknown transition probabilities (TPs). While existing temporal difference learning (TDL) ...