Abstract: In this paper, we present an off-policy reinforcement learning (RL) method used to tune the optimal weights of a nonlinear model predictive control (NMPC) scheme. The objective is to find ...
Abstract: Compared with Zhang dynamics (ZD) method, gradient dynamics (GD) method, which is intrinsically feasible and efficient to solve time-invariant problems, could have a simpler hardware ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results