Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning | IEEE Conference Publication | IEEE Xplore