1 article
New methods use distributional and theoretical approaches to address bootstrapping error, inverse reward inference, and challenges in offline learning.