Lirio Research: A Novel Policy Comparison Metric for Reinforcement Learning
Lirio’s Behavioral Reinforcement Learning Lab (BReLL) recently published a paper describing the Limited Data Estimator, a new approach for comparing reinforcement learning policies when only limited historical data is available.