This is one of the reasons that relying on a crowd-sourced or previously published Avios reward chart might not give you the full picture. Only by using the British Airways calculator, or attempting ...
In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...