We propose a method to capture the handling abilities of fast jet pilots...
Recent efforts to learn reward functions from human feedback have tended...
We generalise the problem of reward modelling (RM) for reinforcement lea...
We introduce a data-driven, model-agnostic technique for generating a
hu...
The potential of reinforcement learning (RL) to deliver aligned and
perf...
In explainable artificial intelligence, there is increasing interest in
...
The rule extraction literature contains the notion of a fidelity-accurac...
As we deploy autonomous agents in safety-critical domains, it becomes
im...