next up previous
Next: Example Up: l9 Previous: Active Learning in Unknown

Learning an Action-Value Function