Q-learning in Javascript

Use start, step and stop buttons to control the learning process.

When it is stoped, click to add -1, and double click to add 1.


alpha (learning rate):
epsilon (exploration rate):
discount rate: