Description
A
Questions:
1. [20 marks] What are the optimal Q values for the T-maze below, assuming that we value
diminishing returns with y=0.5?
2. [80 marks] Implement a neural network version of an RL to solve the linear maze example and
submit your program as jupyter notebook.