The simpleValueFunctionVis and manualValueFunctionVis methods require a policy to be passed in as a parameter. From what I understand, Sarsa(lambda) only updates the Q-values given S1, A1, R, S2, and A2.
Is there any way to set up a result visualizer for the Sarsa(lambda) algorithm shown in the "Basic Planning and Learning" tutorial?