News
Q-Learning, Sarsa: Model-free methods suitable for high-dimensional state spaces. Deep Q-Network (DQN): Combines Q-Learning with Deep Learning. Policy Gradients: Directly optimize the policy function.
About the speaker. Dr. Louis Ricardez-Sandoval is an Associate Professor in the Department of Chemical Engineering at the University of Waterloo (UW).Dr. Ricardez-Sandoval holds a Canada Research ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results