Scalable Trust-Region Method for Deep Reinforcement Learning Using Kronecker-Factored Approximation

