We Provide AI To Companies
Report from the OpenAI hackathon
Variance reduction for policy gradient with action-dependent factorized baselines