Skip to content

Latest commit

 

History

History
7 lines (7 loc) · 347 Bytes

File metadata and controls

7 lines (7 loc) · 347 Bytes

Reinforcement-Learning-An-Introduction-programs

这里是对 Reinforcement-Learning-An-Introduction 中 example 的实现。

chapter 6

  1. random walker rmse
  2. windy grid world windy grid world