The extension of reinforcement learning to MDPs with large state,action space and high complexity has inevitably encountered the problem of the curse of dimensionality,which results in slow convergence and long training time.

英美

释义

- 传统的强化学习算法应用到大状态、动作空间和任务复杂的马尔可夫决策过程问题时；存在收敛速度慢；训练时间长等问题.

以上内容独家创作，受著作权保护，侵权必究

海词词典，十七年品牌

把海词放在桌面上，查词最方便

触屏版| 电脑版

©2003 - 2024 海词词典(Dict.cn)

立即下载

立即下载