sarsa 中文意思是什麼

sarsa 解釋
撒爾沙
  1. Reinforcement learning algorithms that use cerebellar model articulation controller ( cmac ) are studied to estimate the optimal value function of markov decision processes ( mdps ) with continuous states and discrete actions. the state discretization for mdps using sarsa - learning algorithms based on cmac networks and direct gradient rules is analyzed. two new coding methods for cmac neural networks are proposed so that the learning efficiency of cmac - based direct gradient learning algorithms can be improved

    在求解離散行為空間markov決策過程( mdp )最優策略的增強學習演算法研究方面,研究了小腦模型關節控制器( cmac )在mdp行為值函數逼近中的應用,分析了基於cmac的直接梯度演算法對mdp狀態空間離散化的特點,研究了兩種改進的cmac編碼結構,即:非鄰接重疊編碼和變尺度編碼,以提高直接梯度學習演算法的收斂速度和泛化性能。
分享友人