强化学习算法 meaning in Chinese
reinforcement learning algorithm
Examples
- Q - learning method is an online reinforcement learning recently
Q - learning算法是近期所提出的在线强化学习算法。 - A reinforcement learning algorithm based on process reward and prioritized sweeping is presented as interference solving strategy
本文提出了基于过程奖赏和优先扫除的强化学习算法作为多机器人系统的冲突消解策略。 - ( 4 ) a new cooperation model called macm is presentd and based on this model , an improved distributed reinforcement learning algorithm is also proposed
( 4 )提出一种新的多agent协作模型macm及一种改进的分布式强化学习算法。 - The macrl - cc analyses the speciality of the system ’ s goal , then decomposes it , and , by using the commitment - and - conventions - based method , the system achieves cooperative problem solving betweem agents
本文的主要成果及创新是,提出了两种多agent协同强化学习算法,并进行了实验验证。 - In this paper , we elaborate some domains an information agent involves , introduce reinforce learning into dynamic scheduling of searching engines , and realize intelligent scheduling of searching engines
本文对信息agent所涉及的关键技术进行了比较深刻的研究,把强化学习算法引入到搜索引擎的动态调度中来,实现了搜索引擎的智能化调度。