马尔可夫决策过程 meaning in English
markovian decision process
Examples
- Markov decision process , in short mdp , is also called sequential stochastic optimization stochastic optimum control . the controlled markov process or stochastic dynamic programming is the theory on stochastic sequential decision
马尔可夫决策过程( markovdecisionprocesses ,简称mdp ,又称序贯随机最优化、随机最优控制、受控的马尔可夫过程或随机动态规划)是研究随机序贯决策的问题的理论。 - On the optimization of cbm policy for repeated single mission , the markov decision process ( mdp ) is applied , and also a numerical method is proposed in which the maintenance threshold of state is determined according to the relation between the average availability or the average cost and system states when the condition data is not periodical
对重复单一任务的视情维修策略,应用了马尔可夫决策过程模型进行求解。针对试验数据是非定期检测的情况,提出了直接由数据得到优化目标与状态、工龄的关系曲线,从而得到维修状态阈值的数值方法。