Reinforcement learning versus heuristics for order acceptance on a single resource期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Reinforcement learning versus heuristics for order acceptance on a single resource

Authors:	M. Mainegra Hing A. van Harten P. C. Schuur

Affiliation:	(1) School of Business, Public Administration and Technology, University of Twente, Enschede, The Netherlands;(2) Present address: 36 Isabelle, Gatineau, Québec, J8Y 5G5, Canada

Abstract:	Order Acceptance (OA) is one of the main functions in business control. Accepting an order when capacity is available could disable the system to accept more profitable orders in the future with opportunity losses as a consequence. Uncertain information is also an important issue here. We use Markov decision models and learning methods from Artificial Intelligence to find decision policies under uncertainty. Reinforcement Learning (RL) is quite a new approach in OA. It is shown here that RL works well compared with heuristics. It is demonstrated that employing an RL trained agent is a robust, flexible approach that in addition can be used to support the detection of good heuristics.

Keywords:	Order acceptance Opportunity costs Decisions under uncertainty Markov decision process Reinforcement learning Artificial neural networks
本文献已被 SpringerLink 等数据库收录！