Semi-Markovian decision models with vector-valued reward
Abstract:
The paper deals with vector-valued semi-Markovian decision process (VSMDP). Thereby we derive a suitable definition of the optimal average reward of a VSMDP. We construct an algorithm for improving policies in the vector-valued case.