News

We consider a discrete time Markov Decision Process with infinite horizon. The criterion to be maximized is the sum of a number of standard discounted rewards, each with a different discount factor.
3. A Weighted Decision Matrix The decision matrix was first developed in the 1960s by Stuart Pugh, a British engineer and product designer.