I am currently working with Erik Jørgensen on a project concerning optimal replacement strategies for sows. The problem is modelled using a multi-level hierarchic Markov decision process. The original model was developed by Anders Ringgaard Kristensen (KVL), with whom I have had fruitful discussions.
I have realized that directed hypergraphs can be used to model finite-horizon Markov decision processes. They provide an efficient way of storing the process, and finding the shortest hyperpath yields the optimal policy. Moreover, it should be possible to find the K best policies.
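
To make the idea concrete, here is a minimal sketch in Python. It is not the model from the project, and it assumes one particular encoding: every (stage, state) pair is a node, every action is a hyperarc whose head is the node where the action is taken and whose tail holds the possible successor nodes, weighted by their transition probabilities. The names Hyperarc and shortest_hyperpaths, the two-state keep/replace example and all costs are hypothetical. Under these assumptions the weight of a node is the minimum over its hyperarcs of the action cost plus the probability-weighted weights of the tail nodes, which amounts to ordinary backward induction.

```python
from dataclasses import dataclass


@dataclass
class Hyperarc:
    """One action: taken in `head`, leading to the nodes in `tail`."""
    head: tuple    # (stage, state) where the action is taken
    action: str    # label of the action (hypothetical names below)
    cost: float    # immediate cost of the action
    tail: dict     # successor node -> transition probability


def shortest_hyperpaths(nodes_by_stage, hyperarcs, terminal_cost):
    """Return (weight, policy) by processing the stages backwards.

    weight[node] is the minimal expected total cost from that node,
    policy[node] is the action attaining it.
    """
    incoming = {}
    for arc in hyperarcs:
        incoming.setdefault(arc.head, []).append(arc)

    weight = dict(terminal_cost)   # weights of the final-stage nodes
    policy = {}
    for stage_nodes in reversed(nodes_by_stage[:-1]):
        for node in stage_nodes:
            def arc_weight(arc):
                # action cost plus expected weight of the tail nodes
                return arc.cost + sum(p * weight[u] for u, p in arc.tail.items())
            best = min(incoming[node], key=arc_weight)
            weight[node] = arc_weight(best)
            policy[node] = best.action
    return weight, policy


# Hypothetical example: two decision stages, states "good" and "bad".
nodes_by_stage = [[(t, s) for s in ("good", "bad")] for t in range(3)]
terminal_cost = {(2, "good"): 0.0, (2, "bad"): 10.0}

hyperarcs = []
for t in range(2):
    nxt = t + 1
    hyperarcs += [
        Hyperarc((t, "good"), "keep", 1.0, {(nxt, "good"): 0.5, (nxt, "bad"): 0.5}),
        Hyperarc((t, "good"), "replace", 4.0, {(nxt, "good"): 1.0}),
        Hyperarc((t, "bad"), "keep", 2.0, {(nxt, "bad"): 1.0}),
        Hyperarc((t, "bad"), "replace", 5.0, {(nxt, "good"): 1.0}),
    ]

weight, policy = shortest_hyperpaths(nodes_by_stage, hyperarcs, terminal_cost)
print(weight[(0, "good")], policy[(0, "good")])
```

In this toy instance the chosen hyperarcs form a tree rooted at each first-stage node, and reading off the selected action at every node gives the policy. Storing only nodes and hyperarcs is what keeps the representation compact, since no full transition matrix per action is needed.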