Foolproof convergence in multichain policy iteration