Finding optimal (s, S) policies is about as simple as evaluating a single policy