1 min readJun 29, 2017
In the first case you calculate the values š¯”¼[approval[T](a)] for each action a, and then choose the argmax.
In the second case you calculate the causal counterfactuals, š¯”¼[approval[T](a)|do(a)], and then choose the argmax.