Paul Christiano
1 min read · Jun 29, 2017


In the first case you calculate the values 𝔼[approval[T](a)] for each action a, and then choose the argmax.

In the second case you calculate the causal counterfactuals, 𝔼[approval[T](a)|do(a)], and then choose the argmax.
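As a minimal sketch, the two cases share the same argmax step and differ only in the quantity scored for each action. Everything named here is hypothetical: `choose_action`, the two scoring functions, the action labels, and the numbers in both tables are invented for illustration and are not taken from the post. In practice each table would be replaced by an actual estimate of the corresponding expectation.

```python
from typing import Callable, Hashable, Sequence


def choose_action(actions: Sequence[Hashable],
                  score: Callable[[Hashable], float]) -> Hashable:
    """Return the action with the highest score (first one wins on ties)."""
    return max(actions, key=score)


# Made-up values standing in for E[approval[T](a)] (first case).
PLAIN_EXPECTATION = {"left": 0.6, "right": 0.4}

# Made-up values standing in for E[approval[T](a) | do(a)] (second case).
CAUSAL_EXPECTATION = {"left": 0.3, "right": 0.5}


def expected_approval(a: str) -> float:
    """Hypothetical estimator for the first case."""
    return PLAIN_EXPECTATION[a]


def expected_approval_do(a: str) -> float:
    """Hypothetical estimator for the second case (causal counterfactual)."""
    return CAUSAL_EXPECTATION[a]


ACTIONS = ["left", "right"]

first_choice = choose_action(ACTIONS, expected_approval)
second_choice = choose_action(ACTIONS, expected_approval_do)
```

The tables were chosen so that the two rules pick different actions. Whether they actually diverge in a given setting depends on how the two expectations are computed, which this sketch does not model.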
