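$$
\begin{aligned}
\mathbb{E}_\pi[R_{t+1} + \gamma G_{t+1} \mid S_t = s]
&= \sum_a \pi(a \mid s)\, \mathbb{E}_\pi[R_{t+1} + \gamma G_{t+1} \mid S_t = s, A_t = a] \\
&= \sum_a \pi(a \mid s) \sum_{s'} \sum_r p(s', r \mid s, a)\, \mathbb{E}_\pi[R_{t+1} + \gamma G_{t+1} \mid S_t = s, A_t = a, S_{t+1} = s', R_{t+1} = r] \\
&= \sum_a \pi(a \mid s) \sum_{s'} \sum_r p(s', r \mid s, a) \left( r + \gamma\, \mathbb{E}_\pi[G_{t+1} \mid S_{t+1} = s'] \right)
\end{aligned}
$$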
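The final line of this derivation can be checked numerically on a small example. The sketch below is only an illustration under stated assumptions: the two-state, two-action MDP, its dynamics `p`, the policy `pi`, and the discount `gamma` are all made up, and the name `v` stands for $\mathbb{E}_\pi[G_t \mid S_t = s]$, a shorthand not used in the text above. It solves the linear system implied by the last line and then confirms that plugging the solution back into the right-hand side reproduces it.

```python
import numpy as np

# Hypothetical toy MDP (not from the text): 2 states, 2 actions,
# and two possible rewards, 0.0 and 1.0.
gamma = 0.9
rewards = np.array([0.0, 1.0])

# p[s, a, s_next, r_idx] = p(s', r | s, a); each p[s, a] sums to 1.
p = np.zeros((2, 2, 2, 2))
p[0, 0] = [[0.5, 0.0], [0.0, 0.5]]
p[0, 1] = [[0.1, 0.0], [0.2, 0.7]]
p[1, 0] = [[0.3, 0.3], [0.4, 0.0]]
p[1, 1] = [[0.0, 0.0], [0.6, 0.4]]

# pi[s, a] = pi(a | s); each row sums to 1.
pi = np.array([[0.5, 0.5], [0.2, 0.8]])


def bellman_rhs(v):
    """Last line of the derivation, with E[G_{t+1} | S_{t+1} = s'] = v[s']."""
    # target[s', r] = r + gamma * v(s')
    target = rewards[None, :] + gamma * v[:, None]
    # q[s, a] = sum_{s'} sum_r p(s', r | s, a) * target[s', r]
    q = np.einsum("sanr,nr->sa", p, target)
    # Weight each action by the policy and sum over actions.
    return np.einsum("sa,sa->s", pi, q)


def solve_v():
    """Solve v = r_pi + gamma * P_pi v, the linear system the last line implies."""
    P_pi = np.einsum("sa,sanr->sn", pi, p)            # state-to-state transitions under pi
    r_pi = np.einsum("sa,sanr,r->s", pi, p, rewards)  # expected one-step reward under pi
    return np.linalg.solve(np.eye(2) - gamma * P_pi, r_pi)


v = solve_v()
print("v:", v)
print("right-hand side at v:", bellman_rhs(v))
```

The two printed vectors should agree up to floating-point error. This only confirms that the last line is self-consistent for this toy model; it does not replace the conditioning argument in the derivation itself.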