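$\begin{aligned}
\mathbb{E}_\pi[R_{t+1}+\gamma G_{t+1}\mid S_t=s]
&= \sum_a \pi(a\mid s)\,\mathbb{E}_\pi[R_{t+1}+\gamma G_{t+1}\mid S_t=s, A_t=a] \\
&= \sum_a \pi(a\mid s)\sum_{s'}\sum_r p(s',r\mid s,a)\,\mathbb{E}_\pi[R_{t+1}+\gamma G_{t+1}\mid S_t=s, A_t=a, S_{t+1}=s', R_{t+1}=r] \\
&= \sum_a \pi(a\mid s)\sum_{s'}\sum_r p(s',r\mid s,a)\,\bigl(r+\gamma\,\mathbb{E}_\pi[G_{t+1}\mid S_{t+1}=s']\bigr)
\end{aligned}$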
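
As a sanity check, the identity can be tested numerically on a toy problem. The sketch below is only an illustration under stated assumptions: the two-state, two-action MDP, the policy `pi`, the dynamics table `p`, and all function names are hypothetical and not taken from the derivation. It computes the expected return directly from its definition as a discounted sum of expected rewards (truncated at a long horizon), then compares it with the right-hand side of the last line.

```python
# Hypothetical two-state, two-action MDP, made up for illustration only.
GAMMA = 0.9
STATES = [0, 1]
ACTIONS = [0, 1]

# p[(s, a)] maps (s_next, r) -> probability, i.e. p(s', r | s, a).
p = {
    (0, 0): {(0, 1.0): 0.5, (1, 0.0): 0.5},
    (0, 1): {(1, 2.0): 1.0},
    (1, 0): {(0, -1.0): 0.3, (1, 1.0): 0.7},
    (1, 1): {(0, 0.0): 1.0},
}

# pi[s][a] = pi(a | s), an arbitrary stochastic policy.
pi = {0: {0: 0.4, 1: 0.6}, 1: {0: 0.5, 1: 0.5}}


def expected_reward(s):
    """E_pi[R_{t+1} | S_t = s]."""
    return sum(
        pi[s][a] * prob * r
        for a in ACTIONS
        for (_, r), prob in p[(s, a)].items()
    )


def step_distribution(d):
    """Push a distribution over states one step forward under pi and p."""
    new = {s: 0.0 for s in STATES}
    for s, mass in d.items():
        for a in ACTIONS:
            for (s_next, _), prob in p[(s, a)].items():
                new[s_next] += mass * pi[s][a] * prob
    return new


def expected_return(s0, horizon=500):
    """Approximate E_pi[G_t | S_t = s0] as sum_k gamma^k E[R_{t+k+1}]."""
    d = {s: 1.0 if s == s0 else 0.0 for s in STATES}
    total, discount = 0.0, 1.0
    for _ in range(horizon):
        total += discount * sum(d[s] * expected_reward(s) for s in STATES)
        d = step_distribution(d)
        discount *= GAMMA
    return total


def bellman_rhs(v, s):
    """sum_a pi(a|s) sum_{s',r} p(s',r|s,a) (r + gamma * v(s'))."""
    return sum(
        pi[s][a]
        * sum(
            prob * (r + GAMMA * v[s_next])
            for (s_next, r), prob in p[(s, a)].items()
        )
        for a in ACTIONS
    )


v = {s: expected_return(s) for s in STATES}
for s in STATES:
    # The two printed values should agree up to truncation and rounding error.
    print(s, v[s], bellman_rhs(v, s))
```

The direct sum never uses the one-step recursion, so agreement between the two columns is a genuine check of the derivation rather than a restatement of it. The horizon of 500 is an arbitrary choice that makes the truncated tail negligible for $\gamma = 0.9$.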