O'Reilly logo
  • Joji Monma thinks this is interesting:

Q(t+1, a)

From

Cover of Deep Reinforcement Learning Hands-On

Note

Q(s_{t+1}, a_{t+1}) to be correct