Policy gradients with memory-augmented critic: Stabilizing off-policy policy gradients via differentiable memory

Takuma Seno, Michita Imai

Research output: Contribution to journalArticlepeer-review

Fingerprint

Dive into the research topics of 'Policy gradients with memory-augmented critic: Stabilizing off-policy policy gradients via differentiable memory'. Together they form a unique fingerprint.

Engineering & Materials Science