Signature Approach for Contextual Bandits with Nonlinear and Path-dependent Rewards
arXiv:2605.10313v1 Announce Type: new
Abstract: We study contextual bandits with nonlinear and path-dependent rewards through a novel signature-transform-based approach. Leveraging the universal nonlinearity property of signatures, we approximate cont…