GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization
arXiv:2604.00717v1 Announce Type: cross
Abstract: Non-stationarity arises from concurrent policy updates and leads to persistent environmental fluctuations. Existing approaches like Centralized Training with Decentralized Execution (CTDE) and sequenti…