Our code is based on open-r1, with our customized Trainer for mixed SFT+GRPO training. Some other updates focus on the white-box RL (reward function design) and post-completion training (replacement ...
Abstract: Integrated sensing and communications (ISAC) is envisioned to be a key to advanced applications in future wireless networks. In this paper, we study the coded modulation designs for ISAC ...