Chen Qing Chen Jia Yifan

Pioneering Perception Policy with Reinforcement Learning

We present Perception-R1, a scalable RL framework using Group Relative Policy Optimization (GRPO) during MLLM post-training. Key innovations: 🎯 Perceptual Perplexity Analysis: We introduce a novel ...

GitHub

OpenMOSS/Thus-Spake-Long-Context-LLM

This repository provides a collection of papers and resources focused on long-context LLMs, including architecture, infrastructure, training, and evaluation. For a clear taxonomy and more insights ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Pioneering Perception Policy with Reinforcement Learning

OpenMOSS/Thus-Spake-Long-Context-LLM

Trending now