Essay Gems

Performance Comparison

As shown in Table 1, the experimental results demonstrate that O-Researcher-RL establishes a new state-of-the-art among open-weights deep research models, significantly outperforming concurrent open-source works such as Tongyi-Deep Research [63]1 and MiroThinker [64]. Furthermore, our model transcends the capabilities of leading search-enhanced commercial LLMs, surpassing both the GPT-5 baseline [58] and OpenAI O3 [58], while also outperforming specialized proprietary agents like Perplexity Deep Research [55]. This indicates that our framework successfully bridges the performance gap between open-source models and top-tier closed-source systems.
Furthermore, in terms of relevance (KPR), O-Researcher-72B substantially outperforms（大大优于） standard search-enhanced LLMs and effectively rivals（有效竞争，没打过但是也不相上下了） specialized agents, demonstrating a robust ability to synthesize comprehensive reports that satisfy complex user information needs

实验

KD字典不一致

Generative AI

生成式AI时代下的机器学习

Essay Gems

Performance Comparison

Essay Gems ​

Performance Comparison ​

Essay Gems

Performance Comparison