Skip to content
View OliverLeeXZ's full-sized avatar

Block or report OliverLeeXZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
OliverLeeXZ/README.md

Hi, this is Xiaozhe. (Homepage)

I am currently a PhD student at the School of Computer Science and Technology, Tongji University. I am interning with the Post-Training Group of the OpenLM team at Shanghai AI Lab (January 2025 – Present), focusing on (M)LLMs and agentic post-training. If you are interested in collaboration, please feel free to reach out to me.

Oliver's GitHub stats

Pinned Loading

  1. open-compass/VLMEvalKit open-compass/VLMEvalKit Public

    Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

    Python 4.2k 722

  2. OPT-BENCH OPT-BENCH Public

    [ACL 2026] OPT-BENCH: Evaluating the Iterative Self-Optimization of LLM Agents in Large-Scale Search Spaces

    Python 125 8

  3. NP-Engine NP-Engine Public

    [ACL 2026] Official implement on 'Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs'

    Python 55 5

  4. SERL SERL Public

    Official implement on 'What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents'

    Python 66

  5. DMPO DMPO Public

    [ICML 2026] Official implement on 'Beyond Mode Collapse: Distribution Matching for Diverse Reasoning'

    Python 45 1

  6. InternLM/InternBootcamp InternLM/InternBootcamp Public

    Official implement on InternBootCamp

    Python 349 27