Welcome to XTuner V1 English Documentation#

xtuner

LLM One-Stop Toolbox

Star Watch Fork

XTuner V1 is a new generation large model training engine specifically designed for ultra-large-scale MoE models. Compared with traditional 3D parallel training architectures, XTuner V1 has been deeply optimized for the current mainstream MoE training scenarios in academia.

πŸš€ Speed Benchmark#

_images/benchmark.png

Core Features#

πŸ“Š Dropless Training

  • Flexible Scaling, No Complex Configuration: 200B scale MoE without expert parallelism; 600B MoE only requires intra-node expert parallelism

  • Optimized Parallel Strategy: Compared with traditional 3D parallel solutions, smaller expert parallel dimensions enable more efficient Dropless training

πŸ“ Long Sequence Support

  • Memory Efficient Design: Through advanced memory optimization technology combinations, 200B MoE models can train 64k sequence length without sequence parallelism

  • Flexible Extension Capability: Full support for DeepSpeed Ulysses sequence parallelism, maximum sequence length can be linearly extended

  • Stable and Reliable: Insensitive to expert load imbalance during long sequence training, maintaining stable performance

⚑ Excellent Efficiency

  • Ultra-Large Scale Support: Supports MoE model training up to 1T parameters

  • Breakthrough Performance Bottleneck: First time achieving FSDP training throughput surpassing traditional 3D parallel solutions on MoE models above 200B scale

  • Hardware Optimization: Training efficiency surpasses NVIDIA H800 on Ascend A3 NPU supernodes

Performance comparison

πŸ”₯ Roadmap#

XTuner V1 is committed to continuously improving the pretraining, instruction fine-tuning, and reinforcement learning training efficiency of ultra-large-scale MoE models, with a focus on optimizing Ascend NPU support.

πŸš€ Training Engine#

Our vision is to build XTuner V1 into a universal training backend that seamlessly integrates into a broader open-source ecosystem.

Model

GPU(FP8)

GPU(BF16)

NPU(BF16)

Intern S1

βœ…

βœ…

βœ…

Intern VL

βœ…

βœ…

βœ…

Qwen3 Dense

βœ…

βœ…

βœ…

Qwen3 MoE

βœ…

βœ…

βœ…

GPT OSS

βœ…

βœ…

❌

Deepseek V3

βœ…

βœ…

❌

KIMI K2

βœ…

βœ…

❌

🧠 Algorithm Suite#

Algorithm components are rapidly iterating. Community contributions are welcome - use XTuner V1 to scale your algorithms to unprecedented scales!

Implemented

  • βœ… Multimodal Pretraining - Full support for vision-language model training

  • βœ… Multimodal Supervised Fine-tuning - Optimized for instruction following

  • βœ… GRPO - Group Relative Policy Optimization

Coming Soon

  • πŸ”„ MPO - Mixed Preference Optimization

  • πŸ”„ DAPO - Dynamic Sampling Policy Optimization

  • πŸ”„ Multi-round Agent Reinforcement Learning - Advanced agent training capabilities

⚑ Inference Engine Integration#

Seamless integration with mainstream inference frameworks

  • βœ“ LMDeploy

  • βœ— vLLM

  • βœ— SGLang

🀝 Contribution Guidelines#

We thank all contributors for their efforts to improve and enhance XTuner. Please refer to the Contribution Guidelines to understand the relevant guidelines for participating in the project.

πŸ™ Acknowledgments#

The development of XTuner V1 is deeply inspired and supported by excellent projects in the open-source community. We extend our sincere gratitude to the following pioneering projects:

Training Engines:

Reinforcement Learning:

XTuner V1’s reinforcement learning capabilities draw on the excellent practices and experience of the following projects

  • [veRL](volcengine/verl) - Volcano Engine Reinforcement Learning for LLMs

  • [SLIME](THUDM/slime) - THU’s scalable RLHF implementation

  • [AReal](inclusionAI/AReaL) - Ant Reasoning Reinforcement Learning for LLMs

  • [OpenRLHF](OpenRLHF/OpenRLHF) - An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray

We sincerely thank all contributors and maintainers of these projects for their continuous advancement of the large-scale model training field.

πŸ–ŠοΈ Citation#

@misc{2023xtuner,
    title={XTuner: A Toolkit for Efficiently Fine-tuning LLM},
    author={XTuner Contributors},
    howpublished = {\url{https://github.com/InternLM/xtuner}},
    year={2023}
}

Open Source License#

This project adopts the Apache License 2.0 Open Source License. At the same time, please comply with the licenses of the models and datasets used.