feat: Add npu megatron support by UsernameFull · Pull Request #380 · alibaba/ROLL

UsernameFull · 2026-03-16T01:12:23Z

Summary

This PR adds support for Huawei Ascend NPU devices with Megatron-Core backend, enabling ROLL framework to run reinforcement learning training on NPU hardware.

Key Changes

1. Platform Detection Priority

File: roll/platforms/__init__.py

Changes: Reordered platform detection to check NPU before CUDA.

Reason: NPU devices were incorrectly falling back to CUDA platform. Prioritizing NPU detection ensures NpuPlatform is properly initialized when torch_npu is available.

2. Device-Agnostic Operations

File: roll/pipeline/base_worker.py

Changes:

Replaced "cuda" with current_platform.device_type
Replaced torch.cuda.memory_allocated() with current_platform.memory_allocated()

3. MindSpeed Integration

File: mcore_adapter/src/mcore_adapter/training_args.py

Changes: Added optional import of mindspeed.megatron_adaptor .

Reason: MindSpeed is Huawei's library providing NPU-specific Megatron optimizations. The adaptor patches Megatron-Core for NPU compatibility while maintaining GPU compatibility via try-except.

4. NPU Attention Mask Format

File: roll/distributed/strategy/megatron_strategy.py

Changes: Added NPU-specific attention mask transformation to 4D format.

Reason: NPU requires 4D attention masks [B, 1, S, S] instead of standard 2D [B, S] . This hardware-specific requirement ensures correct attention computation on NPU.

if hasattr(torch, "npu") and torch.npu.is_available() and attention_mask is 
not None:
    attention_mask = attention_mask.bool()
    attention_mask = attention_mask[:, None, None, :].expand(B, 1, S, S)

5. Optimizer Compatibility

File: roll/third_party/megatron/optimizer.py

Changes: Added support for no_weight_decay_cond , scale_lr_cond , lr_mult parameters.

6. Example Configurations

Files:

- examples/ascend_examples/qwen3_4B_dpo_megatron.yaml
- examples/ascend_examples/qwen3_8b_rlvr_deepspeed.yaml
- examples/ascend_examples/run_dpo_pipeline.sh

Reason: Provides ready-to-use NPU training examples demonstrating proper device mapping and strategy configuration for both DPO and RLVR pipelines.

Impact

Benefits:

Enables megatron on Huawei Ascend NPU hardware
Maintains full backward compatibility with GPU systems
Follows existing platform abstraction patterns

Requirements

Huawei Ascend NPU with torch_npu installed
MindSpeed(v0.15.3) library for NPU Megatron support

HuangJoJo · 2026-04-16T11:19:19Z

plz attach qwen3-8B rlvr megatron training curves

UsernameFull · 2026-05-15T14:00:59Z

plz attach qwen3-8B rlvr megatron training curves

# Conflicts: # roll/pipeline/sft/sft_pipeline.py

feat: fix ascend example fix: ascend rlvr yaml fix fix: megatron fix

UsernameFull force-pushed the npu_megatron branch 2 times, most recently from fb2e7dc to acfad89 Compare March 17, 2026 06:54

UsernameFull changed the title ~~[WIP]Add npu megatron support~~ feat: Add npu megatron support Apr 2, 2026

UsernameFull force-pushed the npu_megatron branch 3 times, most recently from 1e7f794 to 0af5e74 Compare April 2, 2026 08:42

HuangJoJo reviewed Apr 16, 2026

View reviewed changes

Comment thread roll/platforms/__init__.py

HuangJoJo reviewed Apr 16, 2026

View reviewed changes

Comment thread mcore_adapter/src/mcore_adapter/training_args.py

UsernameFull force-pushed the npu_megatron branch from a80756c to 98064e2 Compare May 11, 2026 09:11

UsernameFull and others added 10 commits May 15, 2026 22:11

fix: add sft support on npu

dc30b93

# Conflicts: # roll/pipeline/sft/sft_pipeline.py

adapt mindspeed

91a7b79

feat: fix ascend example fix: ascend rlvr yaml fix fix: megatron fix

fix: add base worker npu support, add npu example yaml

8bbba61

fix: add base worker npu support

0720e28

feat: Add npu megatron support

2d35671

feat: Add npu megatron support

59f4529

feat: Add npu megatron support

29b9516

feat: Add npu megatron support

683bed3

feat: Add npu megatron support

8ddc156

fix: restore rlvr deepspeed max steps after rebase

0f2886d

UsernameFull force-pushed the npu_megatron branch from 97e1a0c to 0f2886d Compare May 15, 2026 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add npu megatron support#380

feat: Add npu megatron support#380
UsernameFull wants to merge 10 commits into
alibaba:mainfrom
UsernameFull:npu_megatron

UsernameFull commented Mar 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

HuangJoJo commented Apr 16, 2026

Uh oh!

UsernameFull commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

UsernameFull commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

1. Platform Detection Priority

2. Device-Agnostic Operations

3. MindSpeed Integration

4. NPU Attention Mask Format

5. Optimizer Compatibility

6. Example Configurations

Impact

Requirements

Uh oh!

Uh oh!

Uh oh!

HuangJoJo commented Apr 16, 2026

Uh oh!

UsernameFull commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

UsernameFull commented Mar 16, 2026 •

edited

Loading