[TRITON] Tuned DSV4-Flash FP8 GEMM configs by skysnow2001 · Pull Request #3814 · ROCm/aiter

skysnow2001 · 2026-06-19T15:58:02Z

Motivation

To run DeepSeek-V4-Flash on gfx1201 (RDNA4 / Navi48) via vLLM, the FP8 A8W8 block-scale Triton GEMM (GEMM-A8W8_BLOCKSCALE) needs gfx1201 tuning configs for the per-rank (N, K) shapes the model hits in its non-MoE linears at TP=8. This PR adds those configs so the operator runs tuned on this arch.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

github-actions · 2026-06-19T15:59:08Z

🏷️ CI Guide

Runs automatically on every PR:

✅ Pre-checks (submodule verification, code formatting)
✅ Aiter op tests (gfx942 + gfx950)
✅ Triton tests on MI35X (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

Label	Tests
`ci:triton-300x`	Run an additional Triton test job on MI300X in PRs; main branch always runs both MI35X and MI300X
`ci:sglang`	SGLang integration tests: DeepSeek-R1-MXFP4 accuracy, Qwen 3.5 accuracy
`ci:atom`	ATOM benchmark: DeepSeek-R1-0528, GPT-OSS-120B
`ci:atom_full`	ATOM accuracy suite for PR and main models from ATOM `models_accuracy.json`
`ci:vllm`	vLLM benchmark: GPT-OSS-120B, DeepSeek-R1-0528, Kimi-K2.5
`ci:all`	All standard extended tests (excludes `ci:atom_full`)

Only add ci:atom_full for FlyDSL or Triton upgrades.
Add labels via the sidebar or gh pr edit 3814 --add-label <label>

Tuned DSV4-Flash GEMM configs

f0e4aee

skysnow2001 requested a review from a team June 19, 2026 15:58

skysnow2001 changed the title ~~Tuned DSV4-Flash FP8 GEMM configs~~ [TRITON] Tuned DSV4-Flash FP8 GEMM configs Jun 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TRITON] Tuned DSV4-Flash FP8 GEMM configs#3814

[TRITON] Tuned DSV4-Flash FP8 GEMM configs#3814
skysnow2001 wants to merge 1 commit into
ROCm:mainfrom
skysnow2001:gemm-dsv4-config

skysnow2001 commented Jun 19, 2026

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

skysnow2001 commented Jun 19, 2026

Motivation

Submission Checklist

Uh oh!

github-actions Bot commented Jun 19, 2026

🏷️ CI Guide

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant