Pull requests: ROCm/aiter

Pull requests list

[TRITON] Tuned DSV4-Flash FP8 GEMM configs
#3814 opened Jun 19, 2026 by skysnow2001 Contributor
1 task done
Simplify ck_gemm_a8w8_blockscale GemmSpecialization construction
#3813 opened Jun 19, 2026 by jbelloncastro
1 task done
Port/aakbarza/flydsl blockmoe fusion
#3810 opened Jun 19, 2026 by amirakb89
3 tasks done
[Triton] Add DSV4 BMM config for gfx950
#3808 opened Jun 19, 2026 by k50112113 Contributor
[HIP][FLYDSL]: add multi-backend prefill causal_conv1d kernels for GDN
#3803 opened Jun 18, 2026 by yiijin Contributor
[fea]: gfx1250 allreduce poc
#3802 opened Jun 18, 2026 by TennyWang1223 Contributor
1 task
[feature] Extract C++ code to jinja template files
#3801 opened Jun 18, 2026 by jbelloncastro
1 task done
[gfx950] Add JIT grouped_gemm_mxfp8 for MXFP8 prefill MoE
#3800 opened Jun 18, 2026 by fanxingran
1 task
idxsknorm_shuffle_layout support shuffle kv cache layout
#3795 opened Jun 18, 2026 by ganyi1996ppo Contributor
1 task
[FlyDSL] Port compress_attn kernels to gfx1250 (wave32)
#3787 opened Jun 18, 2026 by jli-melchior Contributor
1 task
[fea] Add fp32 RMSNorm output for fused qk group quant
#3785 opened Jun 18, 2026 by wuhuikx Contributor
perf: use vectorized LDS loads for mhc_pre_gemm_sqrsum on gfx942
#3781 opened Jun 17, 2026 by kudomcho
1 task done
[module_fused_split_gdr_update] refactor
#3777 opened Jun 17, 2026 by amd-ruitang3 Contributor
1 task
[gfx1250][FlyDSL]opt conc1 moe.
#3774 opened Jun 17, 2026 by lalala-sh Contributor
1 task
Fix/topk decode dispatch seqlen
#3773 opened Jun 17, 2026 by chuanbowang2026 Contributor
[FlyDSL AOT] Parallelize standalone main() compile drivers
#3769 opened Jun 17, 2026 by zhiding512 Contributor
[gfx1250][flydsl]moe group gemm swiglu limit for dsv4
#3767 opened Jun 17, 2026 by Zzz9990 Contributor
1 task