Skip to content

Use fp16 for convolution on navi#4895

Draft
pfultz2 wants to merge 22 commits into
developfrom
fast-conv
Draft

Use fp16 for convolution on navi#4895
pfultz2 wants to merge 22 commits into
developfrom
fast-conv

Conversation

@pfultz2
Copy link
Copy Markdown
Collaborator

@pfultz2 pfultz2 commented May 19, 2026

Motivation

Since navi doesnt have fp32 wmmas, we can use fp16 instead to get better performance.

Technical Details

This only converts the wmma to fp16 but still outputs fp32. To improve accuracy beyond just vanilla fp16 it extends the weights with residuals.

Changelog Category

Add a CHANGELOG.md entry for any option other than Not Applicable

    • Added: New functionality.
    • Changed: Changes to existing functionality.
    • Removed: Functionality or support that has been removed. (Compared to a previous release)
    • Optimized: Component performance that has been optimized or improved.
    • Resolved Issues: Known issues from a previous version that have been resolved.
    • Not Applicable: This PR is not to be included in the changelog.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant