[CK TILE] Add gemm basic v1 interwave pipeline #3616

bartekxk · 2026-01-20T11:43:12Z

Proposed changes

Please describe the motivation behind the pull request, whether it enables a new feature or fixes a bug. If there are associated pull requests or issues, please link them to the pull request.

Checklist

Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.

I have added tests relevant to the introduced functionality, and the unit tests are passing locally
I have added the test to REGRESSION_TESTS list defined at the top of CMakeLists.txt in tests/CMakeLists.txt, IF the test takes more than 30 seconds to run.
I have added inline documentation that helps the maintainers understand the motivation
I have removed the stale documentation which is no longer relevant after this pull request
(If this change is user-facing) I have added release notes which provide the end users with a brief summary of the improvement from this pull request
I have run clang-format on all changed files
Any dependent changes have been merged

Discussion

If this is a relatively large or complex change, feel free to start a discussion by explaining why you chose the solution you did and what alternatives you considered

…ocot/basic-v1-interwave

Copilot

Pull request overview

This PR adds a new interwave pipeline implementation for GEMM operations in the CK Tile library. The changes refactor the existing pipeline code to support multiple scheduling strategies (Intrawave and Interwave) through template specialization.

Changes:

Introduces GemmPipelineScheduler::Interwave specialization for GemmPipelineAGmemBGmemCRegV1
Refactors pipeline implementations to use a common base class (GemmPipelineAgBgCrImplBase)
Adds validation utilities with configurable tolerance values and max error tracking
Updates profiler output to include maximum error information
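To illustrate the scheduler-based specialization described above, here is a minimal sketch, not the CK Tile code itself. The enum mirrors the `GemmPipelineScheduler` name from this PR, but `PipelineImplBase`, `PipelineV1`, `prefetch_stages`, `name`, and `describe` are hypothetical stand-ins chosen for the example. The assumed idea is that a shared base holds the common logic and each scheduler value gets its own explicit specialization.

```cpp
#include <string>

// Toy stand-in for the scheduler enum named in the PR (not the real definition).
enum class GemmPipelineScheduler { Intrawave, Interwave };

// Hypothetical shared base: holds whatever both schedules have in common.
struct PipelineImplBase {
    static constexpr int prefetch_stages() { return 1; } // illustrative value only
};

// Primary template is only declared, so an unsupported scheduler
// fails at compile time instead of silently picking a default.
template <GemmPipelineScheduler Scheduler>
struct PipelineV1;

// Explicit specialization for the intrawave schedule.
template <>
struct PipelineV1<GemmPipelineScheduler::Intrawave> : PipelineImplBase {
    static std::string name() { return "intrawave"; }
};

// Explicit specialization for the interwave schedule.
template <>
struct PipelineV1<GemmPipelineScheduler::Interwave> : PipelineImplBase {
    static std::string name() { return "interwave"; }
};

// Callers select a schedule purely through the template argument.
template <GemmPipelineScheduler Scheduler>
std::string describe() {
    return "v1/" + PipelineV1<Scheduler>::name();
}
```

With this layout, `describe<GemmPipelineScheduler::Interwave>()` resolves to the interwave specialization at compile time, while both specializations still inherit the shared `prefetch_stages()` helper from the base.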

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.


File	Description
profiler/include/profiler/grouped_convolution_forward_tile_algs.hpp	Adds max error reporting to validation output
include/ck_tile/ops/gemm/pipeline/gemm_pipeline_agmem_bgmem_creg_v2.hpp	Refactors V2 pipeline to use base class and extract window creation logic
include/ck_tile/ops/gemm/pipeline/gemm_pipeline_agmem_bgmem_creg_v1.hpp	Adds Interwave scheduler specialization and refactors V1 pipeline structure
experimental/builder/include/ck_tile/builder/testing/validation.hpp	Adds tolerance getter functions and max error tracking to validation
example/ck_tile/20_grouped_convolution/conv_configs.hpp	Adds pipeline type trait definitions for BASIC_V1 and BASIC_V2
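
The validation change summarized above (configurable tolerances plus max error tracking) could look roughly like the following sketch. This is an assumption about the general technique, not the contents of `validation.hpp`: `ValidationReport`, `validate`, and the combined `atol + rtol * |expected|` bound are hypothetical choices made for illustration.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical result type: pass/fail plus the largest absolute error seen.
struct ValidationReport {
    bool passed;
    double max_abs_error;
    std::size_t mismatches;
};

// Compares two buffers element-wise against a combined tolerance
// (atol + rtol * |expected|) and records the maximum absolute error.
inline ValidationReport validate(const std::vector<float>& actual,
                                 const std::vector<float>& expected,
                                 double rtol,
                                 double atol)
{
    ValidationReport report{actual.size() == expected.size(), 0.0, 0};
    if (!report.passed) {
        return report; // size mismatch: nothing meaningful to compare
    }
    for (std::size_t i = 0; i < actual.size(); ++i) {
        const double err =
            std::fabs(static_cast<double>(actual[i]) - static_cast<double>(expected[i]));
        const double bound = atol + rtol * std::fabs(static_cast<double>(expected[i]));
        report.max_abs_error = std::max(report.max_abs_error, err);
        // Written as !(err <= bound) so that a NaN error counts as a mismatch.
        if (!(err <= bound)) {
            ++report.mismatches;
        }
    }
    report.passed = (report.mismatches == 0);
    return report;
}
```

Reporting `max_abs_error` alongside the pass/fail flag lets a profiler print how close a run came to the tolerance, which is more informative than a bare boolean when tuning a new pipeline.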


Copilot · 2026-01-20T22:11:04Z