Conversation

@aartbik
Collaborator

@aartbik aartbik commented May 24, 2025

No description provided.

@aartbik aartbik requested review from cliffburdick and Copilot May 24, 2025 00:56
Contributor

Copilot AI left a comment

Pull Request Overview

Generalize the DIA-format sparse matrix-vector multiply to support non-square (m × n) matrices by introducing explicit row (m) and column (n) dimensions.

  • Add m and n from the input matrix sizes and pass both to the DIA kernel in matvec_cusparse.h
  • Update kernel signature and bounds check in matvec.cuh to use m (rows) instead of assuming a square matrix
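
To make the row/column distinction concrete, here is a minimal host-side reference sketch of a DIA matrix-vector product for an m × n matrix. It is not the MatX kernel: the function name `dia_spmv_ref` and the storage layout (one slot per row for every diagonal, so `vals[d * m + i]` holds A(i, i + offsets[d])) are assumptions made for illustration only. The loop runs over rows (`i < m`), while the column index derived from the diagonal offset is checked against `n`.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Host-side reference for c = A * b, with A an m x n matrix stored by diagonals.
// ASSUMED layout (hypothetical, not necessarily what MatX uses):
//   vals[d * m + i] holds A(i, i + offsets[d]); slots that fall outside the
//   matrix are padding and are never read.
template <typename VAL, typename CRD>
std::vector<VAL> dia_spmv_ref(const std::vector<VAL>& vals,
                              const std::vector<CRD>& offsets,
                              const std::vector<VAL>& b,
                              uint64_t m, uint64_t n) {
  assert(b.size() == n);                     // input vector has one entry per column
  assert(vals.size() >= offsets.size() * m); // one slot per (diagonal, row) pair

  std::vector<VAL> c(m, VAL{0});             // output vector has one entry per row
  for (uint64_t i = 0; i < m; ++i) {         // iterate over rows, not columns
    VAL sum{0};
    for (uint64_t d = 0; d < offsets.size(); ++d) {
      // Column touched by diagonal d in row i; may be negative or >= n.
      const int64_t j = static_cast<int64_t>(i) + static_cast<int64_t>(offsets[d]);
      if (j >= 0 && j < static_cast<int64_t>(n)) {
        sum += vals[d * m + i] * b[static_cast<uint64_t>(j)];
      }
    }
    c[i] = sum;
  }
  return c;
}
```

With a square matrix the two bounds coincide, which is why a single dimension was enough before; for a tall or wide matrix the row loop and the column check need different limits.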

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| include/matx/transforms/matmul/matvec_cusparse.h | Introduce m = a.Size(0) and n = a.Size(1); pass both to dia_spmv_kernel |
| include/matx/kernels/matvec.cuh | Update dia_spmv_kernel signature to (…, m, n) and change the loop bound from i < n to i < m |
Comments suppressed due to low confidence (2)

include/matx/transforms/matmul/matvec_cusparse.h:327

  • The number of blocks is currently computed using the number of columns (n), but the kernel launches one thread per row (i < m). To cover all rows, BATCHES should be computed as ceil(m / THREADS) instead of ceil(n / THREADS).
uint32_t BATCHES = static_cast<uint32_t>(cuda::std::ceil(static_cast<double>(n) / THREADS));
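
Taking this suppressed suggestion at face value, the block count would be derived from the row count. The sketch below shows one way to do that with integer ceiling division instead of a round trip through `double`. The helper name `blocks_for_rows` is hypothetical, and the thread count of 256 used in the note that follows is an assumed value, since the actual value of THREADS is not shown here. Whether the merged code needed this change is not confirmed by the conversation.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: number of blocks needed so that each of `rows` rows
// gets one thread when every block holds `threads` threads.
constexpr uint32_t blocks_for_rows(uint64_t rows, uint32_t threads) {
  // Integer ceiling division: rounds up whenever rows is not a multiple of threads.
  return static_cast<uint32_t>((rows + threads - 1) / threads);
}

// A tall matrix shows the difference: with an assumed 256 threads per block,
// sizing by 1000 rows gives 4 blocks, sizing by 10 columns gives only 1.
static_assert(blocks_for_rows(1000, 256) == 4, "sized by rows");
static_assert(blocks_for_rows(10, 256) == 1, "sized by columns");
```

For a tall matrix such as 1000 × 10, a grid sized from the column count would launch a single block of 256 threads and leave rows 256 through 999 unprocessed, which is the situation the comment describes.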

include/matx/kernels/matvec.cuh:44

  • [nitpick] The parameter numD is not immediately clear; consider renaming it to numDiags or diagCount to improve readability and self-documentation.
template <typename VAL, typename CRD>
__global__ void dia_spmv_kernel(VAL *A, CRD *diags, uint64_t numD, VAL *B, VAL *C, uint64_t m, uint64_t n) {

@cliffburdick
Collaborator

/build

@aartbik aartbik merged commit b86ccdb into NVIDIA:main May 24, 2025
1 check passed
@aartbik aartbik deleted the bok branch May 24, 2025 18:46
2 participants