Skip to content

Releases: NVIDIA/MatX

v0.4.0

03 Apr 15:55

Choose a tag to compare

New Features

Bug Fixes

New Contributors

Full Changelog: v0.3.0...v0.4.0

v0.3.0

20 Jan 19:43
20e00a2

Choose a tag to compare

v0.3.0 marks a major release with over 100 features and bug fixes. Release cadence will occur more frequently after this release to support users not living at the HEAD.

What's Changed

Read more

Minor fix on name collision

05 Apr 04:18
8e6bb10

Choose a tag to compare

v0.2.5

Changed MAX name to not collide with other libraries (#162)

Minor fix

31 Mar 15:06
342ac85

Choose a tag to compare

Fixed argmin initialization issue that gave wrong results sometimes

v0.2.3

23 Mar 20:30
1c2a942

Choose a tag to compare

  • Improved error messages
  • Added support for einsum function. Includes tensor contractions, GEMMs with transposed outputs, dot products, and trace
  • Integrated cuTENSOR library
  • Added real/imag/r2c operators
  • Added chirp function
  • Added file readers for .mat files
  • Fixes to conv2, fft2
  • Switched to CUB for certain reductions. Results in a 4x speedup in some cases
  • Added find() and find_idx() functions
  • Added unique() function
  • Many CMake fixes to clean up transitive target
  • Added casting operators
  • Added negate operator

N-D Tensors

24 Dec 04:31
88c5c60

Choose a tag to compare

Added support for N-D tensors for:

  • Operators
  • FFTs
  • GEMMs
  • Reductions
  • Solver
  • Tensor/operator accesses

v0.2.1

17 Dec 23:59
0683898

Choose a tag to compare

Added unlimited concatenation of tensors

Tensor class refactoring

16 Dec 22:00
fe701b2

Choose a tag to compare

This release adds major changes with the main tensor class to allow for custom types for storage and descriptors. In addition, static tensor descriptors are now possible for compile time pointer arithmetic. As of this release it is not longer recommended to construct tensor_t objects directly. Instead, prefer the make_ variants of the functions.

Other features of this release are:

  • Refactored tensor class to use generic storage and descriptors

  • Adding comments on all make functions. Fixing spectrogram examples

  • Added concatenation operator

  • Added static tensors

  • Adding const on all operator() where applicable

  • Add more creation of tensors

  • Changed convolution example to use static tensor sizes

  • Added documentation for make

v0.1.1

08 Nov 23:01
de6e1a8

Choose a tag to compare

  • Added make_tensor helper functions
  • Updated Black-Scholes example
  • Moved host-specific defines into separate file
  • Updated build system to better track libcuda++ and nvbench
  • Improved release mode speed by turning off assertion checking
  • Improved host operator creation time by storing intermediate variables
  • Updated recursive filter example to error if not enough shared memory is available

v0.1.0

26 Oct 20:01
739ec6b

Choose a tag to compare

First public release of MatX. Brief list of supported features are:

  • Frontend API for cuBLAS, CUTLASS, cuFFT, cuSolver, cuRAND, and CUB
  • All standard POD data types supported, as well as fp16/bf16 and complex
  • Template expression trees to generate optimized device kernels
  • Examples for both performance and accuracy
  • Over 500 unit tests
  • Benchmarks using nvbench
  • Native CMake build system
  • and more!