Skip to content

Releases: CHIP-SPV/chipStar

v1.2.1

11 Nov 13:15
3bd6515
Compare
Choose a tag to compare

What's Changed

This minor releases adds some fixes and performance improvements, most notably module caching.

New Contributors

Full Changelog: v1.2...v1.2.1

chipStar release 1.2

25 Sep 16:43
814e8f3
Compare
Choose a tag to compare

Release Notes

This release brings significant stability and performance improvements, enhanced support for CUDA, new HIP/ROCm library ports and integrations for HipBLAS, HipFFT, HipRAND/RocRAND. Initial testing of running HIP/CUDA applications on RISC-V.

Tested Platforms

  • Intel, AMD CPUs via Intel Compute Runtime
  • Intel GPUs via Neo i915 driver
  • ARM Mali GPUs (Quartz64 SBC)
  • RISC-V (Starfive Visionfive 2 SBC Debian, experimental)
  • AMD GPUs via rusticl(exploratory work)

Notable Changes

  • Introduced cucc, a drop-in replacement for nvcc:

    • Added cucc, enabling direct compilation of CUDA sources.
    • Added nvcc softlink, allowing you to compile CUDA sources without making any changes.
    • Adjusted CUDA headers to improve compatibility with CUDA sources, including a dummy cublas_v2.h header to prevent conflicts with system headers.
  • Enhanced OpenCL backend:

    • Support for cl_ext_buffer_device_address extension:
      • Added support for devices featuring the cl_ext_buffer_device_address extension, improving memory management capabilities.
    • Optimized queue profiling:
      • The OpenCL backend now uses non-profiling queues by default and switches to profiling queues only when needed, resulting in performance improvements.
    • Various other performance optimizations
  • Fixed Level Zero backend issues:

    • Addressed out-of-memory (OOM) errors:
      • Fixed memory leaks and improved resource management to prevent OOM errors during heavy workloads.
    • Improved thread safety:
      • Implemented mutexes and synchronization mechanisms to enhance thread safety within the Level Zero backend.
  • Rebased to HIP 6.x and updated hip-tests:

    • Updated the codebase to be compatible with HIP 6.x.

Library Support Changes

  • Expanded HIP library support:
    • HipBLAS integration:
      • Introduced the CHIP_BUILD_HIPBLAS option to enable building HipBLAS.
    • HipFFT integration:
      • Introduced the CHIP_BUILD_HIPFFT option to enable building HipFFT.
    • RocRAND port:

v1.2-RC1

04 Sep 07:41
95acbd1
Compare
Choose a tag to compare
v1.2-RC1 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v1.1...v1.2-RC1

chipStar release 1.1

22 Jan 13:08
ffebe4d
Compare
Choose a tag to compare

chipStar release 1.1

This release cycle focused on stabilization and performance improvements
over the 1.0 release. The release was measured to run some benchmarks up
to twice as fast as 1.0, with an average improvement of 30% measured on HeCBench.

Further highlights are described in the following.

Release Highlights

  • Added support for Clang/LLVM 17. LLVM 15 and 16 are still supported.
  • Ability to Use the Intel Unified Shared Memory Extension, with OpenCL backend
  • Optimized Atomic Operations
  • Use of Immediate Command Lists for Low Latency Dispatch, with Level Zero backend
  • Improved portability to other platforms & devices
  • Improved Asynchronous Execution

The full release notes are available in docs/release_notes/chipStar_1.1.rst

The full sources of the release (including git submodules) are available packaged in the attached file chipStar-1.1.tar.gz
(SHA256: 9258a313c503073a082ca310cebf048d84c4ab698facfc8d1d9ce1381ffb9fc5).

v1.1-RC4

16 Jan 09:55
ffebe4d
Compare
Choose a tag to compare
v1.1-RC4 Pre-release
Pre-release

The 4th release candidate for v1.1. Please test and add your results to the test log and any major problems or regressions as issues to the 1.1 milestone.

1.1 release notes.

v1.1-RC3

19 Dec 21:48
c2f604a
Compare
Choose a tag to compare
v1.1-RC3 Pre-release
Pre-release

The third release candidate for v1.1. Please test and add your results to the test log and any major problems or regressions as issues to the 1.1 milestone.

1.1 release notes.

v1.1-RC2

14 Dec 20:45
Compare
Choose a tag to compare
v1.1-RC2 Pre-release
Pre-release

The second release candidate for v1.1. Please test and add your results to the test log and any major problems or regressions as issues to the 1.1 milestone.

1.1 release notes.

v1.1-RC1

10 Nov 13:51
15ed7bc
Compare
Choose a tag to compare
v1.1-RC1 Pre-release
Pre-release

The first release candidate for v1.1. Please test and add your results to the test log and any major problems or regressions as issues to the 1.1 milestone.

1.1 release notes.

chipStar 1.0

17 Jul 15:06
Compare
Choose a tag to compare

chipStar Release 1.0

chipStar can compile HIP and CUDA applications to platforms which support SPIR-V as the device intermediate representation. It supports OpenCL and Level Zero as the low-level runtime alternatives. More info here.

The full sources of the release (including git submodules) are available packaged in the attached file chipStar_1_0.tar.gz (SHA256: aa2d46ad0ed6c1005e3466f0afcea26b594a49c5ec3725cf91842549d9b547ea).

Full release notes.

Release Highlights

  • The compilation toolchain works with Clang/LLVM 15 and 16.
  • Over 950 unit tests pass and several full real-world HPC applications have been tested to work
  • See the Features document for the current HIP/CUDA feature coverage.
  • Tested with Intel Level Zero on multiple GPU devices, Intel OpenCL for CPUs and GPUs and PoCL on Intel CPUs and GPUs (via the Level Zero driver)
  • Large number of bugs fixed since 0.9

Known Issues

The 1.0 release is focused on correctness. There are known bottlenecks that can limit the performance by up to 10x in some cases. Many of these performance bottlenecks will be addressed in the next release. Please keep this in mind while testing chipStar 1.0, and report any correctness/stability issues by opening an issue on github.

chipStar 1.0-RC3

17 Jul 11:17
Compare
Choose a tag to compare
chipStar 1.0-RC3 Pre-release
Pre-release

chipStar Release 1.0-RC3

chipStar can compile HIP and CUDA applications to platforms which support SPIR-V as the device intermediate representation. It supports OpenCL and Level Zero as the low-level runtime alternatives. More info here,

The full sources for the release (including git submodules) are available packaged in the attached file chipStar_1_0_RC3.tar.gz.

Full release notes.

Release Highlights

  • The compilation toolchain works with Clang/LLVM 15 and 16.
  • Over 950 unit tests pass and several full real-world HPC applications have been tested to work
  • See the Features document for the current HIP/CUDA feature coverage.
  • Tested with Intel Level Zero on multiple GPU devices, Intel OpenCL for CPUs and GPUs and PoCL on Intel CPUs and GPUs (via the Level Zero driver)
  • Large number of bugs fixed since 0.9

Known Issues

The 1.0 release is focused on correctness. There are known bottlenecks that can limit the performance by up to 10x in some cases. Many of these performance bottlenecks will be addressed in the next release. Please keep this in mind while testing chipStar 1.0, and report any correctness/stability issues by opening an issue on github.