Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CHANGELOG for 4.4 #6914

Open
dalg24 opened this issue Apr 3, 2024 · 0 comments
Open

CHANGELOG for 4.4 #6914

dalg24 opened this issue Apr 3, 2024 · 0 comments

Comments

@dalg24
Copy link
Member

dalg24 commented Apr 3, 2024

This is the changelog for the upcoming 4.4 release

Features:

Backend and Architecture Enhancements:

CUDA:

  • nvcc(wrapper): Adding ability to process --disable-warnings flag 6936
  • Use recommended/max team size functions in Cuda ParallelFor and Reduce constructors #6891

HIP:

  • MI300 support unified memory support #6877 PATCH RELEASE 4.3.1
  • Use builtin atomics in the HIP backend #6882

SYCL:

  • Fix multi-GPU support #6887
  • Improve team_reduce performance #6562
  • Unroll shuffle loops for top-level parallel_reduce and parallel_scan #6750
  • Fix lock for guarding scratch space in TeamPolicy parallel_reduce #6988
  • Print submission command queue property #7004

OpenMPTarget:

  • Honor user requested loop ordering in MDRange policy #6925
  • Use mutex lock for guarding scratch space in parallel_scan. #6998

OpenACC:

HPX:

Threads:

OpenMP:

General Enhancements

  • Add converting constructor in Kokkos::RandomAccessIterator #6929
  • Add kokkos_swap(Array<T, N>) specialization #6943

Build System Changes

  • Do not require OpenMP support for languages other than CXX #6965
  • Update Intel GPU architectures in Makefile #6895
  • Fix use of OpenMP with Cuda or HIP as compile language #6972
  • develop is the default branch on GitHub now.

Incompatibilities (i.e. breaking changes)

  • Drop Kokkos::Array specialization of Impl::ViewMapping #6906
  • Deprecate trailing Proxy template argument in Kokkos::Array #6934

Deprecations

  • Remove Experimental::LayoutTiled class template and deprecate is_layouttiled trait #6907
  • Deprecate specialization of Kokkos::pair for a single element #6947

Bug Fixes

  • Do not return a copy of the input functor in Experimental::for_each #6910
  • Fix CUDA reduction overflow for RangePolicy #6578 PATCH RELEASE 4.3.1
  • nvcc_wrapper: bring back support for --fmad option #6931 PATCH RELEASE 4.3.1
  • Fix Makefile.kokkos for Threads #6896
  • Fix realloc on views of non-default constructible element types #6993
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant