avx-512
Here are 35 public repositories matching this topic...
Design of the Fast-Orbit Feedback correction for SESAME's accelerator
-
Updated
Nov 8, 2023 - C
An implementation of Google's Encoded Polyline algorithm in AVX512 because why not. Perhaps the fastest and least portable polyline encoder out there?
-
Updated
Nov 9, 2022 - C
The vectorized (AVX-512) batched singular value decomposition algorithm for matrices of order two.
-
Updated
Dec 16, 2022 - C
-
Updated
Nov 14, 2022 - C++
The Tomato Patch FFT is the fastest FFT in the world- but it is by no means efficient.
-
Updated
Jul 7, 2023
Implementation of Hierarchy Oblivious Algorithms
-
Updated
Sep 3, 2019 - C++
Some loose performance experiments with Agner Fog's VCL
-
Updated
Apr 14, 2024 - C++
Zbynek's various C and C++ experiments
-
Updated
Jul 31, 2022 - C++
Matilda is a library to repeatedly multiply a constant matrix with a variable vector
-
Updated
May 23, 2024 - C++
Data for Intel Xeon-Phi server used in PyAF tests
-
Updated
Dec 31, 2020 - Python
Fast Fourier Transform implementation though x86 AVX-512 SIMD extension
-
Updated
Dec 1, 2023 - C++
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
-
Updated
May 13, 2023 - C++
high-speed math functions based on AVX-512 intrinsics
-
Updated
Jul 4, 2022 - C++
Improve this page
Add a description, image, and links to the avx-512 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the avx-512 topic, visit your repo's landing page and select "manage topics."