Name	Name	Last commit message	Last commit date
parent directory ..
00_introduction_example	00_introduction_example
01_simple_fft_thread	01_simple_fft_thread
02_simple_fft_block	02_simple_fft_block
03_block_fft_performance	03_block_fft_performance
04_nvrtc_fft	04_nvrtc_fft
05_fft_Xd	05_fft_Xd
06_convolution	06_convolution
07_convolution_3d	07_convolution_3d
08_mixed_precision	08_mixed_precision
09_introduction_lto_example	09_introduction_lto_example
10_cufft_device_api_example	10_cufft_device_api_example
common	common
lto_helper	lto_helper
CMakeLists.txt	CMakeLists.txt
Makefile	Makefile
README.md	README.md
README_LTO.md	README_LTO.md

cuFFTDx Library - API Examples

NOTE: For information about cuFFTDx + cuFFT LTO, please visit cuFFTDx + cuFFT LTO EA Library - API Examples.

All examples, including more advanced ones, are shipped within cuFFTDx package.

Description

This folder demonstrates cuFFTDx APIs usage.

Requirements

cuFFTDx/MathDx package
See cuFFTDx requirements
CMake 3.26 or newer
Linux system with installed NVIDIA drivers
NVIDIA GPU of Volta (SM70) or newer architecture

Build

You may specify CUFFTDX_CUDA_ARCHITECTURES to limit CUDA architectures used for compilation (see CMake:CUDA_ARCHITECTURES)
mathdx_ROOT - path to mathDx package (XX.Y - version of the package)

mkdir build && cd build
cmake -DCUFFTDX_CUDA_ARCHITECTURES=70-real -Dmathdx_ROOT=/opt/nvidia/mathdx/XX.Y ..
make
# Run
ctest

Examples

For the detailed descriptions of the examples please visit Examples section of the cuFFTDx documentation.

Group	Subgroup	Example	Description
Introduction Examples		introduction_example	cuFFTDx API introduction
Simple FFT Examples	Thread FFT Examples	simple_fft_thread	Complex-to-complex thread FFT
		simple_fft_thread_fp16	Complex-to-complex thread FFT half-precision
	Block FFT Examples	simple_fft_block	Complex-to-complex block FFT
		simple_fft_block_shared	Complex-to-complex block FFT shared-memory API
		simple_fft_block_std_complex	Complex-to-complex block FFT with `cuda::std::complex` as data type
		simple_fft_block_half2	Complex-to-complex block FFT with `__half2` as data type
		simple_fft_block_fp16	Complex-to-complex block FFT half-precision
		simple_fft_block_c2r	Complex-to-real block FFT
		simple_fft_block_r2c	Real-to-complex block FFT
		simple_fft_block_c2r_fp16	Complex-to-real block FFT half-precision
		simple_fft_block_r2c_fp16	Real-to-complex block FFT half-precision
		simple_fft_block_block_dim	Complex-to-complex block FFT with BlockDim operator
FFT Performance		block_fft_performance	Benchmark for C2C block FFT
		block_fft_performance_many	Benchmark for C2C/R2C/C2R block FFT
NVRTC Examples		nvrtc_fft_thread	Complex-to-complex thread FFT
		nvrtc_fft_block	Complex-to-complex block FFT
		nvrtc_query_database_fft_block	Complex-to-complex block FFT using runtime FFT database query
2D/3D FFT Examples		fft_2d	Example showing how to perform 2D FP32 C2C FFT with cuFFTDx
		fft_2d_single_kernel	2D FP32 FFT in a single kernel using Cooperative Groups kernel launch
		fft_2d_single_kernel_block_dim	2D FP32 FFT in a single kernel using Cooperative Groups kernel launch and dimensions with different ept
		fft_2d_r2c_c2r	Example showing how to perform 2D FP32 R2C/C2R convolution with cuFFTDx
		fft_3d	Example showing how to perform 3D FP32 C2C FFT with cuFFTDx
		fft_3d_box_single_block	Small 3D FP32 FFT that fits into a single block, each dimension is different
		fft_3d_cube_single_block	Small 3D (equal dimensions) FP32 FFT that fits into a single block
Convolution Examples		convolution	Simplified FFT convolution
		convolution_padded	R2C-C2R FFT convolution with optimization and zero padding
		convolution_r2c_c2r	Simplified R2C-C2R FFT convolution
		convolution_performance	Benchmark for FFT convolution using cuFFTDx and cuFFT
3D Convolution Examples		convolution_3d	cuFFTDx fused 3D convolution with preprocessing, filtering and postprocessing
		convolution_3d_c2r	cuFFTDx fused 3D C2R/R2C FFT convolution
		convolution_3d_r2c	cuFFTDx fused 3D R2C/C2R FFT convolution
		convolution_3d_padded	cuFFTDx fused 3D FFT convolution using zero padding
		convolution_3d_padded_r2c	cuFFTDx fused 3D R2C/C2R FFT convolution with zero padding
Mixed Precision Examples		mixed_precision_fft_1d	Example showing how to use separate storage and compute precisions
		mixed_precision_fft_2d	Mixed precision 2D FFT with benchmarking and accuracy comparison

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

cuFFTDx Library - API Examples

Description

Requirements

Build

Examples

FilesExpand file tree

cuFFTDx

Directory actions

More options

Directory actions

More options

Latest commit

History

cuFFTDx

Folders and files

parent directory

README.md

cuFFTDx Library - API Examples

Description

Requirements

Build

Examples