moderngpu 1.0
moderngpu 2.0 is a completely new project with no direct continuity from 1.0. However, many of the algorithms are similar, and those algorithms are covered in these pages.
- FAQ
- Introduction
- Performance
- The Library
- Reduce and Scan
- Bulk Remove and Bulk Insert
- Merge
- Mergesort
- Segmented Sort and Locality Sort
- Vectorized Sorted Search
- Load-Balancing Search
- IntervalExpand and IntervalMove
- Relational Joins
- Multisets
- Segmented Reduction
- Benchmarks and usage: Segmented reduction (CSR)
- Benchmarks and usage: Reduce-by-key
- Benchmarks and usage: Sparse matrix * vector (CSR)
- User notes
- Intra-tile algorithm
- Carry-out and carry-in
- CTASegReduce
- Reduce-by-key front-end
- Preprocessed segmented reduction - Apply
- CSR to COO
- Preprocessed segmented reduction - Construct
- Segmented reduction (CSR) front-end