Skip to content

Latest commit

 

History

History
14 lines (8 loc) · 363 Bytes

README.md

File metadata and controls

14 lines (8 loc) · 363 Bytes

README

This repository contains CUDA C parallel implementations of some well-known algorithms.

Matrix Addition

Basic Kernel, Grid Stride Loop, CUDA Unified Memory.

Matrix Multiplication

Basic Kernel, Grid Stride Loop, Tiling, CUDA Unified Memory.

Convolution 2D

Basic Kernel, Tiling, CUDA Unified Memory.