cuda-kernel

Here are 12 public repositories matching this topic...

ELS-RD / kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

cuda pytorch transformer triton cuda-kernel

Updated Feb 16, 2024
Jupyter Notebook

teddykoker / torchsort

Star

Fast, differentiable sorting and ranking in PyTorch

pytorch sort ranking cuda-kernel

Updated Dec 29, 2023
Python

tpoisonooo / how-to-optimize-gemm

Star

row-major matmul optimization

vulkan cuda armv7 arm64 ptx gemm-optimization cuda-kernel int4

Updated Sep 9, 2023
C++

GithubRealFan / keccak256-blockchain-hash-opencl-kernel

Star

algorithm opencl cuda cuda-programming keccak256 cuda-kernel

Updated Jan 10, 2023
C

GithubRealFan / Simple-Projects-CUDA

Star

algorithm cpp opencl cuda cuda-programming cuda-kernel

Updated Feb 13, 2023
Cuda

GithubRealFan / Matrix-Multiply-CUDA

Star

c algorithm cpp opencl cuda cuda-programming cuda-kernel

Updated Mar 10, 2023
Cuda

rbga / CPU-vs-GPU-Matrix-Operation

Star

A performance comparison of standard matrix functions between CPU and GPU using Nvidia CUDA on Visual Studio using C++

Updated Jul 29, 2023
Cuda

ckswjd99-at-snu / SHPC-2023-2

Star

SNU CSE Scalable High Performance Computing (M1522.006700) - 2023 Autumn

opencl cuda avx512 cuda-kernel shpc

Updated Jan 13, 2024
C

kachi-group / ichida-algo

Star

Winning submission for StartHack 2024: HPC optimized multi-GPU/CPU inference

c neural-network cuda multithreaded simd-intrinsics high-perofrmance-computing cuda-kernel

Updated Aug 20, 2024
C

Shikha-code36 / CUDA-Programming-Beginner-Guide

Star

A beginner's guide to CUDA programming

cuda cuda-kernels cuda-demo cuda-toolkit cuda-support cuda-basics cuda-library cuda-programming cuda-basic cuda-kernel cuda-cpp

Updated Jul 29, 2024
Cuda

webis-de / pytorch-window-matmul

Star

a custom CUDA kernel for windowed matrix multiplication

cuda pytorch cuda-kernel

Updated Jan 3, 2024
Python

ProgrammerGnome / CUDA-codes

Star

Snippet repository for learning parallel GPU programming with CUDA.

c cuda parallelization learning-materials parallel-programming gpu-programming cpp-programming cuda-kernel

Updated Dec 29, 2023
C++

Improve this page

Add a description, image, and links to the cuda-kernel topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cuda-kernel topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda-kernel

Here are 12 public repositories matching this topic...

ELS-RD / kernl

teddykoker / torchsort

tpoisonooo / how-to-optimize-gemm

GithubRealFan / keccak256-blockchain-hash-opencl-kernel

GithubRealFan / Simple-Projects-CUDA

GithubRealFan / Matrix-Multiply-CUDA

rbga / CPU-vs-GPU-Matrix-Operation

ckswjd99-at-snu / SHPC-2023-2

kachi-group / ichida-algo

Shikha-code36 / CUDA-Programming-Beginner-Guide

webis-de / pytorch-window-matmul

ProgrammerGnome / CUDA-codes

Improve this page

Add this topic to your repo