See what the CodeGayHub community is most excited about today.
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Fast parallel CTC.
GPU database engine
Fully Convolutional Instance-aware Semantic Segmentation
MatConvNet: CNNs for MATLAB
Introduction to Parallel Programming class code
Automatically exported from code.google.com/p/cuda-convnet2
Optimized primitives for collective multi-GPU communication
A GPU implementation of Convolutional Neural Nets in C++
Fast, gpu-based CSV parser
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
CUB is a flexible library of cooperative threadblock primitives and other utilities for CUDA kernel programming.
High-Performance Graph Primitives on GPUs
A CUDA backend for Torch7
Facebook's CUDA extensions.
Code release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.
Fork of Alex Krizhevsky's cuda-convnet 1. Adds dropout.
DeepSpeech neon implementation
Source code that accompanies The CUDA Handbook.
A CUDA implementation of SIFT for NVidia GPUs (2.6 ms on a GTX 1060)
CUDA Data Parallel Primitives Library
Unsupervised Learning of Video Representations using LSTMs