| title | draft |
|---|---|
| Writing a Convolutional Neural Network library with CUDA Support | true |
"Just use cuBLAS, it'll be easier. You don't have to implement custom CUDA kernels.", they said. Actually, noone said that. I just thought that because I didn't do enough research.
Why not combine multiple challenging things into one project (C++, CMake, CUDA, CNNs)?
Quickly discovering that without writing custom kernels, you can't really make progress
- cuBLAS column-major layout and the indexing macro (see the first sketch after this list)
- CMake woes (FindCUDA)
- Google Test
- padding kernel (see the second sketch after this list)
- column-major / row-major headache
- removing cuBLAS -> just a row-major representation (see the last sketch after this list)
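
cuBLAS assumes column-major (Fortran-style) storage, while C++ code naturally wants row-major. The usual workaround, taken from the cuBLAS samples, is an indexing macro so host code can address a column-major buffer without transposing anything. A minimal sketch (the `IDX2C` macro follows the cuBLAS documentation; the fill function is just my illustration, not the library's actual code):

```cpp
// Column-major indexing macro from the cuBLAS examples:
// element (row i, column j) of a matrix with leading dimension ld.
#define IDX2C(i, j, ld) (((j) * (ld)) + (i))

// Fill an m x n matrix stored column-major, while iterating in the
// "natural" row-major order of a C++ nested loop.
void fill_column_major(float* a, int m, int n) {
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < n; ++j) {
            a[IDX2C(i, j, m)] = static_cast<float>(i * n + j);
        }
    }
}
```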
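
A zero-padding kernel is one of the simplest custom kernels a CNN library needs: the input has to be padded before the convolution window slides over it. A minimal sketch, assuming a single-channel row-major image and one thread per output element (names and layout are my illustration, not the library's actual code):

```cpp
// Zero-pad a row-major h x w image into an (h + 2*pad) x (w + 2*pad) output.
// One thread per output element; threads in the border write 0.
__global__ void pad_kernel(const float* in, float* out,
                           int h, int w, int pad) {
    int out_w = w + 2 * pad;
    int out_h = h + 2 * pad;
    int x = blockIdx.x * blockDim.x + threadIdx.x;  // output column
    int y = blockIdx.y * blockDim.y + threadIdx.y;  // output row
    if (x >= out_w || y >= out_h) return;

    int in_x = x - pad;
    int in_y = y - pad;
    bool inside = in_x >= 0 && in_x < w && in_y >= 0 && in_y < h;
    out[y * out_w + x] = inside ? in[in_y * w + in_x] : 0.0f;
}
```

A launch with something like a 16x16 block and a grid rounded up to the padded dimensions covers every output element.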
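
Dropping cuBLAS means everything can stay row-major, but it also means writing the matrix multiply yourself. A naive row-major GEMM kernel, purely as a sketch (no tiling or shared memory, and not the library's actual implementation):

```cpp
// C = A * B with row-major A (m x k), B (k x n), C (m x n).
// One thread computes one element of C.
__global__ void matmul_kernel(const float* a, const float* b, float* c,
                              int m, int k, int n) {
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row >= m || col >= n) return;

    float acc = 0.0f;
    for (int i = 0; i < k; ++i) {
        acc += a[row * k + i] * b[i * n + col];
    }
    c[row * n + col] = acc;
}
```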