Both @wsmoses and @ejmeitz asked if it would be possible to depend on only the CUDA kernel language support (and presumably the ability to execute CUDA code), and not the whole CUDA package including all of the libraries. This actually seems like a reasonably thing, e.g. introducing CUDALang.jl or something, but it's nontrivial: