tritonsigmoid fast sigmoid attention
what happens when you run a CUDA kernel
tiny vLLM C++ CUDA inference engine
NVIDIA CUDA oxide Rust compiler