tritonsigmoid fast sigmoid attention
llms are complicated now
executorch unified pytorch on device
llm from scratch
torchdae implicit dae solver