https://github.com/flagos-ai/libtriton_jit
https://github.com/triton-lang/triton/pull/9600
https://github.com/DF4FM/CUDA-Agent/tree/main/results Someone forked it early, so you can keep following the drama