https://github.com/patrick-toulme/justabyte/blob/main/cutile_blackwell_post/run_moe_dump_all.py