https://zhuanlan.zhihu.com/p/601392771 Transformer 模型 https://zhuanlan.zhihu.com/p/626008090 浅析推理加速引擎FasterTransformer https://zhuanlan.zhihu.com/p/612181615 大语言模型发展现状 https://www.philschmid.de/fine-tune-flan-t5-peft Efficient Large Language Model training with LoRA and Hugging Face https://datawhalechina.github.io/thorough-pytorch/index.html 深入浅出PyTorch https://pytorch.org/tutorials/intermediate/ddp_tutorial.html Pytorch DDP https://blog.csdn.net/weixin_46782905/article/details/121480902 PyTorch DDP 学习 https://zhuanlan.zhihu.com/p/451671838 PyTorch Dispatcher https://github.com/pytorch/xla/pull/3431/files FSDP in PyTorch XLA https://github.com/kungfu-team/mindspore/blob/3fa5dd4495f4071b701e7ff490b7085b8824aaaa/tests/ut/cpp/serving/acl_stub.h#L473 mindspore 设计 https://handbook.pytorch.wiki/chapter4/4.1-fine-tuning.html fine tuning 微调 https://www.cnblogs.com/xiximayou/p/17345539.html pytorch在有限的资源下部署大语言模型 https://zhuanlan.zhihu.com/p/600223792 介绍triton很nb