2023-08-04T03:09:51Z
https://zhuanlan.zhihu.com/p/601392771 Transformer 模型
https://zhuanlan.zhihu.com/p/626008090 浅析推理加速引擎FasterTransformer
https://zhuanlan.zhihu.com/p/612181615 大语言模型发展现状
https://www.philschmid.de/fine-tune-flan-t5-peft Efficient Large Language Model training with LoRA and Hugging Face
https://datawhalechina.github.io/thorough-pytorch/index.html 深入浅出PyTorch
https://pytorch.org/tutorials/intermediate/ddp_tutorial.html Pytorch DDP
https://blog.csdn.net/weixin_46782905/article/details/121480902 PyTorch DDP 学习
https://zhuanlan.zhihu.com/p/451671838 PyTorch Dispatcher
https://github.com/pytorch/xla/pull/3431/files FSDP in PyTorch XLA
https://github.com/kungfu-team/mindspore/blob/3fa5dd4495f4071b701e7ff490b7085b8824aaaa/tests/ut/cpp/serving/acl_stub.h#L473 mindspore 设计
https://handbook.pytorch.wiki/chapter4/4.1-fine-tuning.html fine tuning 微调
https://www.cnblogs.com/xiximayou/p/17345539.html pytorch在有限的资源下部署大语言模型
https://zhuanlan.zhihu.com/p/600223792 介绍triton很nb