BLOOM训练环境 Megatron-deepspeed torch transformers BLOOM关键参考内容借鉴 代码仓库:https://github.com/bigscience-workshop/xmtf 训练脚本:https://github.com/bigscience-workshop/bigscience/tree/master/train/tr13-mtf huggingface:https://huggingface.co/bigscience/bloomz Previous 【LLM】chatGLM-SFT记录 Next chatGLM P-Tuning V2 SFT记录 CATALOG FEATURED TAGS LLM NLP SFT chatGLM BLOOM Megatron-Deepspeed Paper generate llama