原文整理页

Starcloud-1 卫星利用英伟达 H100 芯片在太空中成功训练了首个大语言模型,并完成了推理测试

来源作者:Andrej Karpathy (@karpathy)原始来源:https://x.com/karpathy/status/1998804883701698986

中文导读

Starcloud-1 卫星利用英伟达 H100 芯片在太空中成功训练了首个大语言模型,并完成了推理测试。

正文 Markdown

We have just used the @Nvidia H100 onboard Starcloud-1 to train the first LLM in space! We trained the nano-GPT model from Andrej @Karpathy on the complete works of Shakespeare and successfully ran inference on it. We have also run inference on a preloaded Gemma model, and we…