Original Post Digest

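Andrej Karpathy considers what "food for thought," a human cognitive capability, would correspond to in LLMs, and suggests it is a specific token sequence that elicits a high-quality chain of thought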

Author: Andrej Karpathy (@karpathy)

Original source: https://x.com/karpathy/status/2001699564928279039

Body (Markdown)

I love the expression “food for thought” as a concrete, mysterious cognitive capability humans experience but LLMs have no equivalent for. Definition: “something worth thinking about or considering, like a mental meal that nourishes your mind with ideas, insights, or issues that require deeper reflection. It's used for topics that challenge your perspective, offer new understanding, or make you ponder important questions, acting as intellectual stimulation.” So in LLM speak it’s a sequence of tokens such that when used as prompt for chain of thought, the samples are rewarding to attend over, via some yet undiscovered intrinsic reward function. Obsessed with what form it takes. Food for thought.
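One way to make the "LLM speak" definition concrete is to score a prompt by the average reward of the chains of thought sampled from it. The sketch below is only an illustration under loud assumptions: `food_for_thought_score`, `toy_sample_cot`, and `toy_reward` are hypothetical names, the sampler is a random word shuffler rather than a real model, and the reward is a placeholder (lexical variety), since the post says the actual intrinsic reward function is still undiscovered.

```python
import random
from typing import Callable


def food_for_thought_score(
    prompt: str,
    sample_cot: Callable[[str, random.Random], str],
    intrinsic_reward: Callable[[str, str], float],
    n_samples: int = 8,
    seed: int = 0,
) -> float:
    """Monte Carlo estimate of the mean reward over chains of thought
    sampled from `prompt`. Both callables are supplied by the caller."""
    rng = random.Random(seed)
    rewards = [
        intrinsic_reward(prompt, sample_cot(prompt, rng))
        for _ in range(n_samples)
    ]
    return sum(rewards) / len(rewards)


def toy_sample_cot(prompt: str, rng: random.Random) -> str:
    # Hypothetical stand-in for an LLM sampler: draws 12 words at random
    # from the prompt. A real setup would sample a chain of thought here.
    words = prompt.split() or ["(empty)"]
    return " ".join(rng.choice(words) for _ in range(12))


def toy_reward(prompt: str, cot: str) -> float:
    # Placeholder reward: fraction of distinct words in the sample.
    # This is NOT the reward from the post, which is described as unknown.
    tokens = cot.split()
    return len(set(tokens)) / len(tokens)


flat = food_for_thought_score("why why why", toy_sample_cot, toy_reward)
rich = food_for_thought_score(
    "what form does a rewarding chain of thought take",
    toy_sample_cot,
    toy_reward,
)
print(f"flat prompt: {flat:.3f}, richer prompt: {rich:.3f}")
```

Under these toy assumptions, a prompt with more varied material scores higher than a repetitive one. Swapping in a real sampler and a better reward would turn the same loop into a way of ranking candidate prompts, which is the open question the post is pointing at.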