Study Reveals Key Differences in LLM Architectures for Cognitive Tasks
A new study finds that LLMs show distinct activation patterns across cognitive tasks: math reasoning has the highest attention entropy, and decoder models display greater sparsity.