New directions in Evolution Strategy fine-tuning

16 views
Skip to first unread message

Risto Miikkulainen

unread,
Mar 1, 2026, 11:18:53 PM (2 days ago) Mar 1
to Reinforcement Learning Mailing List
Evolution strategies is an alternative to gradient descent in fine-tuning LLMs. Cognizant AI Lab and collaborators recently completed four new papers on it, evaluating it in several new tasks including math and ARC-AGI, extending it to fine-tuning quantized models, demonstrating how LLMs can be trained to know what they know, and showing why high dimensionality of LLMs is a blessing (not a curse) in fine tuning. For more details, see

Papers: https://cgnz.at/6007QZN13 (tasks)
        https://cgnz.at/6003QZNHN (quantized)
        https://cgnz.at/6002QZNGG (metacognition)
        https://cgnz.at/6000QZNyI (blessing)

-- Risto
Reply all
Reply to author
Forward
0 new messages