New directions in Evolution Strategy fine-tuning

16 views

Skip to first unread message

Risto Miikkulainen

unread,

Mar 1, 2026, 11:18:53 PM (2 days ago) Mar 1

to Reinforcement Learning Mailing List

Evolution strategies is an alternative to gradient descent in fine-tuning LLMs. Cognizant AI Lab and collaborators recently completed four new papers on it, evaluating it in several new tasks including math and ARC-AGI, extending it to fine-tuning quantized models, demonstrating how LLMs can be trained to know what they know, and showing why high dimensionality of LLMs is a blessing (not a curse) in fine tuning. For more details, see

Overview blog: https://cgnz.at/6005QZNMb

Papers: https://cgnz.at/6007QZN13 (tasks)

https://cgnz.at/6003QZNHN (quantized)

https://cgnz.at/6002QZNGG (metacognition)

https://cgnz.at/6000QZNyI (blessing)

Animated illustration: https://x.com/RobertoDailey1/status/2026791418380087442

-- Risto

Reply all

Reply to author

Forward

0 new messages