My team at the Department of Informatics, King's College London, is looking to appoint a two-year post-doc (associate or fellow, depending on experience) in Technical AI Safety.
The position is funded by the Open Philanthropy grant “Verifiably Robust Conformal Probes”. The project’s goal is to develop methods for latent probing (also known as activation monitoring) of large language models (LLMs) that leverage certification and conformal prediction techniques to offer probabilistic and adversarial robustness guarantees. Applications include the detection of misaligned LLM behaviours such as deception, harmfulness, jailbreaking, and power-seeking.
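For readers unfamiliar with the conformal side of the project, here is a minimal, purely illustrative sketch (not project code) of the basic idea: a linear probe is trained on LLM activations, and split conformal prediction calibrates its outputs so that prediction sets come with a distribution-free coverage guarantee. The synthetic data, the choice of `LogisticRegression` as the probe, and the miscoverage level `alpha = 0.1` are all assumptions made for illustration.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic stand-in for hidden activations from an LLM layer, with binary
# labels (e.g. deceptive vs. benign behaviour). Purely illustrative.
rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 64))
y = (X[:, 0] + 0.5 * rng.normal(size=2000) > 0).astype(int)

# Split the data: train the probe, hold out a separate calibration set.
X_tr, y_tr = X[:1000], y[:1000]
X_cal, y_cal = X[1000:1500], y[1000:1500]
X_test = X[1500:]

probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)

# Split conformal prediction: the nonconformity score of a calibration
# example is 1 minus the probability the probe assigns to its true label.
cal_probs = probe.predict_proba(X_cal)
scores = 1.0 - cal_probs[np.arange(len(y_cal)), y_cal]

alpha = 0.1  # target miscoverage rate (assumed for this sketch)
n = len(scores)
q = np.quantile(scores, np.ceil((n + 1) * (1 - alpha)) / n, method="higher")

# Prediction sets: include every label whose score is below the threshold.
# Under exchangeability, each set contains the true label with probability
# at least 1 - alpha.
test_probs = probe.predict_proba(X_test)
pred_sets = test_probs >= 1.0 - q  # boolean matrix, one row per example
```

This sketch covers only the probabilistic guarantee; the “verifiably robust” part of the grant concerns certification of such probes against adversarial perturbations, which is beyond what this toy example shows.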
Deadline: 20 November 2025