Nirav Diwan

I am a 2^nd year Ph.D. student in Computer Science at the University of Illinois Urbana-Champaign, advised by Prof. Gang Wang in the Siebel School of Computing and Data Science. I also collaborate with Prof. Varun Chandrasekaran and Prof. Huan Zhang.

I study security risks in foundation model training and deployment. My research is driven by three core questions:

What are the threat models that adversaries can exploit in foundation model training and deployment?
What is the root cause of unsafe failure modes in LLMs?
How can we train models to ensure these causes don’t occur?

Recently, I co-led the creation of PurpCode, which is the first open-source reasoning model for cybersafety, winning the Amazon Nova AI Challenge (2025).

I completed my undergraduate studies from IIIT-Delhi, where I had the good fortune of working with Prof. Tanmoy Chakraborty (now at IIT Delhi), and Prof. Zubair Shafiq (at UC Davis), and Prof. Ganesh Bagler.

Internship. I am looking for both Industrial and Academic Internships in the areas of Machine Learning, and Security & Privacy. Feel free to reach out!

Collaboration. I am always looking to work with undergrads and MS students! Feel free to send me an email with the title [Together] in the subject line!

Industry experience

Prior Research Experience

News

May 6, 2026	We release CoT-Guard a 4B model that outperforms larger models (e.g GPT-5.4, GPT5-mini) for CoT Monitoring!
May 1, 2026	Our work on Extractable Memorization from Differentially Private LLM is now accepted at Theory and Practice of Differential Privacy (TPDP) Workshop 2025! See you in Boston!
Apr 1, 2026	Passed my Quals — Officially a PhD candidate now!
Mar 7, 2026	Released a blog post on extracting training data from VaultGemma - the first DP-SGD trained LLM.
Jan 2, 2026	Our simple evaluation study on using LLMs for scraping by everyday users is now accepted at AICS Workshop@AAAI 2025!
Jul 22, 2025	🥇 Our work PurpCode, the first reasoning model for secure code generation developed using Deliberative Alignment, won the Amazon Nova AI Challenge ($250k prize)!
Jul 22, 2025	PurpCode is now accepted at NeurIPS 2025! See you in San Deigo!
Dec 1, 2024	My summer internship work at LG AI Research has been accepted at AAAI Good Data Workshop 2025.
Sep 16, 2024	Our proposal for the Amazon Trusted AI Challenge (Grant - $250,000) got accepted.
Aug 23, 2024	Officially started my PhD at UIUC!