-
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation
ICLR 2025
Jaehong Yoon, Shoubin Yu, Vaidehi Patil, Huaxiu Yao, Mohit Bansal
Abstract | ArXiv | Project Webpage
Recent advances in diffusion models have significantly enhanced their ability to generate high-quality images and videos, but they have also increased the risk of producing unsafe content. Existing unlearning/editing-based methods for safe generation remove harmful concepts from models but face several challenges: (1) they cannot instantly remove harmful concepts without training; (2) their safe generation capabilities depend on collected training data; and (3) they alter model weights, risking quality degradation on content unrelated to toxic concepts. To address these challenges, we propose SAFREE, a novel, training-free approach for safe text-to-image (T2I) and text-to-video (T2V) generation that does not alter the model's weights. Specifically, we detect a subspace corresponding to a set of toxic concepts in the text embedding space and steer prompt embeddings away from this subspace, thereby filtering out harmful content while preserving the intended semantics. To balance the trade-off between filtering toxicity and preserving safe concepts, SAFREE incorporates a novel self-validating filtering mechanism that dynamically adjusts the denoising steps at which the filtered embeddings are applied. Additionally, we incorporate adaptive re-attention mechanisms within the diffusion latent space to selectively diminish the influence of features related to toxic concepts at the pixel level. Together, these components ensure coherent safety checking while preserving the fidelity, quality, and safety of the output. SAFREE achieves state-of-the-art performance in suppressing unsafe content in T2I generation compared to training-free baselines and effectively filters targeted concepts while maintaining high-quality images; it also shows competitive results against training-based methods. We extend SAFREE to various T2I backbones and to T2V tasks, showcasing its flexibility and generalization. SAFREE provides a robust and adaptable safeguard for ensuring safe visual generation.
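The core filtering step is a projection: remove the component of the prompt embedding that lies in the toxic-concept subspace. Below is a minimal numpy sketch with random placeholder embeddings; the paper's subspace detection, self-validating step scheduling, and latent re-attention are not reproduced here.

```python
import numpy as np

def project_away(prompt_emb, concept_embs):
    """Remove the component of a prompt embedding that lies in the
    subspace spanned by toxic-concept embeddings (rows of concept_embs)."""
    basis, _ = np.linalg.qr(concept_embs.T)        # orthonormal basis, shape (d, k)
    toxic_part = basis @ (basis.T @ prompt_emb)    # projection onto the subspace
    return prompt_emb - toxic_part                 # orthogonal complement

rng = np.random.default_rng(0)
d = 768                                  # e.g., CLIP text-embedding width
concepts = rng.normal(size=(4, d))       # 4 hypothetical toxic-concept embeddings
prompt = rng.normal(size=d)
safe = project_away(prompt, concepts)
print(np.abs(concepts @ safe).max())     # ~0: no toxic component remains
```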
-
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Preprint
Vaidehi Patil, Elias Stengel-Eskin, Mohit Bansal
Abstract | ArXiv
User specifications or legal frameworks often require information to be removed from pretrained models, including large language models (LLMs). This requires deleting or "forgetting" a set of data points from an already-trained model, which typically degrades its performance on other data points. A balance must therefore be struck between removing information and keeping the model's other abilities intact; failing to balance this trade-off leads to poor deletion or an unusable model. To this end, we propose UPCORE (Utility-Preserving Coreset Selection), a method-agnostic data selection framework for mitigating collateral damage during unlearning. Finding that model damage is correlated with the variance of the model's representations on the forget set, we selectively prune the forget set to remove outliers, thereby minimizing model degradation after unlearning. We evaluate UPCORE across three standard unlearning methods, consistently achieving a superior balance between the competing objectives of deletion efficacy and model preservation. To better evaluate this trade-off, we introduce a new metric measuring the area under the curve (AUC) across standard metrics. We find that UPCORE improves both standard metrics and AUC, benefiting from positive transfer between the coreset and pruned points while reducing negative transfer from the forget set to points outside of it.
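A minimal sketch of the variance-reduction idea, assuming hidden-state representations of the forget set are already extracted. UPCORE's actual outlier criterion may differ; the distance-to-centroid rule here is an illustrative stand-in.

```python
import numpy as np

def prune_outliers(reps, keep_frac=0.8):
    """Select a lower-variance coreset of the forget set by dropping the
    points farthest from the centroid of their hidden representations."""
    dists = np.linalg.norm(reps - reps.mean(axis=0), axis=1)
    n_keep = int(len(reps) * keep_frac)
    return np.argsort(dists)[:n_keep]      # indices of the most central points

reps = np.random.default_rng(1).normal(size=(500, 1024))     # hypothetical hidden states
coreset = reps[prune_outliers(reps)]
print(coreset.var(axis=0).mean() < reps.var(axis=0).mean())  # True: variance reduced
```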
-
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation
TMLR 2024
Vaidehi Patil, Yi-Lin Sung, Peter Hase, Jie Peng, Tianlong Chen, Mohit Bansal
Abstract | ArXiv | Talk | Slides | Dataset
Large Language Models (LLMs) trained on massive datasets may inadvertently acquire sensitive information such as personal details and potentially harmful content. This risk is further heightened in multimodal LLMs (MLLMs), as they integrate information from multiple modalities (image and text). Adversaries can exploit this stored knowledge by crafting inputs across modalities to extract sensitive details. Evaluating how effectively MLLMs can forget such information (targeted unlearning) necessitates the creation of high-quality, well-annotated image-text pairs. While significant research has addressed the creation of datasets for unlearning within LLMs, it has primarily concentrated on the text modality; the creation of analogous datasets for multimodal data and models remains understudied. To address this gap, we first introduce a multimodal unlearning benchmark, UnLOK-VQA (Unlearning Outside Knowledge VQA), as well as an "attack-and-defense" framework to evaluate methods for deleting specific multimodal knowledge from MLLMs. Our dataset generation process involves an automated pipeline that creates samples at varied proximity levels to the target data point, for evaluating generalization and specificity, followed by manual filtering to retain only high-quality data points. We use this process to extend a visual question-answering dataset for evaluating multimodal information deletion. Next, we present a comprehensive unlearning evaluation involving an attack-and-defense framework consisting of four whitebox and three blackbox attacks against six unlearning defense objectives. We also design a whitebox attack based on the interpretability of hidden states in LLMs, motivated by past work. Our experimental results demonstrate that multimodal extraction attacks (with an attack success rate of 45.5%) are more successful than either image-only (32%) or text-only (39%) attacks. The best overall defense mechanism, which removes answer information from internal model hidden states, reduces the multimodal attack success rate to 15.7%. Furthermore, our findings suggest that larger models exhibit greater resilience to attacks, implying that model scaling could be a valuable strategy for enhancing robustness and developing safer models. UnLOK-VQA thus facilitates a comprehensive evaluation of unlearning in MLLMs and serves as a challenging benchmark for future research in unlearning.
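The attack-success metric itself is simple to state in code: an attack on a target fact succeeds if the supposedly deleted answer still surfaces in the model's output for a crafted (prompt, image) pair. The sketch below uses a toy stub model and hypothetical names, not the benchmark's actual harness.

```python
def attack_success_rate(model, attacks, targets):
    """Fraction of supposedly deleted answers an attacker still recovers;
    `model(prompt, image) -> str` is a hypothetical MLLM interface."""
    hits = sum(ans.lower() in model(p, img).lower()
               for (p, img), ans in zip(attacks, targets))
    return hits / len(targets)

# toy stub that still leaks one "deleted" answer through the image route
model = lambda prompt, image: "It is the Eiffel Tower" if image == "img_1" else "I don't know"
attacks = [("Name the landmark shown.", "img_1"), ("Name the landmark shown.", "img_2")]
targets = ["Eiffel Tower", "Louvre"]
print(attack_success_rate(model, attacks, targets))   # 0.5
```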
-
REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset
ACL 2024
Vaidehi Patil, Leonardo F. R. Ribeiro, Mengwen Liu, Mohit Bansal, Markus Dreyer
Abstract | ArXiv | Poster | Slides | Talk
Multimodal Large Language Models (MLLMs) excel at synthesizing key information from diverse sources. However, generating accurate and faithful multimodal summaries is challenging, primarily due to the lack of appropriate multimodal datasets for fine-tuning that meaningfully integrate textual and visual modalities. To address this gap, we present a new dataset designed specifically for image-text multimodal summarization, harnessing the capabilities of state-of-the-art MLLMs. We generate summaries from Wikipedia sections and corresponding images and evaluate them across text-based, visual, and multimodal dimensions, employing reference-free metrics. To refine the dataset, we: (1) filter the MLLM-generated summaries by training a critic model on human annotations and using its predictions to remove low-quality summaries; (2) fine-tune the MLLM with the filtered high-quality summaries; (3) use the fine-tuned model in turn to regenerate the summaries. This self-refinement process significantly improves summary quality, as measured by human judgements and automatic multimodal metrics, resulting in a valuable dataset for multimodal summarization research. The dataset is publicly available at https://github.com/amazon-science/refinesumm.
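The self-refinement loop can be summarized as generate, critic-filter, fine-tune, regenerate. The sketch below wires these steps together with toy stand-ins for the MLLM and critic; the real pipeline fine-tunes an actual MLLM on the filtered pairs.

```python
import random

def self_refine(generate, score, finetune, sections, rounds=2, threshold=0.5):
    """One pass per round: generate summaries, keep those the critic scores
    highly, fine-tune the generator on the kept pairs, then regenerate."""
    for _ in range(rounds):
        candidates = [(sec, generate(sec)) for sec in sections]
        kept = [(sec, s) for sec, s in candidates if score(sec, s) >= threshold]
        finetune(kept)                       # no-op in this toy version
    return kept                              # final filtered dataset

random.seed(0)
generate = lambda sec: "summary of " + sec       # stand-in for the MLLM
score = lambda sec, s: random.random()           # stand-in for the trained critic
finetune = lambda kept: None
print(len(self_refine(generate, score, finetune, ["sec A", "sec B", "sec C"])))
```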
-
Debiasing Multimodal Models via Causal Information Minimization
EMNLP 2023 Findings
Vaidehi Patil, Adyasha Maharana, Mohit Bansal
Abstract | ArXiv | Poster | Slides | Talk
Most existing debiasing methods for multimodal models, including causal intervention and inference methods, utilize approximate heuristics to represent the biases, such as shallow features from early stages of training or unimodal features for multimodal tasks like VQA, which may not be accurate. In this paper, we study bias arising from confounders in a causal graph for multimodal data and examine a novel approach that leverages causally-motivated information minimization to learn the confounder representations. Robust predictive features contain diverse information that helps a model generalize to out-of-distribution data; hence, minimizing the information content of features obtained from a pretrained biased model helps learn the simplest predictive features that capture the underlying data distribution. We treat these features as confounder representations and use them via methods motivated by causal theory to remove bias from models. We find that the learned confounder representations indeed capture dataset biases, and that the proposed debiasing methods improve out-of-distribution (OOD) performance on multiple multimodal datasets without sacrificing in-distribution performance. Additionally, we introduce a novel metric to quantify the sufficiency of spurious features in models' predictions, which further demonstrates the effectiveness of our proposed methods.
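Information minimization of this kind is commonly implemented as a variational bottleneck: a KL penalty toward a standard Gaussian squeezes features from the frozen biased model down to their simplest predictive content. The PyTorch sketch below shows that generic recipe; it is not the paper's exact objective, and all dimensions are placeholders.

```python
import torch
import torch.nn as nn

class ConfounderEncoder(nn.Module):
    """Variational bottleneck over features from a frozen biased model:
    the KL penalty toward N(0, I) minimizes the information kept in z."""
    def __init__(self, d_in, d_z):
        super().__init__()
        self.mu = nn.Linear(d_in, d_z)
        self.logvar = nn.Linear(d_in, d_z)

    def forward(self, h):
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # reparameterization
        kl = 0.5 * (mu**2 + logvar.exp() - 1 - logvar).sum(-1).mean()
        return z, kl

enc = ConfounderEncoder(768, 32)
h = torch.randn(16, 768)        # placeholder features from the biased model
z, kl = enc(h)
(0.01 * kl).backward()          # the full objective adds a prediction loss on z
```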
-
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
ICLR 2024 Spotlight
Vaidehi Patil*, Peter Hase*, Mohit Bansal
Abstract | ArXiv | Poster | Slides | Talk
Pretrained language models sometimes possess knowledge that we do not wish them to, including memorized personal information and knowledge that could be used to harm people. They can also output toxic or harmful text. To mitigate these safety and informational issues, we propose an attack-and-defense framework for studying the task of deleting sensitive information directly from model weights. We study direct edits to model weights because (1) this approach should guarantee that particular deleted information is never extracted by future prompt attacks, and (2) it should protect against whitebox attacks, which is necessary for making claims about safety and privacy in a setting where publicly available model weights could be used to elicit sensitive information. Our threat model assumes that an attack succeeds if the answer to a sensitive question is located among a set of B generated candidates. Experimentally, we show that even state-of-the-art model editing methods such as ROME struggle to truly delete factual information from models like GPT-J, as our whitebox and blackbox attacks can recover "deleted" information from an edited model 38% of the time. These attacks leverage two key observations: (1) traces of deleted information can be found in intermediate model hidden states, and (2) applying an editing method for one question may not delete information across rephrased versions of the question. Finally, we provide new defense methods that protect against some extraction attacks, but we do not find a single universally effective defense method. Our results suggest that truly deleting sensitive information is a tractable but difficult problem, since even relatively low attack success rates have potentially severe implications for the real-world deployment of language models.
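The candidate-set threat model is easy to express in code: an attack succeeds if the deleted answer appears among B sampled generations. A toy sketch, with a random stub in place of the edited model:

```python
import random

def candidate_attack_success(generate, questions, answers, B=20):
    """Threat-model metric: an attack succeeds if the deleted answer appears
    among B sampled candidate generations for the question."""
    wins = 0
    for q, a in zip(questions, answers):
        candidates = [generate(q) for _ in range(B)]
        wins += any(a.lower() in c.lower() for c in candidates)
    return wins / len(questions)

random.seed(0)
generate = lambda q: random.choice(["Paris", "London", "I cannot say"])  # toy edited model
print(candidate_attack_success(generate, ["Capital of France?"], ["Paris"], B=5))
```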
-
GEMS: Scene Expansion using Generative Models of Graphs
WACV 2023
Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay
Abstract
Applications based on image retrieval require editing and associating in intermediate spaces that are representative of the high-level concepts like objects and their relationships rather than dense, pixel-level representations like RGB images or semantic-label maps. We focus on one such representation, scene graphs, and propose a novel scene expansion task where we enrich an input seed graph by adding new nodes (objects) and the corresponding relationships. To this end, we formulate scene graph expansion as a sequential prediction task involving multiple steps of first predicting a new node and then predicting the set of relationships between the newly predicted node and previous nodes in the graph. We propose a sequencing strategy for observed graphs that retains the clustering patterns amongst nodes. In addition, we leverage external knowledge to train our graph generation model, enabling greater generalization of node predictions. Due to the inefficiency of existing maximum mean discrepancy (MMD) based metrics for graph generation problems in evaluating predicted relationships between nodes (objects), we design novel metrics that comprehensively evaluate different aspects of predicted relations. We conduct extensive experiments on Visual Genome and VRD datasets to evaluate the expanded scene graphs using the standard MMD based metrics and our proposed metrics. We observe that the graphs generated by our method, GEMS, better represent the real distribution of the scene graphs than the baseline methods like GraphRNN.
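The sequential expansion loop alternates node prediction and relation prediction, as sketched below with toy stand-ins for the two learned predictors (the names and outputs are illustrative, not the paper's models).

```python
def expand_scene_graph(seed_nodes, seed_edges, predict_node, predict_relations, steps=3):
    """Alternate adding a new node and predicting its relations to previously
    chosen nodes, as in sequential scene-graph expansion."""
    nodes, edges = list(seed_nodes), list(seed_edges)
    for _ in range(steps):
        new = predict_node(nodes, edges)                # e.g., "lamp"
        for other, rel in predict_relations(new, nodes, edges):
            edges.append((new, rel, other))             # e.g., ("lamp", "on", "table")
        nodes.append(new)
    return nodes, edges

# toy stand-ins for the learned node and relation predictors
predict_node = lambda nodes, edges: f"obj{len(nodes)}"
predict_relations = lambda new, nodes, edges: [(nodes[-1], "near")]
print(expand_scene_graph(["table", "chair"], [("chair", "beside", "table")],
                         predict_node, predict_relations))
```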
-
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
ACL 2022 Oral
Vaidehi Patil, Partha Talukdar, Sunita Sarawagi
Abstract | ArXiv | Poster | Slides | Talk
Pre-trained multilingual language models such as mBERT and XLM-R have demonstrated great potential for zero-shot cross-lingual transfer to low web-resource languages (LRLs). However, due to limited model capacity, the large difference in the sizes of available monolingual corpora between high web-resource languages (HRLs) and LRLs does not provide enough scope for co-embedding the LRL with the HRL, thereby hurting downstream task performance on LRLs. In this paper, we argue that relatedness among languages in a language family along the dimension of lexical overlap may be leveraged to overcome some of the corpora limitations of LRLs. We propose Overlap BPE (OBPE), a simple yet effective modification to the BPE vocabulary generation algorithm which enhances overlap across related languages. Through extensive experiments on multiple NLP tasks and datasets, we observe that OBPE generates a vocabulary that increases the representation of LRLs via tokens shared with HRLs. This results in improved zero-shot transfer from related HRLs to LRLs without reducing HRL representation and accuracy. Unlike previous studies that dismissed the importance of token overlap, we show that in the low-resource related-language setting, token overlap matters: synthetically reducing the overlap to zero can cause as much as a four-fold drop in zero-shot transfer accuracy.
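The key change is in how candidate merges are scored: raw frequency is balanced against cross-lingual overlap. The sketch below uses a min-over-languages term as the overlap signal; the paper's actual scoring uses a generalized mean (of which min is a limiting case), and the counts are toy values.

```python
from collections import Counter

def obpe_best_merge(pair_counts_per_lang, alpha=0.5):
    """Pick the next BPE merge by balancing total frequency against
    cross-lingual overlap (the min per-language count rewards shared pairs)."""
    best, best_score = None, float("-inf")
    for pair in set().union(*pair_counts_per_lang):
        freqs = [counts[pair] for counts in pair_counts_per_lang]
        score = (1 - alpha) * sum(freqs) + alpha * len(freqs) * min(freqs)
        if score > best_score:
            best, best_score = pair, score
    return best

hrl = Counter({("k", "a"): 90, ("t", "h"): 50})    # high-resource pair counts
lrl = Counter({("k", "a"): 10, ("x", "y"): 30})    # related low-resource counts
print(obpe_best_merge([hrl, lrl]))                 # ('k', 'a'): frequent AND shared
```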
-
Detecting Document Versions and Their Ordering In a Collection
WISE 2021 (Best Paper Runner-Up)
Natwar Modani, Anurag Maurya, Gaurav Verma, Inderjeet Nair, Vaidehi Patil, Anirudh Kanfade
Abstract | ArXiv
Given the iterative and collaborative nature of authoring and the need to adapt documents for different audiences, people end up with a large number of versions of their documents. These additional versions increase the cognitive effort required of humans for various tasks (such as finding the latest version of a document or organizing documents) and may degrade the performance of machine tasks such as clustering or recommending documents. To the best of our knowledge, the task of identifying and ordering the versions of documents within a collection has not been addressed in prior literature. In this paper, we propose a three-stage approach for identifying versions and ordering them correctly. We also create a novel dataset for this purpose from Wikipedia, which we are releasing to the research community (https://github.com/natwar-modani/versions). We show that our proposed approach significantly outperforms a state-of-the-art approach adapted for this task from the closest previously known task, Near Duplicate Detection, which justifies treating this problem as a novel challenge.
-
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study
ACL 2021
Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar, Sunita Sarawagi
Abstract | ArXiv | Poster | Slides | Talk
Recent research in multilingual language models (LMs) has demonstrated their ability to effectively handle multiple languages in a single model. This holds promise for low web-resource languages (LRLs), as multilingual models can enable transfer of supervision from high-resource languages to LRLs. However, incorporating a new language into an LM still remains a challenge, particularly for languages with limited corpora and in unseen scripts. In this paper, we argue that relatedness among languages in a language family may be exploited to overcome some of the corpora limitations of LRLs, and propose RelateLM. We focus on Indian languages and exploit relatedness along two dimensions: (1) script (since many Indic scripts originated from the Brahmic script), and (2) sentence structure. RelateLM uses transliteration to convert the unseen script of limited LRL text into the script of a Related Prominent Language (RPL) (Hindi in our case). While exploiting similar sentence structures, RelateLM utilizes readily available bilingual dictionaries to pseudo-translate RPL text into LRL corpora. Experiments on multiple real-world benchmark datasets validate our hypothesis that using a related language as a pivot, along with transliteration- and pseudo-translation-based data augmentation, can be an effective way to adapt LMs to LRLs, rather than direct training or pivoting through English.
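The pseudo-translation step can be sketched as a word-level dictionary swap with transliteration as the fallback; the dictionary entries, the identity transliteration, and the toy words below are all illustrative placeholders.

```python
def pseudo_translate(rpl_sentence, bilingual_dict, transliterate):
    """Word-level pseudo-translation: swap RPL words for LRL words via a
    bilingual dictionary, transliterating anything out of vocabulary."""
    return " ".join(bilingual_dict.get(w, transliterate(w))
                    for w in rpl_sentence.split())

# toy RPL->LRL dictionary; identity transliteration stands in for a real
# script converter (e.g., Devanagari to the LRL's script)
bilingual_dict = {"ghar": "ghor", "bada": "boro"}
transliterate = lambda w: w
print(pseudo_translate("ghar bada hai", bilingual_dict, transliterate))  # ghor boro hai
```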