Detecting Unsafe Training Data via Data Attribution Methods

In submission to ACL 2025, 2015

Yijun Pan, Taiwei Shi, Jieyu Zhao, Jiaqi W. Ma