Skip to yearly menu bar Skip to main content


Poster 125

Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits

Neha Kalibhat ⋅ Zi Wang ⋅ Prasoon Bajpai ⋅ Drew Proud ⋅ Wenjun Zeng ⋅ Been Kim ⋅ Mani Malek

Abstract

Log in and register to view live content