Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
[Invited Talk] Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
Invited talk for the UMD Digital Twin reading group; discussed recent ACM FAccT paper, which uses a new methodology to recover latent community preferences for downstream alignment use.