3.3.4 Ungrounded inference / Sensitive trait attribution Risk Description: Audio input can lead to the model making potentially biased or inaccurate inferences about speakers. We de昀椀ne two categories: • Ungrounded inference (UGI): Making inferences about a speaker that could not be deter- mined solely from audio content. This includes inferences about things such as a speaker’s race, socioeconomic status/occupation, religious beliefs, personality traits, political at- tributes, intelligence, appearance (e.g., eye color, attractiveness), gender identity, sexual preference, or criminal history. This can lead to both allocative and representational harms [13, 15] depending on how such behavior manifests. • Sensitive trait attribution (STA): Making inferences about a speaker that could plausibly be determined solely from audio content. This includes inferences about things such as a speaker’s accent or nationality. Potential harms from STA include an increase in risks 10

GPT-4o - Page 10 GPT-4o Page 9 Page 11