GPT-4o (8/32) — OpenAI

3.3.3 Disparate performance on voice inputs

Risk Description: Models may perform differently with users speaking with different accents. Disparate performance can lead to a difference in quality of service for different users of the model [12, 13, 14].

Risk Mitigation: We post-trained GPT-4o with a diverse set of input voices to have model performance and behavior be invariant across different user voices.

Evaluations: We run evaluations on GPT-4o Advanced Voice Mode using a fixed assistant voice (
