Choose a scenario and record one sentence in English; the coach will then reply. Your audio is sent to the backend for speech recognition (ASR), coach response generation, and text-to-speech (TTS) playback.