Contains the audio instructions. Common versions include natural voices such as "Peter" and more advanced Text-to-Speech (TTS) voices.