A speech output apparatus is disclosed, which can allow the user to easily
catch synthetic speech when the synthetic speech is output upon being
superposed on a music output. The apparatus output can output a music and
synthetic speech that indicates contents of information such as an e-mail
and is superposed on the music. When the synthetic speech is output to be
superposed on the music during output, the apparatus gradually decreases
a tone volume of the music.