Voice controlled electronic musical instrument page

The invention is an electronic, voice-controlled musical instrument. It is in essence an electronic kazoo. The player hums into the mouthpiece, and the device imitates the sound of a musical instrument whose pitch and volume change in response to the player's voice. The player is given the impression of playing the actual instrument and controlling it intimately with the fine nuances of his voice. The instrument can in principle be any music-producing sound source: a trumpet, trombone, clarinet, flute, piano, electric guitar, voice, whistle, even a chorus of voices, i.e. virtually any source of sound. In its simplest configuration, the instrument resembles a kind of horn. However, the shape and appearance of the instrument can be fashioned by the manufacture to match the sound of any traditional instrument, if desired; or its shape can be completely novel. The functional requirements of the invention's physical design are only: that it be hand-held; that it have a mouthpiece (5) where the player's voice enters; that it have one or more speakers (3) where the sound is produced; that it have a body (11) where the electronics and batteries are stored and where finger-actuated controls (1a, 1b) can be placed. There primary software components of the invention are the frequency-detection module, the loudness-tracking module, and the note-attack module. The frequency-detection module (FDM) identifies the frequency of the player's voice. It does this by analyzing the incoming sound wave and finding patterns of recurring shapes. This method is a highly computationally efficient and novel combination of auto-correlation and zero crossing- or peak-based pitch detection. The chosen instrument is synthesized at the pitch determined by the FDM or at an offset from that pith as desired by the player. The loudness-tracking component measures the loudness of the player's voice, and this information is used then to set the volume of the synthesized sound. The note-attack module detects abrupt changes in the loudness of the player's voice. This component helps decide when the synthesized instrument should begin a new note.