A system and method for creating a ring tone for an electronic device takes as
input a phrase sung in a human voice and transforms it into a control signal controlling,
for example, a ringer on a cellular telephone. Time-varying features of the input
signal are analyzed to segment the signal into a set of discrete notes and assigning
to each note a chromatic pitch value. The set of note start and stop times and
pitches are then translated into a format suitable for controlling the device.