The pitch extracting part generates a pitch waveform signal in a manner
making the time interval of the pitch of the input audio sound data to be
the same. After the number of samples in each region is made to be the
same by the re-sampling part, the pitch waveform signal is changed into a
subband data that express a time-varying-strength of a basic frequency
composition and a higher harmonic composition by the subband analyzing
part. The subband data are superimposed by a modulation wave composition
that expresses attaching data of an attaching object by the data
attaching part and is regarded as a bit stream to output through a
nonlinear quantizing. A portion expressing the higher harmonic
composition that is made corresponding to the audio sound expressed by
this audio sound data in the subband data are deleted by the encoding
part.