Speech Coding Using an Analysis-by-Synthesis Sinusoidal Model
Signal Compression Laboratory Research Project
Researcher: | Cagri Etemoglu |
Faculty: | Dr. Vladimir Cuperman |
Research Focus: | To achieve toll quality speech at
4kbps, we propose an AbS sinusoidal speech coder in which sinusoidal components are
extracted in a closed loop fashion. The problem in open loop analysis is the fact that
analysis does not correlate well with the synthesis which results in degraded speech
quality. In the closed loop method, sinusoidal components are extracted recursively.
Consequently, at each instant the encoder knows the current error, which helps it to
correlate well with the decoder. The same method can be used to analyze different types of
frames. The parameters are the amplitudes, frequencies and phases of the sinusoids.
Amplitudes will be quantized with variable dimension VQ techniques. Phases are modeled in
voiced frames as quadratic phase, in unvoiced frames as random phase and in transition
frames they are quantized. Frequencies extracted by the analysis are constrained to a
candidate frequency space so that the resulting frequencies will be amenable to efficient
modeling.
|
Presentation: | Speech Coding Using an Analysis-by-Synthesis Sinusoidal Model |