| dc.description.abstract |
Speech coding is a technique that compresses speech signals into a smaller digital form, making it easier to transmit or store,
while still maintaining the quality and intelligibility of the speech. The review aimed to identify and analyses the most effec
tive waveform-based nonlinear speech coding prediction techniques, including the use of neural networks and polynomial
f
ilters. The study analyzed 29 publications from 2000 to 2023 and found that neural network-based models are widely used
for speech compression, with RNN topologies being favored due to their ability to introduce nonlinearity and nonstationary.
While nonlinear adaptive speech prediction techniques have been explored for speech coding, further research is needed
to optimize the adaptive algorithms used in these models. The review also identified a need for future research to address
quality performance and computational cost, and suggested further exploration of RNN predictor models. The methodology
used in this study involved a computer science approach that follows three main phases: planning, conducting, and reporting.
Six different stages were followed, including determining research questions, defining research approach, study selection
criteria, quality measurement criteria, data extraction strategy, and synthesizing extracted data. Overall, this study highlights
the need for continued research in the development and improvement of neural network-based speech compression models |
en_US |