TY - JOUR
T1 - Simple speech transform coding scheme using forward adaptive quantization for discrete input signal
AU - Perić, Zoran
AU - Tančić, Milan
AU - Simić, Nikola
AU - Despotović, Vladimir
N1 - Funding Information:
This work is supported by the Serbian Ministry of Education, Science and Technologic Development (Project TR 32035).
Funding Information:
Ackno4wledgmen. Biswas, Ats., Sahu, P. K., Bhowmick, A., Chandra, M. Speech Recognition Using ERB-like Admissible Wavelet Packet This work is supported by the Serbian Ministry of Education, ISEcTiEencJeou arnnadl ToefchRneosleoagrcihc, D2e01velopmen5, 62(2),t 129-139. https://doi.org/10.1080/03772063.2015.1056844 (Project TR 32035). 5. Chu, W. C. Speech Coding Algorithms. Foundation and Evolution of Standardized Coders, John Wiley & Sons, New Jersey, 2003, Chapters 5-6, 143-183. 6. Farias, R. C., Brossier, J-M. Scalar Quantization for Estimation: From an Asimptotic Design to a Practical 4. Biswas, AS., oSlauhtiuo,n P. . KIE.,E BEhoTwrmaniscakc,t Aio.n, Cs hoandSriag,n Mal. SPpreoececshsing, 2014, Recognition U62(11sing ERB), -like Admissible Wavelet Packet2860-2870. Decomposition based on Perceptual Sub-Band Weight-ing. IETE Journal of Research, 2015, 62(2), 129-139. Learning Architectures for Speech Emotion Recognition. https://doNi.oreugra/l10.1080/N03etw77or2063ks, .201520.105684417, 92, 60-68.
Funding Information:
quantizer design and deep analyThis work is supported bysis of perfthe Seormancerbian Ministry of Education, Science and Technologic Development As it is a common c(Project TRas32e tha035t the inf). ormation about con- tinuous signal is not available in digitization systems and that system performance should be estimated using the information about discrete signal variance, it has been anaDlyetzeermd pineartifoonrmofaQnucaes ie-Lstoigmaraithimonic inQ ubaonthiz cerasfo-r Laplacian es. It has been shown that the proposed coding scheme provides excellent estimation in the case of input signal variances lower than 5 [dB] for various values of Impulse Signals. Automatic Control and Computer Sciences, system paramet2017er, s. However, the 5m1(o5d),el overrates the 294-300. performanceh tftopsr: /h/digoih.oerrg /v1a0r.3ia1n03c/eSs0 i1n4 6t4h1e1 c6a17s0e5 w00h2e9n the informa3t.ionB aabvoiruiste tctio, nDt.i nPu.,o Duhsu slii,gRn.alF uis ionno to favInafirlarbedle,and Visible due to support SenrsorangIme amismages Basetdch of on Anquanisotrotizpicer DiQ0ffu, swhichion and Karhunen- will be considerLoeved in the future Transform. IEe rEEesSearensoch.rs Journal, 2016, 17(1), 203- 209 https://doi.org/10.1109/JSEN.2015.2478655
Publisher Copyright:
© 2019, Kauno Technologijos Universitetas. All rights reserved.
PY - 2019
Y1 - 2019
N2 - The speech coding scheme based on the simple transform coding and forward adaptive quantization for discrete input signal processing is proposed in this paper. The quasi-logarithmic quantizer is applied for discretization of continuous input signal, i.e. for preparing discrete input. The application of forward adaptation based on the input signal variance provides more efficient bandwidth usage, whereas utilization of transform coding provides sub-sequences with more predictable signal characteristics that ensure higher quality of signal reconstruction at the receiving end. In order to provide additional compression, transform coding precedes adaptive quantization. As an objective measure of system performance, signal-to-quantization-noise ratio is used. System performance is discussed for two typical cases. In the first case, it was considered that the information about continuous signal variance is available, whereas the second case considers system performance estimation when only the information about discretized signal variance is present, which means that there is a loss of input signal information. The main goal of such performance estimation comparison of the proposed speech signal coding model is to explore what is the objectivity of performance if the information about a continuous source is absent, which is a common phenomenon in digital systems. The advantages of the proposed coding scheme are demonstrated by comparing the performance of the reconstructed signal with other similar exiting speech signal coding systems.
AB - The speech coding scheme based on the simple transform coding and forward adaptive quantization for discrete input signal processing is proposed in this paper. The quasi-logarithmic quantizer is applied for discretization of continuous input signal, i.e. for preparing discrete input. The application of forward adaptation based on the input signal variance provides more efficient bandwidth usage, whereas utilization of transform coding provides sub-sequences with more predictable signal characteristics that ensure higher quality of signal reconstruction at the receiving end. In order to provide additional compression, transform coding precedes adaptive quantization. As an objective measure of system performance, signal-to-quantization-noise ratio is used. System performance is discussed for two typical cases. In the first case, it was considered that the information about continuous signal variance is available, whereas the second case considers system performance estimation when only the information about discretized signal variance is present, which means that there is a loss of input signal information. The main goal of such performance estimation comparison of the proposed speech signal coding model is to explore what is the objectivity of performance if the information about a continuous source is absent, which is a common phenomenon in digital systems. The advantages of the proposed coding scheme are demonstrated by comparing the performance of the reconstructed signal with other similar exiting speech signal coding systems.
KW - Forward adaptive quantization
KW - Quasi-logarithmic quantizer
KW - Speech coding
KW - Transform coding
UR - http://www.scopus.com/inward/record.url?scp=85074707668&partnerID=8YFLogxK
U2 - 10.5755/j01.itc.48.3.21685
DO - 10.5755/j01.itc.48.3.21685
M3 - Article
AN - SCOPUS:85074707668
VL - 48
SP - 454
EP - 463
JO - Information Technology and Control
JF - Information Technology and Control
SN - 1392-124X
IS - 3
ER -