Efficient Implementation of Gammatone Filters Based on Warped Cosine Modulated Filter Bank
- 作者: Porhun M.I1, Vashkevich M.I1
-
隶属关系:
- Belarusian State University of Informatics and Radioelectronics (BSUIR)
- 期: 卷 23, 编号 5 (2024)
- 页面: 1398-1422
- 栏目: Artificial intelligence, knowledge and data engineering
- URL: https://journal-vniispk.ru/2713-3192/article/view/265755
- DOI: https://doi.org/10.15622/ia.23.5.5
- ID: 265755
如何引用文章
全文:
详细
作者简介
M. Porhun
Belarusian State University of Informatics and Radioelectronics (BSUIR)
Email: porhun@bsuir.by
P. Brovky St. 6
M. Vashkevich
Belarusian State University of Informatics and Radioelectronics (BSUIR)
Email: vashkevich@bsuir.by
P. Brovky St. 6
参考
- Lass N.J., Donai J.J. Hearing science fundamentals. San Diego: Plural Publishing. 2021. 355 p.
- Herre J., Dick S. Psychoacoustic models for perceptual audio coding – A tutorial review // Applied Sciences. 2019. vol. 9. no. 14. doi: 10.3390/app9142854.
- Schilling A., Gerum R., Metzner C., Maier A., Krauss P. Intrinsic noise improves speech recognition in a computational model of the auditory pathway // Frontiers in Neuroscience. 2022. vol. 16. doi: 10.3389/fnins.2022.908330.
- Schadler M.R., Hulsmeier D., Warzybok A., Kollmeier B. Individual aided speech-recognition performance and predictions of benefit for listeners with impaired hearing employing FADE // Trends in Hearing. 2020. vol. 24. doi: 10.1177/2331216520938929.
- Luo Q. The improving effect of intelligent speech recognition System on english learning // Advances in Multimedia. 2022. doi: 10.1155/2022/2910859.
- Krobba A., Debyeche M., Selouani S. A. A novel hybrid feature method based on Caelen auditory model and gammatone filterbank for robust speaker recognition under noisy environment and speech coding distortion // Multimedia Tools and Applications. 2023. vol. 82. no. 11. pp. 16195–16212.
- Krobba A., Debyeche M., Selouani S. A. Mixture linear prediction Gammatone Cepstral features for robust speaker verification under transmission channel noise // Multimedia Tools and Applications. 2020. vol. 79. pp. 18679–18693.
- Jacome K.G.R., Grijalva F.L., Masiero B.S. Sound events localization and detection using bio-inspired gammatone filters and temporal convolutional neural networks // IEEE/ACM transactions on audio, speech, and language processing. 2023. vol. 31. pp. 2314–2324.
- Queiroz A., Coelho R. F0-based gammatone filtering for intelligibility gain of acoustic noisy signals // IEEE Signal Processing Letters. 2021. vol. 28. pp. 1225–1229.
- Irino T. Hearing Impairment Simulator Based on Auditory Excitation Pattern Playback: WHIS // IEEE Access. 2023. vol. 11. pp. 78419–78430.
- Kates J.M., Arehart K.H. The hearing-aid speech perception index (HASPI) version 2 // Speech Communication. 2021. vol. 131. pp. 35–46.
- Порхун М.И., Вашкевич М.И. Метод моделирования эффекта ухудшения частотного разрешения слуха у больных нейросенсорной тугоухостью // Информатика. 2021. Т. 18. № 3. С. 68–82. doi: 10.37661/1816-0301-2021-18-3-68-82.
- Islam R., Tarique M. Investigating the performance of gammatone filters and their applicability to design cochlear implant processing system // Designs. 2024. vol. 8. no. 1. doi: 10.3390/designs8010016.
- Mourgela A., Reiss J., Agus T.R. Investigation of a real-time hearing loss simulation for use in audio production // Audio Engineering Society Convention 149. 2020.
- Nagae M., Irino T., Nisimura R., Kawahara H., Patterson R.D. Hearing impairment simulator based on compressive gammachirp filter // Signal and Information Processing Association Annual Summit and Conference (APSIPA’2014). 2014. doi: 10.1109/APSIPA.2014.7041579.
- Вашкевич М.И., Азаров И.С. Сравнение частотно-временных преобразований: Фурье-анализ, вейвлеты и банки фильтров на основе фазового преобразования // Цифровая обработка сигналов. 2020. № 2. С. 13–26.
- Patterson R.D., Nimmo-Smith I., Holdsworth J., Rice P. An efficient auditory filterbank based on the gammatone function // Meeting of the IOC Speech Group on Auditory Modelling at RSRE. 1987. vol. 2. no. 7.
- Irino T., Patterson R.D. The gammachirp auditory filter and its application to speech perception // Acoustical Science and Technology. 2020. vol. 41. no. 1. pp. 99–107.
- Wang D., Brown G. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. New Jersey: Wiley-IEEE Press. 2008. 395 p.
- Feldbauer C., Kubin G., Kleijn W.B. Anthropomorphic Coding of Speech and Audio: A Model Inversion Approach // EURASIP Journal on Advances in Signal Processing. 2005. no. 9. pp. 1334–1349.
- Koilpillai R.D., Vaidyanathan P.P. Cosine-modulated FIR filter banks satisfying perfect reconstruction // IEEE Trans. on Signal Processing. 1992. vol. 4. no. 4. pp. 770–783.
- Вашкевич М.И., Петровский А.А. Неравнополосные банки фильтров для слуховых аппаратов: анализ алгоритмов, автоматизация проектирования // Автоматизация проектирования дискретных систем: материалы 7-й междунар. конф. (г. Минск, 16-17 ноября). 2010. C. 53–60.
- Порхун М.И., Вашкевич М.И. Моделирование частотной характеристики банка гамматон-фильтров при помощи неравнополосного косинусно-модулированного банка фильтров // Цифровая обработка сигналов и её применение: труды 24-й междунар. конф. (г. Москва, 30 марта – 1 апреля). 2022. Т. 1. C. 53–57.
- Kingma D.P., Ba J. Adam: A method for stochastic optimization // arXiv preprint. 2014. arXiv: 1412.6980. 15 p. URL: https://arxiv.org/pdf/1412.6980.pdf (дата обращения: 15.07.2024).
- Zeiler M.D. ADADELTA: An adaptive learning rate method // arXiv preprint. 2012. arXiv:1212.5701. 6 p. URL: https://arxiv.org/pdf/1212.5701 (дата обращения: 15.07.2024).
- Malcolm S. An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank // Apple Computer Technical Report. 1993. vol. 35. no. 8. 42 p.
补充文件
