pisco_log
banner

Voice Disorder Identification and Classification


Abstract


Dysphonia, a voice disorder affecting many individuals, varies in severity and treatment complexity depending on its type. Early
identification is crucial to prevent long-term damage. This project aims to distinguish pathological from healthy voice samples using spectral
features extracted via Linear Prediction Coefficients (LPC). We apply K-Nearest Neighbors (KNN) and Support Vector Machine (SVM) classifiers, and evaluate their performance using metrics such as precision, true positive rate, and false negative rate. Results show KNN outperforms SVM, achieving 78% accuracy and better robustness in mixed voice conditions.

Keywords


Dysphonia; Pathological Voice Detection; Linear Prediction Coefficients (LPC); K-Nearest Neighbors (KNN); Support Vector Machine (SVM); Voice Classification; Signal Processing

Full Text:

PDF

Included Database


References


[1] Neighbors C, Song SA. Dysphonia. [Updated 2022 May 1]. In: Stat Pearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2024

Jan-. Available from:https://www.ncbi.nlm.nih.gov/books/NBK565881/

[2] Wikipedia contributors, Wikipedia, The Free Encyclopedia. https://en.wikipedia.org/w/index.php?title=Dyslexia&oldid=1199397896

[3] Muhammad G, Altuwaijri G, Alsulaiman M, et al. Automatic voice pathology detection and classification using vocal tract area irregularity. Biocybern Biomed Eng. 2016; 36:309 317. https://doi.org/10.1016/j. bbe.2016.01.004.

[4] Cesari U, De Pietro G, Marciano E, et al. A new database of healthy and pathological voices. Comput Electr Eng. 2018; 68:310321.

[5] Lili Chen, Chaoyu Wang, Junjiang Chen, Zejun Xiang, Xue Hu, Voice Disorder Identification by using Hilbert-Huang Transform (HHT)

and K Nearest Neighbor (KNN), Journal of Voice, Volume 35, Issue 6, 2021, Pages 932.e1-932.e11, ISSN 0892-1997. https://doi.

org/10.1016/j.jvoice.2020.03.009.




DOI: http://dx.doi.org/10.70711/aitr.v2i10.7135

Refbacks

  • There are currently no refbacks.