Identificador persistente para citar o vincular este elemento: http://hdl.handle.net/10662/21831
Títulos: A mobile‑assisted voice condition analysis system for Parkinson’s disease: assessment of usability conditions
Autores/as: Carrón Perreira, Javier
Campos Roca, Yolanda
Madruga Escalona, Mario
Pérez Sánchez, Carlos Javier
Palabras clave: Funciones acústicas;Aprendizaje automático;Base de datos mPower;Enfermedad de Parkinson;Procesamiento de voz;Sistema de análisis de condición de voz;Acoustic features;Machine learning;mPower database;Parkinson’s disease;Speech processing;Voice condition analysis system
Fecha de publicación: 2021
Editor/a: BMC
Resumen: Background and objective: Automatic voice condition analysis systems to detect Parkinson’s disease (PD) are generally based on speech data recorded under acoustically controlled conditions and professional supervision. The performance of these approaches in a free-living scenario is unknown. The aim of this research is to investigate the impact of uncontrolled conditions (realistic acoustic environment and lack of supervision) on the performance of automatic PD detection systems based on speech. Methods: A mobile-assisted voice condition analysis system is proposed to aid in the detection of PD using speech. The system is based on a server–client architecture. In the server, feature extraction and machine learning algorithms are designed and implemented to discriminate subjects with PD from healthy ones. The Android app allows patients to submit phonations and physicians to check the complete record of every patient. Six different machine learning classifiers are applied to compare their performance on two different speech databases. One of them is an in-house database (UEX database), collected under professional supervision by using the same Android-based smartphone in the same room, whereas the other one is an age, sex and health-status balanced subset of mPower study for PD, which provides real-world data. By applying identical methodology, single-database experiments have been performed on each database, and also cross-database tests. Cross-validation has been applied to assess generalization performance and hypothesis tests have been used to report statistically significant differences. Results: In the single-database experiments, a best accuracy rate of 0.92 (AUC = 0.98) has been obtained on UEX database, while a considerably lower best accuracy rate of 0.71 (AUC = 0.76) has been achieved using the mPower-based database. The cross-database tests provided very degraded accuracy metrics. Conclusion: The results clearly show the potential of the proposed system as an aid for general practitioners to conduct triage or an additional tool for neurologists to perform diagnosis. However, due to the performance degradation observed using data from mPower study, semi-controlled conditions are encouraged, i.e., voices recorded at home by the patients themselves following a strict recording protocol and control of the information about patients by the medical doctor at charge.
URI: http://hdl.handle.net/10662/21831
DOI: 10.1186/s12938-021-00951-y
Colección:DMATE - Artículos
DTCYC - Artículos

Archivos
Archivo Descripción TamañoFormato 
s12938-021-00951-y.pdf1,59 MBAdobe PDFDescargar


Este elemento está sujeto a una licencia Licencia Creative Commons Creative Commons