Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach
dc.contributor.author | Oyucu, Saadin | |
dc.contributor.author | Çelimli, Derya Betül Ünsal | |
dc.contributor.author | Aksöz, Ahmet | |
dc.date.accessioned | 2025-05-04T16:41:58Z | |
dc.date.available | 2025-05-04T16:41:58Z | |
dc.date.issued | 2024 | |
dc.department | Sivas Cumhuriyet Üniversitesi | |
dc.description | 8th International Symposium on Multidisciplinary Studies and Innovative Technologies, ISMSIT 2024 -- 7 November 2024 through 9 November 2024 -- Ankara -- 204563 | |
dc.description.abstract | This study focuses on developing and evaluating a hybrid Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) based deep learning model for detecting fake voice recordings. The proposed model addresses the critical issue of artificial intelligence-generated speech mimicking human voices, which can potentially be used for malicious purposes, thereby endangering individuals' privacy and safety. A comprehensive dataset comprising 5,889 real and 5,889 fake voice samples was utilized for this research. The dataset underwent rigorous preprocessing, including segmentation into fixed-length windows and normalization. The hybrid CNN-LSTM model was then trained and validated systematically involving exploratory data analysis and extensive hyperparameter tuning. The experimental results demonstrated that the proposed model achieved an accuracy of 99.2%, an F1 score of 99.2%, a recall of 99.4%, and a precision of 99.0%, indicating its robust performance in distinguishing between real and fake voices. The findings underscore the potential of the hybrid CNN-LSTM model as a powerful tool for safeguarding digital communications against the growing threat of fake voices. © 2024 IEEE. | |
dc.identifier.doi | 10.1109/ISMSIT63511.2024.10757293 | |
dc.identifier.isbn | 979-835035442-3 | |
dc.identifier.scopus | 2-s2.0-85213320665 | |
dc.identifier.scopusquality | N/A | |
dc.identifier.uri | https://doi.org/10.1109/ISMSIT63511.2024.10757293 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12418/35016 | |
dc.indekslendigikaynak | Scopus | |
dc.language.iso | en | |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | |
dc.relation.ispartof | ISMSIT 2024 - 8th International Symposium on Multidisciplinary Studies and Innovative Technologies, Proceedings | |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.snmz | KA_Scopus_20250504 | |
dc.subject | audio forensics | |
dc.subject | convolutional neural network (CNN) | |
dc.subject | fake voice detection | |
dc.subject | long short-term memory (LSTM) | |
dc.title | Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach | |
dc.type | Conference Object |