Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach

Oyucu, Saadin; Çelimli, Derya Betül Ünsal; Aksöz, Ahmet

Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach

dc.contributor.author	Oyucu, Saadin
dc.contributor.author	Çelimli, Derya Betül Ünsal
dc.contributor.author	Aksöz, Ahmet
dc.date.accessioned	2025-05-04T16:41:58Z
dc.date.available	2025-05-04T16:41:58Z
dc.date.issued	2024
dc.department	Sivas Cumhuriyet Üniversitesi
dc.description	8th International Symposium on Multidisciplinary Studies and Innovative Technologies, ISMSIT 2024 -- 7 November 2024 through 9 November 2024 -- Ankara -- 204563
dc.description.abstract	This study focuses on developing and evaluating a hybrid Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) based deep learning model for detecting fake voice recordings. The proposed model addresses the critical issue of artificial intelligence-generated speech mimicking human voices, which can potentially be used for malicious purposes, thereby endangering individuals' privacy and safety. A comprehensive dataset comprising 5,889 real and 5,889 fake voice samples was utilized for this research. The dataset underwent rigorous preprocessing, including segmentation into fixed-length windows and normalization. The hybrid CNN-LSTM model was then trained and validated systematically involving exploratory data analysis and extensive hyperparameter tuning. The experimental results demonstrated that the proposed model achieved an accuracy of 99.2%, an F1 score of 99.2%, a recall of 99.4%, and a precision of 99.0%, indicating its robust performance in distinguishing between real and fake voices. The findings underscore the potential of the hybrid CNN-LSTM model as a powerful tool for safeguarding digital communications against the growing threat of fake voices. © 2024 IEEE.
dc.identifier.doi	10.1109/ISMSIT63511.2024.10757293
dc.identifier.isbn	979-835035442-3
dc.identifier.scopus	2-s2.0-85213320665
dc.identifier.scopusquality	N/A
dc.identifier.uri	https://doi.org/10.1109/ISMSIT63511.2024.10757293
dc.identifier.uri	https://hdl.handle.net/20.500.12418/35016
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	ISMSIT 2024 - 8th International Symposium on Multidisciplinary Studies and Innovative Technologies, Proceedings
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_Scopus_20250504
dc.subject	audio forensics
dc.subject	convolutional neural network (CNN)
dc.subject	fake voice detection
dc.subject	long short-term memory (LSTM)
dc.title	Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach
dc.type	Conference Object

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

Fake Voice Detection: A Hybrid CNN-LSTM Based Deep Learning Approach

Dosyalar

Koleksiyon