Detection of Voices Generated by Artificial Intelligence with Deep Learning Methods (Yapay Zeka Tarafından Üretilen Seslerin Derin Öğrenme Yöntemleriyle Tespiti)


ÖZTÜRK S. B., ÖZYER B., TEMİZ Ö.

32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024, Mersin, Turkey, 15 - 18 May 2024 (Full Text)

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/siu61531.2024.10601078
  • City: Mersin
  • Country: Turkey
  • Keywords: artificial intelligence, computer forensics, deep learning, voice detection
  • Ataturk University Affiliated: Yes

Abstract

Voice recordings used in forensic science face threats to their accuracy and reliability, including identity theft, the spread of misleading information, and manipulation of legal evidence. As advances in artificial intelligence technologies increase the production of fake digital documents in forensic medicine and digital forensics, distinguishing voices produced by artificial intelligence from real voices stands out as a serious problem. In this study, we propose a system that distinguishes voices produced by artificial intelligence from real human voices. The system extracts voice features using Mel-Frequency Cepstral Coefficients (MFCC) and classifies them with a Convolutional Neural Network (CNN), a Long Short-Term Memory (LSTM) network, and a hybrid model that combines the two. We examined the performance of these deep learning-based classifiers. Experiments showed that the hybrid model classifies voices more accurately than the standalone CNN and LSTM models.
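
The abstract names MFCC as the feature representation but gives no extraction parameters. As a rough illustration of what such features involve, the following is a minimal sketch of MFCC computation for a single audio frame, using only the Python standard library. The function names, the 26-filter mel bank, the 13 coefficients, and the sample rate used in the usage note are all assumptions chosen for illustration; they are not taken from the paper, and a real pipeline would more likely rely on an audio library. The sketch covers feature extraction only, not the CNN, LSTM, or hybrid classifiers.

```python
import cmath
import math


def hz_to_mel(f_hz):
    """Convert frequency in Hz to the mel scale (common 2595*log10 form)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Apply a Hamming window, then return |DFT|^2 / N for bins 0..N/2.

    Uses a naive O(N^2) DFT to stay dependency-free; fine for one short frame.
    """
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        s = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))
        spectrum.append(abs(s) ** 2 / n)
    return spectrum


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Build triangular filters whose centers are evenly spaced in mel."""
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge points, evenly spaced on the mel axis, mapped to Hz.
    edges_hz = [mel_to_hz(mel_max * i / (n_filters + 1))
                for i in range(n_filters + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))   # rising slope
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def mfcc_frame(frame, sample_rate, n_filters=26, n_mfcc=13):
    """Return n_mfcc cepstral coefficients for one frame (assumed defaults)."""
    spectrum = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # Log mel-band energies; the small floor avoids log(0) for empty bands.
    log_energy = [math.log(sum(w * p for w, p in zip(row, spectrum)) + 1e-10)
                  for row in bank]
    # Unnormalized DCT-II of the log energies gives the cepstral coefficients.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_energy))
            for c in range(n_mfcc)]
```

Under these assumptions, calling `mfcc_frame(frame, 8000)` on a 256-sample frame yields a list of 13 floats. Stacking such vectors over consecutive frames produces the kind of time-by-coefficient matrix that a CNN or LSTM classifier could take as input.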