A study on effective data preprocessing and augmentation method in diabetic retinopathy classification using pre-trained deep learning approaches


İncir R., BOZKURT F.

Multimedia Tools and Applications, cilt.83, sa.4, ss.12185-12208, 2024 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 83 Sayı: 4
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1007/s11042-023-15754-7
  • Dergi Adı: Multimedia Tools and Applications
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, FRANCIS, ABI/INFORM, Applied Science & Technology Source, Compendex, Computer & Applied Sciences, INSPEC, zbMATH
  • Sayfa Sayıları: ss.12185-12208
  • Anahtar Kelimeler: Classification, Data augmentation, Deep learning, Diabetic retinopathy
  • Atatürk Üniversitesi Adresli: Evet

Özet

High glucose levels in the blood not only damage different tissues and organs of the body, but also cause adverse effects on the eye. This condition is called diabetic retinopathy (DR). DR can cause blurred vision, darkening of the field of vision, and severe vision loss. The number of people infected with the disease is increasing in our country and worldwide. The time-consuming physician check-ups and the presence of small lesions indicate the need to develop diagnostic systems. Deep learning-based applications have become the trend for diagnosing and grading diseases from images. This study aims to create a meaningful and sufficient dataset using effective data preprocessing and affine transformation techniques in diabetic retinopathy classification. In this study, classification was performed using seven different pre-trained deep learning architectures. An experimental study of each technique was performed on the EyePACS dataset. An overfitting problem was encountered in the experimental results with the original data set. Thus, data preprocessing and data augmentation processes were carried out in order to eliminate overfitting by considering the imbalance between classes in the dataset. The classification performance obtained from each architecture was observed according to performance metrics of precision, recall, F1 Score, accuracy, and loss. In this study, the best performance was achieved with 97.65% test accuracy with the proposed EfficientNetV2-M network model.