Leveraging foreground–background cues for semantically-driven, training-free moving object detection


ŞİMŞEK E., Negin F., Özyer G. T., ÖZYER B.

Engineering Applications of Artificial Intelligence, vol.136, 2024 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 136
  • Publication Date: 2024
  • DOI: 10.1016/j.engappai.2024.108873
  • Journal Name: Engineering Applications of Artificial Intelligence
  • Indexed In: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC, Metadex, Civil Engineering Abstracts
  • Keywords: Deep learning, Moving object detection, Semantic probability, Semantic segmentation, Training-free
  • Atatürk University Affiliated: Yes

Abstract

Deep learning algorithms and semantic cue-based methods are increasingly used to detect moving objects. Semantic cue-based studies detect moving objects from changes in the semantic label at each pixel location over time; in this approach, traditional background subtraction is employed only where semantic background information is lacking. However, the change in semantic information at each pixel alone is insufficient for accurate detection of moving objects, because different objects may receive similar label assignments and pixel intensity data is used only to a limited extent. Further challenges include a lack of diversity in labeled objects, an uneven distribution of labels or classes, and a shortage of unseen evaluation data. In this study, we propose the Moving Semantic Object Detector (MSemOD), a training-free moving object detection method that enhances a traditional method with semantic information provided by pre-trained semantic segmentation models. The proposed method includes a region-based decision step that uses traditional methods and semantic information as spatio-temporal cues. Threshold values determine the temporal and spatial movement of each segment in the image. To improve the consistency of the semantic information, the proposed method converts the semantic segmentation output into a semantic probability map by reinforcing its temporal and spatial state. The performance of the proposed method is evaluated on the LASIESTA, CDNet2014, and BMC datasets, and the results are compared with state-of-the-art algorithms. The results show that using region-based semantic information can significantly reduce false positives and enhance foreground–background segmentation in moving object detection.
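The core idea described above can be sketched in a few lines: smooth per-pixel semantic probabilities over time into a semantic probability map, then make a region-based decision that keeps a segment as moving foreground only if both the background-subtraction response and the semantic evidence inside that region pass a threshold. This is a minimal illustrative sketch, not the paper's implementation; the function names, the exponential-smoothing scheme, and the `alpha`/`tau` thresholds are hypothetical.

```python
import numpy as np

def semantic_probability_map(prev_map, seg_probs, alpha=0.7):
    """Temporally smooth per-pixel semantic probabilities.

    `alpha` is a hypothetical smoothing weight (not from the paper);
    an exponential moving average stands in for the paper's temporal
    and spatial reinforcement of the segmentation output.
    """
    if prev_map is None:
        return seg_probs
    return alpha * prev_map + (1.0 - alpha) * seg_probs

def region_decision(fg_mask, sem_map, labels, tau=0.5):
    """Region-based decision step (illustrative).

    For each segment id in `labels`, mark the whole region as moving
    foreground only if the mean background-subtraction response
    (`fg_mask`) and the mean semantic probability (`sem_map`) inside
    the region both exceed the hypothetical threshold `tau`.
    """
    out = np.zeros(labels.shape, dtype=bool)
    for lbl in np.unique(labels):
        region = labels == lbl
        if fg_mask[region].mean() > tau and sem_map[region].mean() > tau:
            out[region] = True
    return out
```

Deciding per region rather than per pixel is what suppresses the isolated false positives mentioned in the abstract: a few noisy foreground pixels inside a static segment cannot lift that segment's mean response above the threshold.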