IMAGE AND VISION COMPUTING, cilt.162, 2025 (SCI-Expanded, Scopus)
Recent advances in deep learning have revolutionized robotic applications such as 3D mapping, visual navigation and autonomous control. Monocular Visual Odometry (MVO) represents a critical advancement in autonomous systems, particularly drones, utilizing single-camera setups to navigate complex environments effectively. This review explores MVO's evolution from traditional methods to its integration with cutting edge technologies like deep learning and semantic understanding. In this study, we explore the latest training strategies, innovations in model architecture, and advanced fusion techniques used in hybrid models that combine depth and semantic information. A comprehensive literature review traces the evolution of MVO techniques, highlighting key datasets and performance metrics. Section 2 outlines the problem, while Section 3 reviews the studies, charting the evolution of MVO techniques predating the advent of deep learning. Section 4 details the methodology, focusing on cutting-edge training strategies, advancements in architectural designs, and fusion techniques in hybrid models integrating depth and semantic information. Finally, Section 5 summarizes findings, discusses implications, and suggests future research directions.