Design and Implementation of an Autonomous Line-Following Mobile Robot Using Q-Learning and Computer Vision

K. Ch. Sekhar; J. Maria Shanthi; K. R. Senthil Kumar; M. Gopikrishnan; R. Revathi; Joshuva Arockia Dhanraj

doi:10.1051/epjconf/202635403002

Open Access

Issue		EPJ Web Conf. Volume 354, 2026 19^th Global Congress on Manufacturing and Management (GCMM 2025)


Article Number		03002
Number of page(s)		18
Section		Robotics, Autonomous Systems, and Smart Inspection
DOI		https://doi.org/10.1051/epjconf/202635403002
Published online		02 March 2026

EPJ Web of Conferences 354, 03002 (2026)
https://doi.org/10.1051/epjconf/202635403002

Design and Implementation of an Autonomous Line-Following Mobile Robot Using Q-Learning and Computer Vision

K. Ch. Sekhar¹^*, J. Maria Shanthi², K. R. Senthil Kumar³, M. Gopikrishnan⁴, R. Revathi⁵ and Joshuva Arockia Dhanraj⁶

¹ Department of Mechanical Engineering, Lendi Institute of Engineering and Technology, Jonnada, Andhra Pradesh 535005, India
² Department of Computer Science and Engineering, Anurag University, Hyderabad, Telangana 500088, India
³ Department of Mechanical Engineering, R.M.K. Engineering College, Kavaraipettai, Tamil Nadu 601206, India
⁴ Department of Information Technology, Prathyusha Engineering College, Tiruvallur, Tamil Nadu 602025, India
⁵ Department of Computer Science - Artifical Intelligence, PSGR Krishnammal College for Women, Coimbatore, Tamil Nadu 641004, India
⁶ Research and Development Cell, Lovely Professional University, Phagwara, Punjab 144411, India

^* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Published online: 2 March 2026

Abstract

This research paper discusses the creation and application of a fully automated robot that follows a line by means of reinforcement learning techniques, more specifically the Q-learning algorithm. The system proposed comprises a vision-based perception module that uses OpenCV for its development and also includes the simulation environment of CoppeliaSim thus making the navigation process adaptive as well as robust. By interaction with its surroundings at all times, the agent gets trained on the control policy that is optimal in terms of lateral deviation minimization and hence provides stable trajectory tracking. The research comprises a detailed assessment of the Q-table dimensions and hyper-parameters such as the learning rate, discount factor, and exploration rate, systematically determining their impact on learning performance, convergence, and accuracy. According to the experimental findings, a configuration of a 7×7 Q-table strikes the best balance between precision and convergence speed that in turn results in smooth and even path tracking. The method, though quite effective under controlled conditions, does have its drawbacks in terms of state discretization, generalization, and real-time processing, thereby providing directions for the applications of deep reinforcement learning and adaptive perception models in the future.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.