| Issue |
EPJ Web Conf.
Volume 354, 2026
19th Global Congress on Manufacturing and Management (GCMM 2025)
|
|
|---|---|---|
| Article Number | 03002 | |
| Number of page(s) | 18 | |
| Section | Robotics, Autonomous Systems, and Smart Inspection | |
| DOI | https://doi.org/10.1051/epjconf/202635403002 | |
| Published online | 02 March 2026 | |
https://doi.org/10.1051/epjconf/202635403002
Design and Implementation of an Autonomous Line-Following Mobile Robot Using Q-Learning and Computer Vision
1 Department of Mechanical Engineering, Lendi Institute of Engineering and Technology, Jonnada, Andhra Pradesh 535005, India
2 Department of Computer Science and Engineering, Anurag University, Hyderabad, Telangana 500088, India
3 Department of Mechanical Engineering, R.M.K. Engineering College, Kavaraipettai, Tamil Nadu 601206, India
4 Department of Information Technology, Prathyusha Engineering College, Tiruvallur, Tamil Nadu 602025, India
5 Department of Computer Science - Artifical Intelligence, PSGR Krishnammal College for Women, Coimbatore, Tamil Nadu 641004, India
6 Research and Development Cell, Lovely Professional University, Phagwara, Punjab 144411, India
* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Published online: 2 March 2026
Abstract
This research paper discusses the creation and application of a fully automated robot that follows a line by means of reinforcement learning techniques, more specifically the Q-learning algorithm. The system proposed comprises a vision-based perception module that uses OpenCV for its development and also includes the simulation environment of CoppeliaSim thus making the navigation process adaptive as well as robust. By interaction with its surroundings at all times, the agent gets trained on the control policy that is optimal in terms of lateral deviation minimization and hence provides stable trajectory tracking. The research comprises a detailed assessment of the Q-table dimensions and hyper-parameters such as the learning rate, discount factor, and exploration rate, systematically determining their impact on learning performance, convergence, and accuracy. According to the experimental findings, a configuration of a 7×7 Q-table strikes the best balance between precision and convergence speed that in turn results in smooth and even path tracking. The method, though quite effective under controlled conditions, does have its drawbacks in terms of state discretization, generalization, and real-time processing, thereby providing directions for the applications of deep reinforcement learning and adaptive perception models in the future.
© The Authors, published by EDP Sciences, 2026
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

