Enhancing Object Recognition for Visually Impaired Individuals using Computer Vision
© 2024 by IJETT Journal
Volume-72 Issue-4
Year of Publication: 2024
Authors: Myo Min Aung, Dechrit Maneetham, Padma Nyoman Crisnapati, Yamin Thwe
DOI: 10.14445/22315381/IJETT-V72I4P130
How to Cite?
Myo Min Aung, Dechrit Maneetham, Padma Nyoman Crisnapati, Yamin Thwe, "Enhancing Object Recognition for Visually Impaired Individuals using Computer Vision," International Journal of Engineering Trends and Technology, vol. 72, no. 4, pp. 297-305, 2024. Crossref, https://doi.org/10.14445/22315381/IJETT-V72I4P130
Abstract
In computer vision and autonomous systems, object recognition and obstacle recognition are pivotal tasks that each contribute to the capabilities and safety of people and mobile robots. Object recognition identifies and classifies objects within digital images or video frames, whereas obstacle recognition detects and localizes obstacles or hazards in an environment. This study presents an object recognition system for visually impaired individuals that combines the YOLOv4 architecture with the COCO dataset, aiming to advance object recognition while retaining the benefits of obstacle recognition. The work covers a hardware implementation, a Raspberry Pi fitted with a 7-inch LCD, and a software implementation built on machine learning models. Test results indicate that the system is robust and operates in real time. User experience testing at an exhibition at Phramongkutklao Hospital drew positive feedback, which provides valuable input for a user-centric approach to developing object recognition technology tailored to the needs of visually impaired users. This research is expected to contribute to object recognition for intelligent systems in complex environments.
Keywords
Computer vision, Object recognition, Raspberry Pi, Vision impairment, YOLOv4.
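Illustrative sketch
The abstract does not describe the software pipeline in detail, so the following is only a minimal sketch of one step that YOLO-style detectors commonly need after inference: discarding low-confidence boxes and suppressing overlapping duplicates of the same class (non-maximum suppression). The `Detection` type, the threshold values, and the `describe` helper that turns results into a short left/ahead/right summary are all hypothetical and are not taken from the paper. The example labels are COCO class names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One raw detector output (hypothetical structure, not from the paper)."""
    label: str          # COCO class name, e.g. "chair"
    confidence: float   # detector score in [0, 1]
    box: tuple          # (x_min, y_min, x_max, y_max) in pixels


def iou(a, b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(detections, conf_threshold=0.5, iou_threshold=0.4):
    """Keep the highest-scoring box per object; thresholds are assumed values."""
    candidates = sorted(
        (d for d in detections if d.confidence >= conf_threshold),
        key=lambda d: d.confidence,
        reverse=True,
    )
    kept = []
    for cand in candidates:
        # Drop a candidate only if it overlaps a kept box of the same class.
        duplicate = any(
            k.label == cand.label and iou(k.box, cand.box) > iou_threshold
            for k in kept
        )
        if not duplicate:
            kept.append(cand)
    return kept


def describe(detections, frame_width):
    """Hypothetical helper: summarize each object by horizontal position."""
    parts = []
    for d in detections:
        center_x = (d.box[0] + d.box[2]) / 2
        if center_x < frame_width / 3:
            side = "left"
        elif center_x > 2 * frame_width / 3:
            side = "right"
        else:
            side = "ahead"
        parts.append(f"{d.label} {side}")
    return ", ".join(parts)


if __name__ == "__main__":
    raw = [
        Detection("chair", 0.9, (10, 10, 110, 210)),
        Detection("chair", 0.6, (20, 15, 120, 215)),   # overlaps the first chair
        Detection("person", 0.8, (500, 50, 600, 400)),
        Detection("cup", 0.3, (300, 300, 340, 350)),   # below the score threshold
    ]
    print(describe(non_max_suppression(raw), frame_width=640))
```

On a real device, the raw detections would come from the trained model rather than a hand-written list, and the summary string could be shown on the display or passed to another output channel; which of these the authors' system does is not stated in the abstract.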