Advanced Computer Vision Techniques Using Graph Neural Networks for Real-Time Object Detection and Scene Understanding

Mridul  Dixit; Prasanth Varasala; Malarvizhi S; Regulwar Ganesh Bhaiyya; Dr. G. Sanjiv Rao; Chandrashekhar Ramesh  Ramtirthkar; Piyush  Pal; Mahendran Arumugam

Authors

Mridul Dixit Department of Computer Engineering & Applications, GLA University, Mathura.
Prasanth Varasala Associate Professor, Department of Electronics and Communication Engineering, Pragati Engineering College, ADB Road, Surampalem, Near Peddapuram, Kakinada District, Andhra Pradesh, India - 533437.
Malarvizhi S 3Assistant Professor, Department of Commerce, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research.
Regulwar Ganesh Bhaiyya Associate Professor, Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India - 501 218.
Dr. G. Sanjiv Rao Professor, Department of Artificial Intelligence and Machine Learning, Aditya University, Surampalem, Andhra Pradesh, Pin 533437.
Chandrashekhar Ramesh Ramtirthkar Associate Professor, Mechanical Engineering, Vishwakarma Institute of Technology, Pune, Maharashtra, 411037, India.
Piyush Pal School of Engineering &Technology,Noida international University, Uttar Pradesh 203201, India.
Mahendran Arumugam 8Center for Global Health Research,Saveetha Medical College, Saveetha Institute of Medical and Technical Sciences, Chennai, India.

Keywords:

Computer Vision; Graph Neural Networks (GNN); Real-Time Object Detection; Scene Understanding; Graph Attention Networks (GAT); Scene Graph Generation; Deep Learning; Contextual Reasoning; YOLO-Based Detection; Spatial Relationship Modeling; Intelligent Vision Systems; Semantic Scene Analysis.

Abstract

Real-time object recognition and scene perception are central to the higher level computer vision operationalities in autonomous driving, smart surveillance, robotics, and smart healthcare systems. Nevertheless, traditional convolution-based object detecting models are mainly concerned with single object detection and do not typically work well in obtaining contextual and spatial associations among objects in complex scenes. Such a restriction diminishes the accuracy of semantic understanding as well as the reliability of decision-making in changing real-world situations. To counter this difficulty, this paper presents a novel computer vision structure that is more advanced with the implementation of the Graph Neural Network (GNN) to detect objects and comprehend scenes in real-time. In the proposed model, the lightweight YOLO-based backbone feature extractor is paired with a Graph Attention Network (GAT) to predict inter-object relationships and context scene relationships. It has a scene graph generation mechanism to enhance semantic reasoning and spatial interaction analysis between detected objects. The framework was tested on benchmark datasets such as COCO and Visual Genome with real-time conditions. The experimental findings support that the proposed method obtained an average Precision (mAP) of 91.3, detection accuracy of 94.1, scene relationship recognition accuracy of 92.6, and a detection rate of 42 FPS, better than the traditional CNN-based and transformer-based detection frameworks and with low computational latency in real-time implementation.

Advanced Computer Vision Techniques Using Graph Neural Networks for Real-Time Object Detection and Scene Understanding

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Similar Articles

Make a Submission

INDEXING

Developed By

Information

Browse

Current Issue