Advanced Computer Vision Techniques Using Graph Neural Networks for Real-Time Object Detection and Scene Understanding

Authors

  • Mridul Dixit Department of Computer Engineering & Applications, GLA University, Mathura.
  • Prasanth Varasala Associate Professor, Department of Electronics and Communication Engineering, Pragati Engineering College, ADB Road, Surampalem, Near Peddapuram, Kakinada District, Andhra Pradesh, India - 533437.
  • Malarvizhi S 3Assistant Professor, Department of Commerce, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research.
  • Regulwar Ganesh Bhaiyya Associate Professor, Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India - 501 218.
  • Dr. G. Sanjiv Rao Professor, Department of Artificial Intelligence and Machine Learning, Aditya University, Surampalem, Andhra Pradesh, Pin 533437.
  • Chandrashekhar Ramesh Ramtirthkar Associate Professor, Mechanical Engineering, Vishwakarma Institute of Technology, Pune, Maharashtra, 411037, India.
  • Piyush Pal School of Engineering &Technology,Noida international University, Uttar Pradesh 203201, India.
  • Mahendran Arumugam 8Center for Global Health Research,Saveetha Medical College, Saveetha Institute of Medical and Technical Sciences, Chennai, India.

Keywords:

Computer Vision; Graph Neural Networks (GNN); Real-Time Object Detection; Scene Understanding; Graph Attention Networks (GAT); Scene Graph Generation; Deep Learning; Contextual Reasoning; YOLO-Based Detection; Spatial Relationship Modeling; Intelligent Vision Systems; Semantic Scene Analysis.

Abstract

Real-time object recognition and scene perception are central to the higher level computer vision operationalities in autonomous driving, smart surveillance, robotics, and smart healthcare systems. Nevertheless, traditional convolution-based object detecting models are mainly concerned with single object detection and do not typically work well in obtaining contextual and spatial associations among objects in complex scenes. Such a restriction diminishes the accuracy of semantic understanding as well as the reliability of decision-making in changing real-world situations. To counter this difficulty, this paper presents a novel computer vision structure that is more advanced with the implementation of the Graph Neural Network (GNN) to detect objects and comprehend scenes in real-time. In the proposed model, the lightweight YOLO-based backbone feature extractor is paired with a Graph Attention Network (GAT) to predict inter-object relationships and context scene relationships. It has a scene graph generation mechanism to enhance semantic reasoning and spatial interaction analysis between detected objects. The framework was tested on benchmark datasets such as COCO and Visual Genome with real-time conditions. The experimental findings support that the proposed method obtained an average Precision (mAP) of 91.3, detection accuracy of 94.1, scene relationship recognition accuracy of 92.6, and a detection rate of 42 FPS, better than the traditional CNN-based and transformer-based detection frameworks and with low computational latency in real-time implementation.

Downloads

Published

2026-06-14

How to Cite

Dixit, M., Varasala, P., S, M., Bhaiyya, R. G., Rao, D. G. S., Ramtirthkar, C. R., … Arumugam , M. (2026). Advanced Computer Vision Techniques Using Graph Neural Networks for Real-Time Object Detection and Scene Understanding. International Journal of Artificial Intelligence and Machine Learning, 6(2s), 544–554. Retrieved from https://svedbergopen.com/index.php/ijaiml/article/view/236

Similar Articles

<< < 11 12 13 14 15 16 17 18 19 20 > >> 

You may also start an advanced similarity search for this article.