Computer Vision-Based Medical Image Segmentation Using Hybrid CNN and Transformer Architectures

Ashish  Sharma; M. Radhika Mani; Dr. Arivukkodi R; Dhanalaxmi Chinthala; Dr. Ravi Thangjam; Amol Bhilare; Tanya Singh; Ankur Singh

Computer Vision-Based Medical Image Segmentation Using Hybrid CNN and Transformer Architectures

Authors

Ashish Sharma Department of Computer Engineering & Applications, GLA, University, Mathura.
M. Radhika Mani Professor, Department of Computer Science and Engineering, Pragati Engineering College, ADB Road, Surampalem, NearPeddapuram, Kakinada District, Andhra Pradesh, India – 533437.
Dr. Arivukkodi R Computer Science, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research.
Dhanalaxmi Chinthala Assistant Professor, Departmentof Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India - 501 218.
Dr. Ravi Thangjam Professor, School of Business, Aditya University, Surampalem, Andhra Pradesh, Pin 533437.
Amol Bhilare Assistant Professor, Computer Engineering, Vishwakarma Institute of Technology, Pune, Maharashtra, 411037.
Tanya Singh School of Engineering &Technology,Noida international University, Uttar Pradesh 203201, India.
Ankur Singh Bist Graphic Era Hill University Bhimtal campus & Centre for Promotion of Research Graphic Era (Deemed to be) University, Dehradun, India.

Keywords:

Medical Image Segmentation, CNN, Transformer, Deep Learning, Computer Vision, Dice Score, IoU, Biomedical Imaging.

Abstract

In computer-aided diagnosis, disease monitoring, treatment planning, and precision healthcare, medical image segmentation is a crucial task that allows the identification of the anatomic structures and pathological regions in biomedical images. Traditional convolutional neural network (CNN)-based segmentation models have shown a high level of local feature extraction, but tend to have limited global contextual information and lack of long-range dependency modeling, which results into erroneous boundary demarcation and low segmentation accuracy under traditional medical imaging conditions. This study attempts to address these drawbacks by proposing a hybrid CNNTransformer framework, in which the capability of learning spatial features of CNN backbones is combined with the ability to learn the global context of transformer-based attention mechanisms to improve the medical image segmentation. The proposed architecture uses hierarchical local feature extraction with CNN encoder and transformer modules to extract semantic dependencies of long range and multi-scale contextual features, enhancing the robustness and accuracy of segmentation. The standard medical image segmentation dataset was used to evaluate the effectiveness of the proposed method through an experimental approach in which preprocessing and augmentation methods were implemented to enhance model generalization and efficiency in the training process. The proposed model was evaluated with the well-known segmentation measures, such as Dice Similarity Coefficient (DSC), Intersection over Union (IoU) and pixel-wise Accuracy. Experimental findings have shown that the hybrid framework achieves better segmentation performance than the conventional CNN-based frameworks because the framework provides better representation of the features, less false segmentation regions and accuracy in the boundaries. The suggested method demonstrated significant progress on Dice score, IoU, as well as the overall consistency of segmentation on difficult samples of medical imaging. The created framework provides strong clinical importance in that it enables more confident automated diagnosis, lessening manual annotation work, and enhances the decision making ability in intelligent health care system and computer-aided medical imaging software.

Downloads

Published

2026-05-12

How to Cite

Sharma, A., Mani, M. R., R, D. A., Chinthala, D., Thangjam, D. R., Bhilare, A., … Singh, A. (2026). Computer Vision-Based Medical Image Segmentation Using Hybrid CNN and Transformer Architectures. International Journal of Artificial Intelligence and Machine Learning, 6(2s), 445–458. Retrieved from https://svedbergopen.com/index.php/ijaiml/article/view/224

Download Citation

Issue

Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Most read articles by the same author(s)

Dr. Priya Sethuraman, Dr. Arivukkodi R, Dr. Nallusamy C, Durga B, Xalida Sultanova, Bakhodir Khoshbakov, Integrating Knowledge Graphs with Natural Language Processing for Context-Aware Educational Content Recommendations , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 1s (2026): IJAIML_VOL.6_NO.1s 2026
Subhash Chand Agrawal, Avinash Gudimetla, Samundeeswari K, Sameera Khan, Dr. Ravi Thangjam, Gajanan Chavan, Kumari Shipra, Dr.N. Neelima, Edge Computing and AI Integration for Low-Latency Decision-Making in Smart Cities and Industrial IoT , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Mayank Srivastava, Y. Suma Chamundeswari, Suganya S, Swetha Polisetty, Dr. G. Sanjiv Rao, Ashutosh Kulkarni, Kuldeep Dhiman, Ankur Singh , Autonomous Multi-Agent Systems Using Reinforcement Learning for Cooperative Task Allocation and Optimization , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Rakesh Kumar, Y Vijay Kumar, Kanchana K, Sameera Khan, Dr. Ravi Thangjam, Rajesh Raikwar, Paul Praveen Albert Selvakumar, Mahendran Arumugam, Reinforcement Learning-Driven Autonomous Navigation System for Mobile Robots in Unstructured and Dynamic Terrains , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Ashish Sharma, Lakshmi Viveka K, Hadasha Nobel tune, Dhanalaxmi Chinthala, Dr. Ravi Thangjam, Bipin Sule, Tanveer Ahmad Wani, D. Akila, A Hybrid Framework Integrating Supervised and Reinforcement Learning for Adaptive Decision-Making in Dynamic Environments , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Rohit Agarwal, Vinod Kumar Naidu Pamuluri, Sathya arthi R, Ala Rajitha, Dr. Ravi Thangjam, Prashant Anerao, Deepika Sharma, Human-AI Collaborative Systems: Cognitive Computing Approaches for Enhancing User Interaction and Decision Support , International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026

Computer Vision-Based Medical Image Segmentation Using Hybrid CNN and Transformer Architectures

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Similar Articles

Make a Submission

INDEXING

Developed By

Information

Browse

Current Issue