Enhancing Medical Image Classification with Vision Transformers on Diverse Datasets
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 3 | Views: 97 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Research Paper | Computer Science & Engineering | India | Volume 14 Issue 1, January 2025 | Popularity: 4.8 / 10


     

Enhancing Medical Image Classification with Vision Transformers on Diverse Datasets

Aditya Dhar Dwivedi


Abstract: Medical image classification is essential for accurate diagnosis and effective treatment planning. This research investigates the implementation of MedViT, a robust Vision Transformer tailored for medical image analysis, and compares its performance against four models: StarterNet, TinyVGG, a standard Vision Transformer (ViT), and a Convolutional Neural Network (CNN). Evaluations conducted on PathMINST, a medical imaging dataset, and CIFAR - 10, a general - purpose image classification dataset, to assess model generalization.


Keywords: MedViT, Vision Transformer, Medical Image Classification, PathMINST, CIFAR-10


Edition: Volume 14 Issue 1, January 2025


Pages: 680 - 693


DOI: https://www.doi.org/10.21275/SR25113171116



Make Sure to Disable the Pop-Up Blocker of Web Browser


Text copied to Clipboard!
Aditya Dhar Dwivedi, "Enhancing Medical Image Classification with Vision Transformers on Diverse Datasets", International Journal of Science and Research (IJSR), Volume 14 Issue 1, January 2025, pp. 680-693, https://www.ijsr.net/getabstract.php?paperid=SR25113171116, DOI: https://www.doi.org/10.21275/SR25113171116