PolySAM-Lite: Parameter-efficient adaptation of the Segment Anything Model for colorectal polyp segmentation

Abstract

Colorectal cancer is a leading cause of cancer mortality worldwide, and early detection and segmentation of polyps during colonoscopy are critical for survival. While vision foundation models such as the Segment Anything Model (SAM) demonstrate exceptional zero-shot generalization, their massive computational footprint hinders deployment in resource-constrained clinical settings. To bridge this gap, we introduce PolySAM-Lite, a resource-efficient framework that adapts the SAM architecture to specific medical segmentation tasks using Low-Rank Adaptation (LoRA). By freezing the heavy image encoder and injecting trainable low-rank matrices specifically into the fused Query-Key-Value (QKV) attention projections, we fine-tune the model using only 4.2 million parameters (∼4.5% of the ViT-Base total). Experimental evaluation on the Kvasir-SEG dataset demonstrates that PolySAM-Lite achieves a Dice Similarity Coefficient (DSC) of 0.9348, significantly outperforming the zero-shot SAM baseline (DSC: 0.8656) by 6.92 percentage points. Furthermore, ablation studies reveal that our method maintains robust performance (DSC: 0.9240) even when trained on only 50% of the available data. Statistical analysis confirms the significance of these improvements (p < 0.001). Notably, PolySAM-Lite reaches an Area Under the Curve (AUC) of 0.9972 while training on a single NVIDIA T4 GPU, demonstrating that high-performance medical AI can be democratized for low-resource healthcare environments.
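To make the adaptation scheme concrete, the sketch below shows one way to inject LoRA into a fused QKV projection in PyTorch. The wrapper class, the rank and scaling values, and the attachment loop over `sam.image_encoder.blocks` (the layout used by the reference `segment-anything` codebase, where each transformer block exposes a fused `attn.qkv` linear layer) are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class LoRAQKV(nn.Module):
    """Wrap a frozen fused QKV projection (dim -> 3*dim) and add a
    trainable low-rank update: W x + (alpha / r) * B A x.
    Hypothetical sketch; rank/alpha are not the paper's settings."""

    def __init__(self, qkv: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.qkv = qkv  # pretrained projection, kept frozen
        for p in self.qkv.parameters():
            p.requires_grad = False
        # Low-rank factors: A projects down to rank r, B projects back up.
        self.lora_A = nn.Linear(qkv.in_features, r, bias=False)
        self.lora_B = nn.Linear(r, qkv.out_features, bias=False)
        nn.init.zeros_(self.lora_B.weight)  # update starts at zero
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen path plus scaled low-rank correction.
        return self.qkv(x) + self.scale * self.lora_B(self.lora_A(x))

# Attach the adapters to every encoder block (attribute path assumed
# to match the reference segment-anything implementation):
# for blk in sam.image_encoder.blocks:
#     blk.attn.qkv = LoRAQKV(blk.attn.qkv, r=8)
```

Because only the `lora_A` and `lora_B` weights require gradients, the optimizer sees a small fraction of the encoder's parameters, which is the mechanism behind the low trainable-parameter count reported in the abstract.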
