← Back to Benchmarks
simmediumvision-robotmetric · varies

Vision-Language Semantic Grounding for Multi-Domain Crop-Weed Segmentation

Description

Fine-grained crop-weed segmentation is essential for enabling targeted herbicide application in precision agriculture. However, existing deep learning models struggle to generalize across heterogeneous agricultural environments due to reliance on dataset-specific visual features. We propose Vision-Language Weed Segmentation (VL-WS), a novel framework that addresses this limitation by grounding pixel-level segmentation in semantically aligned, domain-invariant representations. Our architecture em

Source

http://arxiv.org/abs/2602.23677v1