← Back to Benchmarks
simmediumvision-robotmetric · varies

DISC: Dense Integrated Semantic Context for Large-Scale Open-Set Semantic Mapping

Description

Open-set semantic mapping enables language-driven robotic perception, but current instance-centric approaches are bottlenecked by context-depriving and computationally expensive crop-based feature extraction. To overcome this fundamental limitation, we introduce DISC (Dense Integrated Semantic Context), featuring a novel single-pass, distance-weighted extraction mechanism. By deriving high-fidelity CLIP embeddings directly from the vision transformer's intermediate layers, our approach eliminate

Source

http://arxiv.org/abs/2603.03935v1