EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models
Published in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2026), 2026
Published at WACV 2026. (* equal contribution, # corresponding author)
Recommended citation: Wenhui Zhu*, Xiwen Chen*, Zhipeng Wang*#, Shao Tang, Sayan Ghosh, Xuanzhao Dong, Rajat Koner, Yalin Wang. (2026). "EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models." WACV 2026.
Download Paper
