Abstract: We investigate fine-tuning Vision-Language Models (VLMs) for multi-task medical image understanding, focusing on detection, localization, and counting of findings in medical images. Our ...