*: equal contribution.   †: corresponding author.   C: conference   P: preprint   W: workshop


[C6] What “Not” to Detect: Improving Object Detection under Negation via Reasoning and Token Merging

NegToMe

  • Inha Kang, Youngsun Lim, Seonho Lee, Jiho Choi, Junsuk Choi†, Hyunjung Shim†
  • Keywords: Described Object Detection under Negation

ICLR 2026, Paper


[C5] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation

3D-Aware-VLM

  • Seonho Lee*, Jiho Choi*, Inha Kang, Jiwook Kim, Junsung Park, Hyunjung Shim†
  • Keywords: 3D-Aware VLM Finetuning

EMNLP 2025 Findings, Paper, Codes GitHub Stars


[C4] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation

PartCATSeg

  • Jiho Choi*, Seonho Lee*, Seungho Lee, Minhyun Lee, Hyunjung Shim†
  • Keywords: Open-Vocabulary Part Segmentation

CVPR 2025, Paper, Codes GitHub Stars


[C3] Scribble-Guided Diffusion for Training-free Text-to-Image Generation

Scribble-Guided Diffusion

  • Seonho Lee*, Jiho Choi*, Seohyun Lim, Jiwook Kim, Hyunjung Shim†
  • Keywords: Conditional Image Generation

ICIP 2025, Paper, Codes GitHub Stars


[C2] DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

DreamCatalyst

  • Jiwook Kim*, Seonho Lee*, Jaeyo Shin, Jiho Choi, Hyunjung Shim†
  • Keywords: 3D Editing, Score Distillation
  • Selected as a Daily Paper in HuggingFace

ICLR 2025, Paper, Project Page, Codes GitHub Stars


[C1] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

PartCLIPSeg

  • Jiho Choi*, Seonho Lee*, Seungho Lee, Minhyun Lee, Hyunjung Shim†
  • Keywords: Open-Vocabulary Part Segmentation

NeurIPS 2024, Paper, Codes GitHub Stars


[W1] Grounding the “Not”: Symbolic Representation of Negation for Logical Reasoning in VLMs

CoVAND

  • Inha Kang, Seonho Lee, Jiho Choi, Junsuk Choi†, Hyunjung Shim†
  • Keywords: Negation Understanding, Affirmative Bias

ICLR 2026 Workshop LLM Reasoning


[P3] Dense Reward for Multi-View 3D Reasoning with Global Maps and Local Views

DR3D

  • Jiho Choi*, Seonho Lee*, Seojeong Park, Hyunjung Shim†
  • Keywords: Multi-View 3D Reasoning, Reinforcement Learning

Under Review


[P2] Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

Perceptual_Judge

  • Seojeong Park*, Jiho Choi*, Junyong Kang, Seonho Lee, Jaeyo Shin, Hyunjung Shim†
  • Keywords: Multimodal LLM, Perceptual Judgment Bias

Under Review


[P1] WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving

WaymoQA

  • Seungjun Yu, Seonho Lee, Namho Kim, Jaeyo Shin, Junsung Park, Wonjeong Ryu, Raehyuk Jung, Hyunjung Shim†
  • Keywords: Visual Question Answering, Safety-Critical Reasoning

Under Review, Preprint, Codes GitHub Stars