Computer Vision · Perception · VLMs · Visual Agents

Beomyoung Kim 김범영

Applied Scientist, NAVER Cloud (Image Vision)

I'm an Applied Scientist at NAVER Cloud on a mission to build AI that truly understands the visual world. My journey began in core visual recognition — segmentation, detection, and matting — with 14+ papers at CVPR, ICCV, ECCV, NeurIPS, and AAAI (572 citations, h-index 9). I'm now extending that perception expertise into multimodal & vision-language models — perception-grounded VLMs and visual reasoning agents that don't just see, but reason and act. My north star: research that ships — turning frontier ideas into products used by millions. I'm also pursuing my Ph.D. at KAIST alongside full-time research at NAVER Cloud.

View CV Google Scholar GitHub

Email· DBLP· LinkedIn· 📍 Seoul, Korea

14+

Publications

572

Citations

h-index

Years @ NAVER

Research

What I work on

VLMs & Visual Reasoning Agent NOW @ NAVER

Extending my visual-recognition expertise to vision-language models — multimodal LLMs/VLMs with fine-grained visual perception and grounding, and visual reasoning agents that perceive, localize, and reason over images.

Vision Foundation Models & Matting

Promptable, zero-shot segmentation & matting for anything; label-efficient human matting.

ZIM (ICCV'25 Highlight) · WSSHM

Label-Efficient Segmentation

Weakly- and semi-supervised semantic & instance segmentation from cheap supervision.

PointWSSIS · BESTIE · DRS · WSSS-BED

Continual & Efficient Learning

Class-incremental and panoptic continual segmentation; lightweight, efficient models.

ECLIPSE · SSUL · EResFD · PfLayer

Updates

News

Jul 2026✈️ Attending ICML 2026 in person — reach out to connect.

Jun 2026🔥 Phoenix accepted to ECCV 2026.

Jun 2025🔥 ZIM accepted to ICCV 2025 as a Highlight.

Oct 2024🎤 Invited talk at TEAM NAVER DAN 24 — CLOVA-X image editing.

Mar 2024🔥 ECLIPSE accepted to CVPR 2024.

Feb 2023🔥 PointWSSIS accepted to CVPR 2023.

Peer-reviewed · 14+ papers

Selected Publications

Under review

Under review · NeurIPS 2026 · Visual Reasoning Agents

Visual reasoning agents. A diagnostic framework for 3D spatial reasoning that turns silent perception failures in visual program synthesis into typed diagnoses, driving targeted program repair to rival frontier VLMs without task-specific training.

2026

ECCV 2026

Learning from Adversity: Semantic-Aware Mask Refinement through Adversarial Perturbation

Beomyoung Kim, Sung Ju Hwang

arXiv & code coming soon

2025

ICCV 2025 · ★ Highlight

ZIM: Zero-Shot Image Matting for Anything

Beomyoung Kim, Chanyong Shin, Joonhyun Jeong, Hyungsik Jung, Se-Yun Lee, Sewhan Chun, Dong-Hyun Hwang, Joonsang Yu

project arXiv code

bibtex

@inproceedings{kim2025zim,
  title={ZIM: Zero-Shot Image Matting for Anything},
  author={Kim, Beomyoung and others},
  booktitle={ICCV}, year={2025}}

2024

arXiv preprint

Towards Label-Efficient Human Matting

Beomyoung Kim, Myeong Yeon Yi, Joonsang Yu, Young Joon Yoo, Sung Ju Hwang

arXiv code

arXiv preprint

Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation

Beomyoung Kim, Donghyeon Kim, Sung Ju Hwang

arXiv code

CVPR 2024

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

Beomyoung Kim, Joonsang Yu, Sung Ju Hwang

arXiv code

WACV 2024

EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection

Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo

arXiv code

2023

CVPR 2023

The Devil is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided Mask Representation

Beomyoung Kim, Joonhyun Jeong, Dongyoon Han, Sung Ju Hwang

arXiv code

2022

CVPR 2022

Beyond Semantic to Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement

Beomyoung Kim, Youngjoon Yoo, Chaeeun Rhee, Junmo Kim

arXiv code

ICLR 2022

Learning Features with Parameter-Free Layers

Dongyoon Han, Youngjoon Yoo, Beomyoung Kim, Byeongho Heo

paper code

WACV 2022

TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection

Beomyoung Kim, Janghyeon Lee, Sihaeng Lee, Doyeon Kim, Junmo Kim

arXiv code

2021 & earlier

NeurIPS 2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

Sungmin Cha*, Beomyoung Kim*, Youngjoon Yoo, Taesup Moon (* equal)

arXiv code

AAAI 2021

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Beomyoung Kim, Sangeun Han, Junmo Kim

arXiv code

MIWAI 2021

3D Point Cloud Upsampling and Colorization using GAN

Beomyoung Kim, Sangeun Han, Eojindl Yi, Junmo Kim

paper

RiTA 2020

Fully Automated Valet Parking System Based on Infrastructure Sensing

Hyunjee Ryu, Beomyoung Kim, Heecheol Yoo, Jungwon Lee

paper

Research → Product

Real-World Impact

ZIM — Zero-Shot Image Matting. Promptable matting foundation model behind image-editing experiences. demo →
CLOVA-X Image Editing. Generative image editing, presented at TEAM NAVER DAN 24. talk →
FaceSign — CLOVA Vision. Face recognition service for NAVER identity verification & payments. learn more →

Background