Computer Vision · Multimodal AI · Vision-Language Models

Beomyoung Kim 김범영

AI Research Engineer, NAVER Cloud (Image Vision)

I'm an AI Research Engineer at NAVER on a mission to build AI that truly understands the visual world. My journey began in core visual recognition — segmentation, detection, and matting — with 13+ papers at CVPR, ICCV, NeurIPS, ICLR and AAAI (570 citations, h-index 9). I'm now extending that perception expertise into multimodal & vision-language models — perception-grounded VLMs and visual reasoning agents that don't just see, but reason and act. My north star: research that ships — turning frontier ideas into products used by millions. I'm also pursuing my Ph.D. at KAIST alongside full-time research at NAVER.

Email· DBLP· LinkedIn· 📍 Seoul, Korea
Beomyoung Kim
13+
Publications
570
Citations
9
h-index
5+
Years @ NAVER
Research

What I work on

Multimodal & Vision-Language Models NOW @ NAVER

Extending my visual-recognition expertise to vision-language models — multimodal LLMs/VLMs with fine-grained visual perception and grounding, and visual reasoning agents that perceive, localize, and reason over images.

Vision Foundation Models & Matting

Promptable, zero-shot segmentation & matting for anything; label-efficient human matting.

ZIM (ICCV'25 Highlight) · WSSHM

Label-Efficient Segmentation

Weakly- and semi-supervised semantic & instance segmentation from cheap supervision.

PointWSSIS · BESTIE · DRS · WSSS-BED

Continual & Efficient Learning

Class-incremental and panoptic continual segmentation; lightweight, efficient models.

ECLIPSE · SSUL · EResFD · PfLayer

Updates

News

Jul 2026✈️ Attending ICML 2026 in person — reach out to connect.
Jun 2025🔥 ZIM accepted to ICCV 2025 as a Highlight.
Oct 2024🎤 Invited talk at TEAM NAVER DAN 24 — CLOVA-X image editing.
Mar 2024🔥 ECLIPSE accepted to CVPR 2024.
Feb 2023🔥 PointWSSIS accepted to CVPR 2023.
Peer-reviewed · 13+ papers

Selected Publications

Under review
Under review · NeurIPS 2026

A diagnostic framework for 3D spatial reasoning that turns silent perception failures in visual program synthesis into typed diagnoses, driving targeted program repair to rival frontier VLMs without task-specific training.

Under review · ECCV 2026

A segmentation mask-refinement framework that synthesizes realistic, semantic-aware errors via adversarial perturbation and corrects them with contrastive learning, consistently improving state-of-the-art segmentation models.

2025
ICCV 2025 · ★ Highlight

ZIM: Zero-Shot Image Matting for Anything

Beomyoung Kim, Chanyong Shin, Joonhyun Jeong, Hyungsik Jung, Se-Yun Lee, Sewhan Chun, Dong-Hyun Hwang, Joonsang Yu
projectarXivcode
bibtex
@inproceedings{kim2025zim,
  title={ZIM: Zero-Shot Image Matting for Anything},
  author={Kim, Beomyoung and others},
  booktitle={ICCV}, year={2025}}
2024
arXiv preprint

Towards Label-Efficient Human Matting

Beomyoung Kim, Myeong Yeon Yi, Joonsang Yu, Young Joon Yoo, Sung Ju Hwang
arXiv preprint

Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation

Beomyoung Kim, Donghyeon Kim, Sung Ju Hwang
CVPR 2024

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

Beomyoung Kim, Joonsang Yu, Sung Ju Hwang
WACV 2024

EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection

Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo
2023
CVPR 2023

The Devil is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided Mask Representation

Beomyoung Kim, Joonhyun Jeong, Dongyoon Han, Sung Ju Hwang
2022
CVPR 2022

Beyond Semantic to Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement

Beomyoung Kim, Youngjoon Yoo, Chaeeun Rhee, Junmo Kim
ICLR 2022

Learning Features with Parameter-Free Layers

Dongyoon Han, Youngjoon Yoo, Beomyoung Kim, Byeongho Heo
WACV 2022

TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection

Beomyoung Kim, Janghyeon Lee, Sihaeng Lee, Doyeon Kim, Junmo Kim
2021 & earlier
NeurIPS 2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

Sungmin Cha*, Beomyoung Kim*, Youngjoon Yoo, Taesup Moon (* equal)
AAAI 2021

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Beomyoung Kim, Sangeun Han, Junmo Kim
MIWAI 2021

3D Point Cloud Upsampling and Colorization using GAN

Beomyoung Kim, Sangeun Han, Eojindl Yi, Junmo Kim
RiTA 2020

Fully Automated Valet Parking System Based on Infrastructure Sensing

Hyunjee Ryu, Beomyoung Kim, Heecheol Yoo, Jungwon Lee
Research → Product

Real-World Impact

Background

Experience & Education

Experience
NAVER Cloud · Image Vision
AI Research Engineer
2023 — now
NAVER CLOVA · Image Vision
AI Research Engineer
2021 — 2023
Research Internships
NAVER CLOVA (FACE, OCR) · Hyundai Mobis (Autonomous Driving)
2018 — 2020
Education
KAIST · Graduate School of AI
Ph.D. — MLAI Lab, Prof. Sung Ju Hwang
2022 — now
KAIST · Electrical Engineering
M.S. — SIIT Lab, Prof. Junmo Kim
2019 — 2021
Inha University
B.S. — Information & Comm. Engineering
2013 — 2019
Academic Service

Conference & Journal Reviewer

2026CVPR · ECCV · NeurIPS · ICLR · TPAMI
2025CVPR · ICCV · NeurIPS · ICLR · TMLR
2024CVPR · ECCV · NeurIPS · ICLR · AAAI
2023CVPR · ICCV · NeurIPS · WACV
Outreach

Invited Talks

Centum Digital Week 2025
2025

Centum Digital Week — Next Code 2025

"Beyond AI, Into Agents." A researcher's story from the AI agent era.

event →
TEAM NAVER DAN 24
2024

TEAM NAVER Conference DAN 24

CLOVA-X Image Editing: the world of pixel magic delivered by AI.

session →
Catch Career-Con 2022
2022

Jinhaksa Catch Career-Con

How to land an AI research engineer role — career mentoring (NAVER CLOVA).

NeurIPS 2021 ML in Korea
2021

NeurIPS 2021 Social — ML in Korea

SSUL: Semantic Segmentation with Unknown Label for Class-Incremental Learning.

Inha University
2022

Inha University — Invited Lecture

Weakly-Supervised Instance Segmentation (CVPR 2022), hosted by Prof. Chaeeun Rhee.