arXiv Open Access 2026

Real-time Monocular 2D and 3D Perception of Endoluminal Scenes for Controlling Flexible Robotic Endoscopic Instruments

Ruofeng Wei Kai Chen Yui Lun Ng Yiyao Ma Justin Di-Lang Ho +5 lainnya
Lihat Sumber

Abstrak

Endoluminal surgery offers a minimally invasive option for early-stage gastrointestinal and urinary tract cancers but is limited by surgical tools and a steep learning curve. Robotic systems, particularly continuum robots, provide flexible instruments that enable precise tissue resection, potentially improving outcomes. This paper presents a visual perception platform for a continuum robotic system in endoluminal surgery. Our goal is to utilize monocular endoscopic image-based perception algorithms to identify position and orientation of flexible instruments and measure their distances from tissues. We introduce 2D and 3D learning-based perception algorithms and develop a physically-realistic simulator that models flexible instruments dynamics. This simulator generates realistic endoluminal scenes, enabling control of flexible robots and substantial data collection. Using a continuum robot prototype, we conducted module and system-level evaluations. Results show that our algorithms improve control of flexible instruments, reducing manipulation time by over 70% for trajectory-following tasks and enhancing understanding of surgical scenarios, leading to robust endoluminal surgeries.

Topik & Kata Kunci

Penulis (10)

R

Ruofeng Wei

K

Kai Chen

Y

Yui Lun Ng

Y

Yiyao Ma

J

Justin Di-Lang Ho

H

Hon Sing Tong

X

Xiaomei Wang

J

Jing Dai

K

Ka-Wai Kwok

Q

Qi Dou

Format Sitasi

Wei, R., Chen, K., Ng, Y.L., Ma, Y., Ho, J.D., Tong, H.S. et al. (2026). Real-time Monocular 2D and 3D Perception of Endoluminal Scenes for Controlling Flexible Robotic Endoscopic Instruments. https://arxiv.org/abs/2602.14666

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓