Semantic Scholar Open Access 2022 29 sitasi

CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings

Zhiwen Fan Tianlong Chen Peihao Wang Zhangyang Wang

Abstrak

Understanding 2D computer-aided design (CAD) drawings plays a crucial role for creating 3D prototypes in architecture, engineering and construction (AEC) industries. The task of automated panoptic symbol spotting, i.e., to spot and parse both countable object instances (windows, doors, tables, etc.) and uncountable stuff (wall, railing, etc.) from CAD drawings, has recently drawn interests from the computer vision community. Unfortunately, the highly irregular ordering and orientations set major roadblocks for this task. Existing methods, based on convolutional neural networks (CNNs) and/or graph neural networks (GNNs), regress instance bounding boxes in the pixel domain and then convert the predictions into symbols. In this paper, we present a novel framework named CAD Transformer, that can painlessly modify existing vision transformer (ViT) backbones to tackle the above limitations for the panoptic symbol spotting task. CADTransformer tokenizes directly from the set of graphical primitives in CAD drawings, and correspondingly optimizes line-grained semantic and instance symbol spotting altogether by a pair of prediction heads. The backbone is further enhanced with a few plug-and-play modifications, including a neighborhood aware self-attention, hierarchical feature aggregation, and graphic entity position encoding, to bake in the structure prior while optimizing the efficiency. Besides, a new data augmentation method, termed Random Layer, is proposed by the layer-wise separation and recombination of a CAD drawing. Overall, CADTransformer significantly boosts the previous state-of-the-art from 0.595 to 0.685 in the panoptic quality (PQ) metric, on the recently released FloorPlanCAD dataset. We further demonstrate that our model can spot symbols with irregular shapes and arbitrary orientations. Our codes are available in https://github.com/VITA-Group/CADTransformer.

Topik & Kata Kunci

Penulis (4)

Z

Zhiwen Fan

T

Tianlong Chen

P

Peihao Wang

Z

Zhangyang Wang

Format Sitasi

Fan, Z., Chen, T., Wang, P., Wang, Z. (2022). CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings. https://doi.org/10.1109/CVPR52688.2022.01071

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1109/CVPR52688.2022.01071
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Total Sitasi
29×
Sumber Database
Semantic Scholar
DOI
10.1109/CVPR52688.2022.01071
Akses
Open Access ✓