RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Ruoxi Shi; Xinyang Jiang; Caihua Shan; Yansen Wang; Dongsheng Li

Vector graphics (VG) are ubiquitous in daily life, with wide applications in engineering, architecture, design, and other fields. Most existing methods recognize VG by first rendering it into raster graphics (RG) and then running recognition on the RG format. However, this procedure discards the geometric structure and loses the high resolution of VG. More recently, another category of algorithms has been proposed to recognize directly from the original VG format, but these are affected by topological errors that RG rendering would filter out. Rather than relying on a single format, using VG and RG together avoids both shortcomings. Moreover, we argue that the VG-to-RG rendering process is essential for combining VG and RG information effectively: by specifying the rules for how VG primitives are transferred to RG pixels, the rendering process captures the interaction and correlation between VG and RG. We therefore propose RendNet, a unified architecture for recognition in both 2D and 3D scenarios, which considers both VG and RG representations and exploits their interaction by incorporating the VG-to-RG rasterization process. Experiments show that RendNet achieves state-of-the-art performance on 2D and 3D object recognition tasks on various VG datasets.
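
To make the idea of "rules for transferring VG primitives to RG pixels" concrete, here is a minimal sketch of a rasterizer that also records which primitive produced each pixel. It is an illustration only, not RendNet's rendering module: the function name `rasterize_segments`, the line-segment primitive type, and the `owner` map are all assumptions introduced for this example.

```python
def rasterize_segments(segments, width, height):
    """Rasterize 2D line segments (VG primitives) onto a pixel grid (RG).

    Hypothetical helper for illustration; assumes integer endpoints.
    Returns (image, owner):
      image[y][x] is 1 where some segment was drawn, else 0
      owner[y][x] is the index of the last segment drawn there, or -1
    """
    image = [[0] * width for _ in range(height)]
    owner = [[-1] * width for _ in range(height)]
    for idx, ((x0, y0), (x1, y1)) in enumerate(segments):
        # Simple DDA: one sample per unit step along the longer axis
        steps = max(abs(x1 - x0), abs(y1 - y0), 1)
        for s in range(steps + 1):
            t = s / steps
            x = round(x0 + t * (x1 - x0))
            y = round(y0 + t * (y1 - y0))
            # Skip samples that fall outside the raster
            if 0 <= x < width and 0 <= y < height:
                image[y][x] = 1
                owner[y][x] = idx
    return image, owner


# Two segments sharing an endpoint at (0, 0)
segments = [((0, 0), (3, 0)), ((0, 0), (0, 2))]
image, owner = rasterize_segments(segments, width=4, height=3)
```

The `owner` map is the point of the sketch: it is one simple way to express the VG-to-RG correspondence, since each raster pixel can be traced back to a vector primitive.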

updated: Tue Jun 21 2022 01:23:11 GMT+0000 (UTC)

published: Tue Jun 21 2022 01:23:11 GMT+0000 (UTC)

arXiv
