OSSR-PID: One-Shot Symbol Recognition in P&ID Sheets using Path Sampling and GCN

Shubham Paliwal; Monika Sharma; Lovekesh Vig



Piping and Instrumentation Diagrams (P&IDs) are ubiquitous in manufacturing and oil and gas enterprises, where they represent engineering schematics and equipment layout. There is an urgent need to extract and digitize information from P&IDs without incurring the cost of annotating a different set of symbols for each new use case. A robust one-shot learning approach to symbol recognition, i.e., localization followed by classification, would therefore go a long way towards this goal. Our method works by sampling pixels sequentially along the different contour boundaries in the image. These sampled points form paths, which are used in the prototypical line diagram to construct a graph that captures the structure of the contours. The prototypical graphs are then fed into a Dynamic Graph Convolutional Neural Network (DGCNN), which is trained to classify graphs into one of the given symbol classes. To make the classification network more robust, we also append embeddings from a ResNet-34 network trained on symbol images containing the sampled points. Since many symbols in P&IDs are structurally very similar to each other, we use ArcFace loss during DGCNN training, which helps maximize symbol class separability by producing highly discriminative embeddings. The images consist of components attached to pipelines (straight lines). The sampled points segregated around the symbol regions are used for the classification task. The proposed pipeline, named OSSR-PID, is fast and gives outstanding performance for recognition of symbols on a synthetic dataset of 100 P&ID diagrams. We also compare our method against prior work on a real-world private dataset of 12 P&ID sheets and obtain comparable or superior results. Remarkably, it achieves this performance using only one prototypical example per symbol.
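The path-sampling and graph-construction steps described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `sample_path` and `knn_edges`, the fixed arc-length `step`, and the use of k-nearest neighbours in coordinate space to connect the sampled points are all assumptions made for illustration. The abstract says only that points are sampled sequentially along contours and that the resulting paths are used to build a graph.

```python
import math


def sample_path(contour, step):
    """Emit a point every `step` units of arc length along a polyline.

    `contour` is a list of (x, y) vertices (hypothetical input format).
    """
    points = [contour[0]]
    carried = 0.0  # distance travelled since the last emitted point
    for (x0, y0), (x1, y1) in zip(contour, contour[1:]):
        seg = math.hypot(x1 - x0, y1 - y0)
        if seg == 0:
            continue  # skip repeated vertices
        d = step - carried  # offset of the next sample inside this segment
        while d <= seg + 1e-9:
            t = d / seg
            points.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
            d += step
        carried = seg - (d - step)
    return points


def knn_edges(points, k):
    """Connect each point to its k nearest neighbours (undirected edges).

    Assumption: coordinate-space k-NN stands in for the paper's
    path-based graph construction.
    """
    edges = set()
    for i, p in enumerate(points):
        others = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(p, points[j]),
        )
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return sorted(edges)


# An L-shaped stroke sampled every 2 units, then linked into a graph.
path = sample_path([(0, 0), (4, 0), (4, 4)], step=2)
graph = knn_edges(path, k=2)
```

In a full pipeline the node coordinates and edge list would be passed to a graph network such as DGCNN. Here they only show how a contour becomes a graph whose structure follows the shape of the stroke.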

updated: Wed Sep 08 2021 18:05:27 GMT+0000 (UTC)

published: Wed Sep 08 2021 18:05:27 GMT+0000 (UTC)

arXiv

