Peer Learning for Unbiased Scene Graph Generation

Liguang Zhou; Junjie Hu; Yuhongze Zhou; Tin Lun Lam; Yangsheng Xu

偏りのないシーングラフ生成のためのピアラーニング

偏りのないシーングラフ生成 (USGG) は、画像内のオブジェクト間の多様で非常に不均衡な述語を予測する必要がある困難なタスクです。これに対処するために、述語サンプリングとコンセンサス投票 (PSCV) を使用して複数のピアが互いに学習することを奨励する、新しいフレームワークのピア学習を提案します。述語サンプリングは、頻度に基づいて述語クラスを下位分布に分割し、各下位分布またはそれらの組み合わせを処理するために異なるピアを割り当てます。コンセンサス投票は、多数派の意見を強調し、少数派の意見を減らすことによって、ピアの補完的な述語知識をアンサンブルします。 Visual Genome の実験では、PSCV が以前の方法よりも優れており、平均 31.6 の SGCls タスクで新しい最先端技術を達成することが示されています。

Unbiased scene graph generation (USGG) is a challenging task that requires predicting diverse and heavily imbalanced predicates between objects in an image. To address this, we propose a novel framework peer learning that uses predicate sampling and consensus voting (PSCV) to encourage multiple peers to learn from each other. Predicate sampling divides the predicate classes into sub-distributions based on frequency, and assigns different peers to handle each sub-distribution or combinations of them. Consensus voting ensembles the peers' complementary predicate knowledge by emphasizing majority opinion and diminishing minority opinion. Experiments on Visual Genome show that PSCV outperforms previous methods and achieves a new state-of-the-art on SGCls task with 31.6 mean.

updated: Sat Mar 04 2023 08:13:35 GMT+0000 (UTC)

published: Sat Dec 31 2022 07:56:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト