Google Landmark Retrieval 2021 Competition Third Place Solution

Qishen Ha; Bo Liu; Hongwei Zhang

We present our solutions to the Google Landmark Challenges 2021 for both the retrieval and recognition tracks. Both solutions are ensembles of transformer and ConvNet models trained with Sub-center ArcFace with dynamic margins. Because the two tracks share the same training data, we used the same pipeline and training approach for both, differing only in which models were selected for the ensemble and in the post-processing. The key improvement over last year comes from newer state-of-the-art vision architectures, especially transformers, which significantly outperform ConvNets on the retrieval task. We finished in third place in the retrieval track and fourth place in the recognition track.
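To make the loss named in the abstract concrete, the following is a minimal sketch of a Sub-center ArcFace logit computation with a class-dependent margin, written in plain Python. It is not the authors' implementation. The abstract does not give the margin schedule, so the function `dynamic_margin`, its power-law form, and all constants (`a`, `b`, `lam`, `scale`) are assumptions chosen for illustration only. The sketch also omits the handling that production implementations add for angles where theta plus the margin exceeds pi.

```python
import math


def dynamic_margin(class_count, a=0.5, b=0.05, lam=0.25):
    """Hypothetical schedule (an assumption): rarer classes get larger margins."""
    return a * class_count ** (-lam) + b


def l2_normalize(v):
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def subcenter_cosines(embedding, class_centers):
    """class_centers[c] holds K sub-center vectors for class c.

    Returns one cosine per class: the maximum over that class's sub-centers.
    """
    e = l2_normalize(embedding)
    cosines = []
    for centers in class_centers:
        sims = [sum(x * y for x, y in zip(e, l2_normalize(w))) for w in centers]
        cosines.append(max(sims))
    return cosines


def arcface_logits(cosines, label, margin, scale=30.0):
    """Add an angular margin to the target class only, then scale all logits."""
    logits = []
    for c, cos_theta in enumerate(cosines):
        if c == label:
            # Clamp before acos to guard against floating-point drift.
            theta = math.acos(max(-1.0, min(1.0, cos_theta)))
            cos_theta = math.cos(theta + margin)
        logits.append(scale * cos_theta)
    return logits


def cross_entropy(logits, label):
    """Numerically stable softmax cross-entropy for a single example."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


# Toy example: 2 classes, K = 2 sub-centers each, 2-dimensional embeddings.
centers = [
    [[1.0, 0.0], [0.0, -1.0]],  # class 0
    [[0.0, 1.0], [-1.0, 0.0]],  # class 1
]
cosines = subcenter_cosines([1.0, 1.0], centers)
margin = dynamic_margin(class_count=16)  # hypothetical training-set count
loss = cross_entropy(arcface_logits(cosines, label=0, margin=margin), label=0)
```

In this toy setup both classes start with the same best-matching cosine, so the margin alone makes the target class harder to satisfy. This is the pressure that pulls embeddings of the same landmark closer together during training.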

updated: Sat Oct 09 2021 17:56:40 GMT+0000 (UTC)

published: Sat Oct 09 2021 17:56:40 GMT+0000 (UTC)

arXiv
