Well Googled is Half Done: Multimodal Forecasting of New Fashion Product Sales with Image-based Google Trends

Geri Skenderi; Christian Joppi; Matteo Denitto; Marco Cristani

This paper investigates the effectiveness of systematically probing Google Trends against textual translations of visual aspects as exogenous knowledge to predict the sales of brand-new fashion items, for which no past sales data exist and only an image and a few metadata are available. In particular, we propose GTM-Transformer, short for Google Trends Multimodal Transformer, whose encoder works on the representation of the exogenous time series, while the decoder forecasts the sales using the Google Trends encoding together with the available visual and metadata information. Our model works in a non-autoregressive manner, which avoids compounding first-step errors over the forecast horizon. As a second contribution, we present the VISUELLE dataset, the first publicly available dataset for the task of new fashion product sales forecasting. It contains the sales of 5577 new products sold between 2016 and 2019, derived from genuine historical data of Nunalie, an Italian fast-fashion company. The dataset is equipped with images of products, metadata, related sales, and associated Google Trends. We use VISUELLE to compare our approach against state-of-the-art alternatives and numerous baselines, showing that GTM-Transformer is the most accurate in terms of both percentage and absolute error. Notably, adding exogenous knowledge improves forecasting accuracy by 1.5% in terms of WAPE, which shows the importance of exploiting Google Trends. The code and dataset are both available at https://github.com/HumaticsLAB/GTM-Transformer.
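
The abstract reports accuracy in terms of WAPE but does not define it. The following minimal Python sketch assumes the commonly used definition of Weighted Absolute Percentage Error (sum of absolute errors divided by sum of absolute actual values, times 100). The function name `wape` and the sample numbers are hypothetical and are not taken from the paper or its repository.

```python
from typing import Sequence


def wape(actual: Sequence[float], forecast: Sequence[float]) -> float:
    """Weighted Absolute Percentage Error in percent.

    Assumed definition: 100 * sum(|actual - forecast|) / sum(|actual|).
    """
    if len(actual) != len(forecast):
        raise ValueError("actual and forecast must have the same length")
    total = sum(abs(a) for a in actual)
    if total == 0:
        raise ValueError("WAPE is undefined when all actual values are zero")
    abs_error = sum(abs(a - f) for a, f in zip(actual, forecast))
    return 100.0 * abs_error / total


# Hypothetical weekly sales of one product and a hypothetical forecast.
actual_sales = [10.0, 20.0, 30.0, 40.0]
predicted_sales = [12.0, 18.0, 33.0, 37.0]
score = wape(actual_sales, predicted_sales)
print(f"WAPE: {score:.1f}%")
```

Because the errors are normalised by total sales rather than averaged per time step, low-volume weeks do not dominate the score, which is one reason a metric of this kind suits sales series that contain small or zero values.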

updated: Mon Sep 20 2021 20:15:08 GMT+0000 (UTC)

published: Mon Sep 20 2021 20:15:08 GMT+0000 (UTC)

arXiv
