Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles

Suleman Qamar; Saddam Hussain Khan; Muhammad Arif Arshad; Maryam Qamar; Asifullah Khan

動的障害物のある3D環境での自律ドローンスウォームナビゲーションとマルチターゲット追跡

手作業による作成は時間がかかり複雑な手順であり、実用的ではないため、人工群の自律モデリングが必要です。深層強化学習を採用した自律的アプローチは、群れナビゲーションのためのこの研究で提示されます。このアプローチでは、静的および動的な障害物と抵抗力（線形抗力、角度抗力、重力など）を含む複雑な3D環境をモデル化して、複数の動的ターゲットを追跡します。さらに、複雑な群れの行動を学習するために、強力な群れ形成とターゲット追跡のための報酬関数が考案されています。エージェントの数は固定されておらず、環境を部分的にしか遵守していないため、群れの形成とナビゲーションは困難になります。この点に関して、提案された戦略は、前述の課題に取り組むための3つの主要なフェーズで構成されます。1）動的な群れ管理の方法論、2）障害物の回避、ターゲットへの最短経路の検索、3）ターゲットの追跡と島のモデリング。動的な群れ管理フェーズは、基本的な感覚入力を高レベルのコマンドに変換して、群れのサイズの変動を維持しながら、群れのナビゲーションと分散型セットアップを強化します。島のモデリングでは、スウォームはターゲットの数に応じて個々のサブスウォームに分割できますが、逆に、これらのサブスウォームが結合して1つの巨大なスウォームを形成し、スウォームが複数のターゲットを追跡できるようにします。重要な結果を達成するために、カスタマイズされた最先端のポリシーベースの深層強化学習アルゴリズムが採用されています。有望な結果は、提案された戦略が群れのナビゲーションを強化し、複雑な動的環境で複数の静的および動的ターゲットを追跡できることを示しています。

Autonomous modeling of artificial swarms is necessary because manual creation is a time intensive and complicated procedure which makes it impractical. An autonomous approach employing deep reinforcement learning is presented in this study for swarm navigation. In this approach, complex 3D environments with static and dynamic obstacles and resistive forces (like linear drag, angular drag, and gravity) are modeled to track multiple dynamic targets. Moreover, reward functions for robust swarm formation and target tracking are devised for learning complex swarm behaviors. Since the number of agents is not fixed and has only the partial observance of the environment, swarm formation and navigation become challenging. In this regard, the proposed strategy consists of three main phases to tackle the aforementioned challenges: 1) A methodology for dynamic swarm management, 2) Avoiding obstacles, Finding the shortest path towards the targets, 3) Tracking the targets and Island modeling. The dynamic swarm management phase translates basic sensory input to high level commands to enhance swarm navigation and decentralized setup while maintaining the swarms size fluctuations. While, in the island modeling, the swarm can split into individual subswarms according to the number of targets, conversely, these subswarms may join to form a single huge swarm, giving the swarm ability to track multiple targets. Customized state of the art policy based deep reinforcement learning algorithms are employed to achieve significant results. The promising results show that our proposed strategy enhances swarm navigation and can track multiple static and dynamic targets in complex dynamic environments.

updated: Sun Feb 13 2022 08:26:28 GMT+0000 (UTC)

published: Sun Feb 13 2022 08:26:28 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト