论文信息 - Learning System Using Fuzzy ART for Two-Player Games

Learning System Using Fuzzy ART for Two-Player Games

Adaptive resonance theory neural network (ART) is an unsupervised learning system that can generate and grow the recognition categories based on the similarity between inputs and memories. By this feature, ART can solve the Stability-Plasticity Dilemma. In this report, we propose a learning system for two player games that actions or strategies of opponents change constantly. In the proposed system, an input state space is segmented adaptively by Fuzzy ART neural networks and then a player learns an input state-action pairs by the reinforcement learning. We applied the proposed system to a fighting action game that two players fight while selecting actions. As results of experiments, we show that the player acquired proper actions against opponents. 1 はじめに対戦型ゲームでは, プレーヤは互いの手の出し方から相手の戦略を読み, それに応じて自分の戦略を決定する. 同様に, 対戦相手も戦略を進化させていくため, 対戦で起こりうる状況, その状況における最適な行動戦略は常に変化していくと考えられる. このような動的環境においては, 現在の対戦相手の戦略に対する適応 (新しい事象を学習する可塑性)と, これまでに獲得した戦略の保持 (安定性)のバランスをとりながら学習していくことが重要であると考える. そこで本研究では, 動的環境である対戦型ゲームにおいて, 対戦相手の戦略変化に適応可能な学習システムを提案する. 提案システムでは, ファジィART[1, 2]によって分類された記憶のカテゴリがプレーヤに状態として与えられる. そして, 経験強化型の強化学習法の 1つである profit sharingを用い, 分類された状態空間に対応する行動選択を最適化する. 対戦環境は fighting action game[3]とし, 提案システムにより学習を行う学習プレーヤと, 複数の行動パターンを持つ敵プレーヤとの対戦実験を行う. 実験結果より, 学習プレーヤは敵プレーヤに対し適応的な戦略学習をできることを示す. 2 ファジィARTニューラルネットワークファジィARTはアナログ入力に対応可能なARTモデルであり,入力層とカテゴリ層から構成されている. ファジィARTの構造を図 1に示す. F1ニューロン iと F2 ニューロン jはボトムアップ荷重wijとトップダウン荷重 wjiによって相互結合しており, wij = wjiを満足する. また, トップダウン荷重ベクトル wj = [wj1, · · · , wjn] は F2 ニューロン jに属する記憶である. 荷重ベクトルの初期値は以下のように設定する. wji = · · · = wjm = 1 (1) 1 j m ・・・・・・ 1 i n ・・・・・・ w ij wji

Katsuari Kamei | Jun-ichi Kushida | Y. Hoshino | Iori Nakaoka

[1] Sung Hoon Jung,et al. Exploiting Intelligence in Fighting Action Games Using Neural Networks , 2006, IEICE Trans. Inf. Syst..