A repeated imitation model with dependence between stages: Decision strategies and rewards

Pablo J. Villacorta; David A. Pelta

Open Access

A repeated imitation model with dependence between stages: Decision strategies and rewards

Pablo J. Villacorta

and

David A. Pelta

| Sep 30, 2015

International Journal of Applied Mathematics and Computer Science

Volume 25 (2015): Issue 3 (September 2015)

About this article

Cite

Page range: 617 - 630

Received: Apr 01, 2014

DOI: https://doi.org/10.1515/amcs-2015-0045

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Adversarial decision making is aimed at determining strategies to anticipate the behavior of an opponent trying to learn from our actions. One defense is to make decisions intended to confuse the opponent, although our rewards can be diminished. This idea has already been captured in an adversarial model introduced in a previous work, in which two agents separately issue responses to an unknown sequence of external inputs. Each agent’s reward depends on the current input and the responses of both agents. In this contribution, (a) we extend the original model by establishing stochastic dependence between an agent’s responses and the next input of the sequence, and (b) we study the design of time varying decision strategies for the extended model. The strategies obtained are compared against static strategies from theoretical and empirical points of view. The results show that time varying strategies outperform static ones

eISSN:: 2083-8492
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Mathematics, Applied Mathematics

Journal RSS Feed

A repeated imitation model with dependence between stages: Decision strategies and rewards

Published Online: Sep 30, 2015

Page range: 617 - 630

Received: Apr 01, 2014

DOI: https://doi.org/10.1515/amcs-2015-0045

© by Pablo J. Villacorta

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.