References
- 1
- D. P. Bertsekas, Dynamic Programming and
Optimal Control. Belmont, MA: Athena Scientific, 1995.
- 2
- D. Foster and R. Vohra, "Regret in the on-line decision problem,"
Games and Economic Behavior, to appear, 1998.
- 3
- J. Hu and M. P. Wellman, "Self-fulfilling bias in multiagent
learning," Proceedings of ICMAS-96, AAAI Press, 1996.
- 4
- J. O. Kephart, J. E. Hanson and J. Sairamesh,
"Price-war dynamics in a free-market economy of software agents,"
to appear in: Proceedings of ALIFE-VI, Los Angeles, 1998.
- 5
- M. L. Littman, "Markov games as a framework for
multi-agent reinforcement learning," Proceedings of the Eleventh International
Conference on Machine Learning, pp. 157-163, Morgan Kaufmann, 1994.
- 6
- P. Milgrom and J. Roberts,
"Adaptive and sophisticated learning in normal form games,"
Games and Economic Behavior, 3:82-100, 1991.
- 7
- J. Sairamesh and J. O. Kephart, "Dynamics of price and
quality differentiation in information and computational markets,"
submitted to ICE-98, 1998.
- 8
- J. M. Vidal and E. H. Durfee, "Learning nested agent models
in an information economy," J. of Experimental and Theoretical AI, to
appear, 1998.