References

1
D. P. Bertsekas, Dynamic Programming and Optimal Control. Belmont, MA: Athena Scientific, 1995.

2
D. Foster and R. Vohra, ``Regret in the on-line decision problem.'' Games and Economic Behavior, to appear, 1998.

3
J. Hu and M. P. Wellman, ``Self-fulfilling bias in multiagent learning,'' Proceedings of ICMAS-96, AAAI Press, 1996.

4
J. O. Kephart, J. E. Hanson and J. Sairamesh, ``Price-war dynamics in a free-market economy of software agents,'' to appear in: Proceedings of ALIFE-VI, Los Angeles, 1998.

5
M. L. Littman, ``Markov games as a framework for multi-agent reinforcement learning,'' Proceedings of the Eleventh International Conference on Machine Learning, 157-163, Morgan Kaufmann, 1994.

6
P. Milgrom and J. Roberts. ``Adaptive and sophisticated learning in normal form games,'' Games and Economic Behavior, 3:82-100, 1991.

7
J. Sairamesh and J. O. Kephart, ``Dynamics of price and quality differentiation in information and computational markets,'' submitted to ICE-98, 1998.

8
J. M. Vidal and E. H. Durfee, ``Learning nested agent models in an information economy,'' J. of Experimental and Theoretical AI, to appear, 1998.



BACK TO INDEX PAGE | PREVIOUS SECTION