MARKOV GAMES AND THEIR SOLUTION USING THE BELLMAN-SHAPLEY EQUATION
Keywords: Markov games, stochastic game, Shapley's theorem, Bellman-Shapley equation, Nash equilibrium, value function, value iteration, discounted reward, optimal policy, minimax value.
Abstract. This article analyzes Markov games (stochastic games), one of the
important areas of operations research and optimal control. It reviews the
mathematical formulation of the Markov game introduced by Lloyd Shapley in 1953,
its main types (zero-sum and general-sum, cooperative and non-cooperative,
finite-horizon and infinite-horizon, discounted-reward and average-reward), and
the associated solution concepts. The theoretical results are then applied to a
practical case, a competition between two firms over advertising strategy, which
is solved step by step using the Bellman-Shapley recursive equation. The value
iteration algorithm is used to find the optimal value function and the
equilibrium policy for each state.
REFERENCES
1. Shapley L. S. Stochastic Games // Proceedings of the National Academy of Sciences
of the USA. — 1953. — Vol. 39, No. 10. — P. 1095–1100.
2. Bellman R. E. Dynamic Programming. — Princeton: Princeton University Press,
1957. — 342 p.
3. Puterman M. L. Markov Decision Processes: Discrete Stochastic Dynamic
Programming. — New York: John Wiley & Sons, 1994. — 649 p.
4. Filar J., Vrieze K. Competitive Markov Decision Processes. — New York:
Springer-Verlag, 1997. — 393 p.
5. Neyman A., Sorin S. (eds.) Stochastic Games and Applications. — Dordrecht:
Kluwer Academic Publishers, 2003. — 476 p.
6. Başar T., Olsder G. J. Dynamic Noncooperative Game Theory, 2nd edition. —
London: Academic Press, 1995. — 519 p.
7. Littman M. L. Markov Games as a Framework for Multi-Agent Reinforcement
Learning // Proceedings of the 11th International Conference on Machine Learning.
— 1994. — P. 157–163.
8. Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction, 2nd edition. —
Cambridge: MIT Press, 2018. — 552 p.
9. Mamadaliyev N., Tuxtasinov M. Variatsion hisob va optimal boshqaruvning asosiy
masalalari [Basic Problems of the Calculus of Variations and Optimal Control]. —
Tashkent: Universitet, 2013. — 188 p.
10. Owen G. Game Theory, 3rd edition. — San Diego: Academic Press, 1995. — 459 p.