MARKOV O'YINLARI VA ULARNI BELLMAN-SHAPLEY TENGLAMASI YORDAMIDA YECHISH

Authors

  • Mamatova Zilolaxon Xabibulloxonovna Author
  • Abduraxmonova Nozanin Robiljon qizi Author

Keywords:

Kalit so'zlar: Markov o'yinlari, stoxastik o'yin, Shapley teoremasi, Bellman- Shapley tenglamasi, Nash muvozanati, qiymat funksiyasi, qiymat iteratsiyasi, diskontlangan mukofot, optimal siyosat, minimaks qiymat.

Abstract

 
Annotatsiya. Ushbu maqolada jarayonlar tadqiqoti va optimal boshqaruvning 
muhim  yo'nalishlaridan  biri  —  Markov  o'yinlari  (stoxastik  o'yinlar)  tahlil  qilinadi. 
Maqolada  Lloyd  Shapley  tomonidan  1953-yilda  kiritilgan  Markov  o'yinining 
matematik shakllanishi, asosiy turlari (nol yig'indili, umumiy yig'indili, kooperativ va 
kooperatsiz, chekli va cheksiz gorizontli, diskontlangan va o'rtacha mukofotli) hamda 
yechim kontseptsiyalari ko'rib chiqiladi. Nazariy tahlil natijalari amaliy holat asosida 
— ikki firma o'rtasidagi reklama strategiyasi raqobati misolida — Bellman-Shapley 
rekurrent  tenglamasi  yordamida  qadam-baqadam  yechiladi.  Qiymat  iteratsiyasi 
algoritmi orqali har bir holat uchun optimal qiymat funksiyasi va muvozanat siyosati 
topiladi. 

References

FOYDALANILGAN ADABIYOTLAR

1. Shapley L. S. Stochastic Games // Proceedings of the National Academy of Sciences

of the USA. — 1953. — Vol. 39, No. 10. — P. 1095–1100.

2. Bellman R. E. Dynamic Programming. — Princeton: Princeton University Press,

1957. — 342 p.

3. Puterman M. L. Markov Decision Processes: Discrete Stochastic Dynamic

Programming. — New York: John Wiley & Sons, 1994. — 649 p.

4. Filar J., Vrieze K. Competitive Markov Decision Processes. — New York:

Springer-Verlag, 1997. — 393 p.

5. Neyman A., Sorin S. (eds.) Stochastic Games and Applications. — Dordrecht:

Kluwer Academic Publishers, 2003. — 476 p.

6. Başar T., Olsder G. J. Dynamic Noncooperative Game Theory, 2nd edition. —

London: Academic Press, 1995. — 519 p.

7. Littman M. L. Markov Games as a Framework for Multi-Agent Reinforcement

Learning // Proceedings of the 11th International Conference on Machine Learning.

— 1994. — P. 157–163.

8. Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction, 2nd edition. —

Cambridge: MIT Press, 2018. — 552 p.

9. Mamadaliyev N., Tuxtasinov M. Variatsion hisob va optimal boshqaruvning asosiy

masalalari. — Toshkent: Universitet, 2013. — 188 b.

10. Owen G. Game Theory, 3rd edition. — San Diego: Academic Press, 1995. — 459

p.

Published

2026-05-05

How to Cite

Mamatova Zilolaxon Xabibulloxonovna, & Abduraxmonova Nozanin Robiljon qizi. (2026). MARKOV O’YINLARI VA ULARNI BELLMAN-SHAPLEY TENGLAMASI YORDAMIDA YECHISH . Ta’lim Innovatsiyasi Va Integratsiyasi, 68(3), 299-306. https://journalss.org/index.php/tal/article/view/28070