APACHE KAFKA VA APACHE SPARK STRUCTURED STREAMING ASOSIDA OQIMLI MA’LUMOTLARNI QAYTA ISHLASH
Keywords:
Kalit so‘zlar: Apache Kafka, Apache Spark, Structured Streaming, oqimli ma’lumotlar, real vaqt, Big Data.Abstract
Annotatsiya: Ushbu maqolada real vaqt rejimida oqimli ma’lumotlarni qayta
ishlash uchun keng qo‘llaniladigan Apache Kafka va Apache Spark Structured
Streaming texnologiyalari yoritilgan. Maqolada ushbu texnologiyalarning asosiy
tushunchalari, ularning o‘zaro integratsiyasi, ishlash tamoyillari hamda amaliy qo‘llash
bosqichlari ko‘rib chiqiladi. Kafka va Structured Streaming’ning birgalikda
qo‘llanilishi katta hajmdagi ma’lumotlarni samarali, ishonchli va kengaytiriladigan
tarzda qayta ishlash imkonini berishi tahlil qilinadi.
References
Foydalanilgan adabiyotlar
1. Kreps J., Narkhede N., Rao J. Kafka: A Distributed Messaging System for
Log Processing // Proceedings of the NetDB Conference. – 2011.
2. Apache Software Foundation. Apache Kafka Documentation. – URL:
https://kafka.apache.org/documentation
3. Zaharia M., Xin R. S., Wendell P., et al. Apache Spark: A Unified Engine for
Big Data Processing // Communications of the ACM. – 2016. – Vol. 59(11). – P.
56–65.
4. Apache Software Foundation. Apache Spark Structured Streaming
Programming Guide. – URL: https://spark.apache.org/docs/latest/structured-
streaming-programming-guide.html
5. Karau H., Warren R. High Performance Spark: Best Practices for Scaling and
Optimizing Apache Spark. – O’Reilly Media, 2017.