Building a Data Lake on Google Cloud Platform
Posted on March 17, 2022, Level beginner Resource Length medium
When creating a platform, it's critical to have clearly defined customers and products that will benefit from it rather than building in a vacuum. By Javier Turegano Director, Software Engineering @slack.
Building Real-Time ETL Pipelines with Apache Kafka
Posted on February 17, 2022, Level beginner Resource Length short
Whether you're a data engineer, a data scientist, a software developer, or someone else working in the field of software and data - it's very likely that you have implemented an ETL pipeline before. By Stefan Sprenger.
Streaming analytics with Apache Pulsar and Spark structured streaming
Posted on February 12, 2022, Level beginner Resource Length long
Apache Pulsar, a promising new toolkit for distributed messaging and streaming. In this piece we combine two of our favorite pieces of tech: Apache Pulsar and Apache Spark. By Daniel Ciocîrlan.
Comparing the best web servers: Caddy, Apache, and Nginx
Posted on October 23, 2021, Level intermediate Resource Length medium
A web server is a piece of software that accepts a network request from a user agent, typically a web browser, and returns either the appropriate response for the request or an error message. Two dominant solutions for HTTP servers today are Apache and Nginx. However, a new player in the space, Caddy Web Server, is gaining traction for its ease of use. By Ayooluwa Isaiah.
Processing time-series data with Redis and Apache Kafka
Posted on June 22, 2021, Level beginner Resource Length medium
Learn how to analyze time-series data through RedisTimeSeries with Apache Kafka in this practical walkthrough. RedisTimeSeries is a Redis module that brings native time-series data structure to Redis. By Abhishek Gupta.
Integrating Apache Kafka into your CI/CD with Jenkins & Lenses
Posted on March 2, 2021, Level intermediate Resource Length medium
This tutorial will illustrate how you can integrate Lenses into your Jenkins CI/CD using the lenses-cli. The Lenses CLI allows you to manage your Apache Kafka environment using a single unified and secure API across all Kafka components (Kafka Brokers, Zookeepers, Kafka Connect, Schema Registry etc.). By Mihalis Tsoukalos.
Pulsar advantages over Kafka
Posted on November 19, 2020, Level intermediate Resource Length medium
Recently, author has been looking at Pulsar and how it compares to Kafka. A quick search will show you that there is a current war between the two most famous open source messaging systems. By Javier Ramos.
Apache Cassandra: An essentials guide
Posted on August 25, 2020, Level intermediate Resource Length long
Need to get up to speed on Cassandra and learn how it can benefit your software development practices? These are the essentials that cover the basics. By Rebecca Mills, Developer Relations @ DataStax.
Change data capture with Debezium: A simple how-to
Posted on May 19, 2020, Level intermediate Resource Length long
Eric Deandrea wrote this piece about one question that always comes up as organizations moving towards being cloud-native, twelve-factor, and stateless: How do you get an organization's data to these new applications?
Building an adaptive, multi-tenant stream bus with Kafka and Golang
Posted on February 20, 2020, Level intermediate Resource Length medium
Back in the 2000s, SOAP/WSDL with ESB (Enterprise Service Bus) was the dominant server-side architecture for many companies. Since the 2010s, microservices and service mesh technologies have grown wildly and thus became the de-facto industry standards. By Xinyu Liu.
Exploring an Apache Kafka to Pub/Sub migration: Major considerations
Posted on January 28, 2020, Level intermediate Resource Length medium
In many cases, Google's Pub/Sub messaging and event distribution service can successfully replace Apache Kafka, with lower maintenance and operational costs, and better integration with other Google Cloud services. By Leonid Yankulin.
Using Tableau with Kafka: How to build a real-time SQL dashboard on streaming data
Posted on November 9, 2019, Level intermediate Resource Length medium
Scott Morris, Software Engineer at Rockset wrote this piece about building a real-time dashboard for operational monitoring and analytics on streaming event data from Kafka, which often requires complex SQL, including filtering, aggregations, and joins with other data sets.