Tag: Apache

Apache web server hardening and security guide

Posted on April 21, 2024, Level intermediate Resource Length medium

The Web Server is a crucial part of web-based applications. Apache Web Server is often placed at the edge of the network; hence it becomes one of the most vulnerable services to attack. A practical guide to secure and harden Apache HTTP Server. By Chandan Kumar.

Tags apache web-development cloud software-architecture infosec
Introducing WAP pattern support with Apache Iceberg

Posted on February 3, 2024, Level intermediate Resource Length long

If you're using SQLMesh alongside Apache Spark and Apache Iceberg, I have some exciting news for you! Starting from version 0.57.0, SQLMesh applies the Write-Audit-Publish (WAP) pattern when executing models using Apache Spark and the Apache Iceberg data format. The best part? No user action is required to enable this behavior - it's enabled by default. By Iaroslav Zeigerman.

Tags analytics big-data app-development apache devops
Apache ZooKeeper: The perfect tool for configuration management

Posted on December 17, 2023, Level beginner Resource Length medium

Apache ZooKeeper is an open-source distributed coordination system that provides a platform for configuration management, process synchronization, and lock management. Originally developed by Yahoo, it is now maintained by the Apache Software Foundation. By datascientest.com.

Tags event-driven software-architecture management devops apache
Web server load balancing: Techniques and best practices

Posted on August 25, 2023, Level beginner Resource Length medium

Unveiling Dart 3.1: A New Horizon for Functional Programming in Flutter Companies across the globe seek fast system performance and quick responses when it comes to websites and modern applications. Often such high traffic websites must cater to millions of requests from end users as well as clients simultaneously. In such scenarios, a single server may not be able to handle the network traffic. By Hitesh Jethva.

Tags web-development app-development servers apache nginx
How to enable HSTS for enhanced web security in Apache

Posted on May 13, 2023, Level intermediate Resource Length medium

HTTP Strict Transport Security (HSTS) is a web security policy mechanism that helps to protect websites against protocol downgrade attacks and cookie hijacking. It allows web servers to declare that web browsers (or other complying user agents) should interact with it using only secure HTTPS connections, and never via the insecure HTTP protocol. This article will guide you on how to implement and optimize HSTS in Apache for improved web security. By Rahul.

Tags app-development infosec web-development apache ssl
Simplified data pipelines with Pulsar transformation functions

Posted on April 24, 2023, Level intermediate Resource Length medium

They provide a low-code way to develop basic processing and routing of data using existing Pulsar features. Using functions in the cloud is a very efficient way of creating iterable workflows that can transform data, analyze source code, make platform configurations, and do many other useful jobs. As you develop a function you will quickly realize a need for a solid foundation of utilities and formatting. By Christophe Bornet.

Tags app-development data-science apache big-data
Comparing Avro vs Protobuf for data serialization

Posted on April 18, 2023, Level beginner Resource Length short

Data serialization is a crucial aspect of modern distributed systems because it enables the efficient communication and storage of structured data. In this article, we will discuss two popular serialization formats: Avro and Protocol Buffers, Protobuf for short, and compare their strengths and weaknesses to help you make an informed decision about which one to use in your projects. By Daniel Selans.

Tags json queues messaging app-development streaming apache
Using Vulcan codecs with Kafka Java APIs

Posted on April 17, 2023, Level intermediate Resource Length medium

For those that aren't familiar, Vulcan is a functional Avro encoding library that uses the official Apache Avro library under the hood. The difference between this and the official Avro build plugins approach is that the types are defined in plain Scala. Then the Avro schema is generated from those instead of defining the Avro schema and getting code generated at compile time that adheres to that schema. By César Enrique.

Tags apache java messaging app-development streaming scala
Real-time data linkage via Linked Data Event Streams

Posted on April 12, 2023, Level intermediate Resource Length long

Real-time interchanging data across domains and applications is challenging; data format incompatibility, latency and outdated data sets, quality issues, and lack of metadata and context. A Linked Data Event Stream (LDES) is a new data publishing approach which allows you to publish any dataset as a collection of immutable objects. The focus of an LDES is to allow clients to replicate the history of a dataset and efficiently synchronize with its latest changes. By towardsai.net.

Tags data-science streaming performance how-to big-data apache
Deploy Apache Flink cluster on Kubernetes

Posted on March 11, 2023, Level intermediate Resource Length medium

When it comes to deploying Apache Flink on Kubernetes, you can do it in two modes, either session cluster or job cluster. A session cluster is a running standalone cluster that can run multiple jobs, while a Job cluster deploys a dedicated cluster for each job. By Elvis David.

Tags apache devops cloud data-science big-data
How to orchestrate an ETL Data Pipeline with Apache Airflow

Posted on March 10, 2023, Level intermediate Resource Length medium

Data Orchestration involves using different tools and technologies together to extract, transform, and load (ETL) data from multiple sources into a central repository. By Aviator Ifeanyichukwu.

Tags apache database nosql data-science python big-data
Using Apache Kafka to process 1 trillion inter-service messages

Posted on January 27, 2023, Level intermediate Resource Length long

Cloudflare has been using Kafka in production since 2014. We have come a long way since then, and currently run 14 distinct Kafka clusters, across multiple data centers, with roughly 330 nodes. Between them, over a trillion messages have been processed over the last eight years. By Matt Boyle.

Tags event-driven apache apis app-development database

Tag: Apache

Apache web server hardening and security guide

Tags apache web-development cloud software-architecture infosec

Introducing WAP pattern support with Apache Iceberg

Tags analytics big-data app-development apache devops

Apache ZooKeeper: The perfect tool for configuration management

Tags event-driven software-architecture management devops apache

Web server load balancing: Techniques and best practices

Tags web-development app-development servers apache nginx

How to enable HSTS for enhanced web security in Apache

Tags app-development infosec web-development apache ssl

Simplified data pipelines with Pulsar transformation functions

Tags app-development data-science apache big-data

Comparing Avro vs Protobuf for data serialization

Tags json queues messaging app-development streaming apache

Using Vulcan codecs with Kafka Java APIs

Tags apache java messaging app-development streaming scala

Real-time data linkage via Linked Data Event Streams

Tags data-science streaming performance how-to big-data apache

Deploy Apache Flink cluster on Kubernetes

Tags apache devops cloud data-science big-data

How to orchestrate an ETL Data Pipeline with Apache Airflow

Tags apache database nosql data-science python big-data

Using Apache Kafka to process 1 trillion inter-service messages

Tags event-driven apache apis app-development database