Welcome to curated list of handpicked free online resources related to IT, cloud, Big Data, programming languages, Devops. Fresh news and community maintained list of links updated daily. Like what you see? [ Join our newsletter ]

Kubeflow fundamentals: Distributions and installations

Categories

Tags devops cloud kubernetes machine-learning

The aim of the series is to walk you through a detailed introduction of Kubeflow, a deep-dive into the various components and how they all come together to deliver a complete MLOps platform. By Jimmy Guerrero.

In this post we’ll take a look at the different Kubeflow distributions that are available and walk you through some installations using MiniKF:

  • Installing Kubeflow
  • Packaged Kubeflow Distributions
  • Other Platforms with Kubeflow Packaged Distributions

For the purposes of this blog we are going to focus on getting up and running with MiniKF. Why?

  • MiniKF is the easiest distribution to get started with, even for folks with limited Kubernetes experience
  • MiniKF is cross platform. It runs on AWS, GCP and even locally via Vagrant
  • MiniKF comes with prebundled add-ons like Kale and Rok that make it much easier to build pipelines and manage data then the basic Kubeflow distribution offers

You will find video tutorials which will walk you through installation steps on various platforms. Nice one!

[Read More]

Postgres full-text search: Search engine in a database

Categories

Tags database search microservices nosql software-architecture cloud sql

So when we say PostgreSQL is the “batteries included database,” this is just one reason why. With Postgres, you don’t need to immediately look farther than your own database management system for a full-text search solution. If you haven’t yet given Postgres’ built-in full-text search a try, read on for a simple intro. By Kat Batuigas.

The article describes:

  • Postgres full-text search basics for the uninitiated
  • Example: Searching storm event details
  • Functions for weighting and ranking search results

You can get even deeper and make your Postgres full-text search even more robust, by implementing features such as highlighting results, or writing your own custom dictionaries or functions. You could also look into enabling extensions such as unaccent (remove diacritic signs from lexemes) or pg_trgm (for fuzzy search). Speaking of extensions, those were just two of the extensions supported in Crunchy Bridge. We’ve built our managed cloud Postgres service such that you can dive right in and take advantage of all these Postgres features. Good read with SQL query and code exmaples!

[Read More]

Distributed transaction patterns for microservices compared

Categories

Tags cio devops software-architecture microservices apis

One thing most customers want to know is how to coordinate writes to more than one system of record. Answering this question typically involves a long explanation of dual writes, distributed transactions, modern alternatives, and the possible failure scenarios and drawbacks of each approach. By Bilgin Ibryam.

The single indicator that you may have a dual write problem is the need to write to more than one system of record predictably. This requirement might not be obvious and it can express itself in different ways in the distributed systems design process.

The article then walks you through:

  • The dual write problem
  • The modular monolith
  • Orchestration architecture
  • Choreography pattern
  • Parallel pipelines pattern
  • How to choose a distributed transactions strategy

Levels of code and data isolation for applications

Source @https://developers.redhat.com: https://developers.redhat.com/articles/2021/09/21/distributed-transaction-patterns-microservices-compared#the_modular_monolith

Each pattern is explained from architecture, benefits and drawback points of view. In a sizable distributed system with tens of services, there won’t be a single approach that works for all, but a few of these combined and applied for different contexts. You might have a few services deployed on a shared runtime for exceptional requirements around data consistency. You might choose a two-phase commit for integration with a legacy system that supports JTA. You might orchestrate a complex business process, and also use choreography and parallel processing for the rest of the services. In the end, it doesn’t matter what strategy you pick; what matters is choosing a strategy deliberately for the right reasons, and executing it. Excellent read!

[Read More]

Anthos service mesh

Categories

Tags app-development devops software-architecture microservices

Over the course of this series, we are going to cover various topics associated with Google Cloud’s Anthos. The series will involve conceptual understanding supplemented by practical tutorials for you to get up to speed on what some consider a revolutionary piece of technology. By Alfred Tommy.

When you have a microservices based architecture, it proves challenging to manage these individual services. For example, you may want to authenticate/authorise requests between services, you would probably like to get some observability on the network traffic between services, you may even want to split traffic between services. All this and more can be achieved with a service mesh.

Anthos actually comprises of a suite of services, the key ones being:

  • Infrastructure management
  • Container management and orchestration
  • Service management
  • Policy enforcement

The article then explains in detail installing Anthos Service Mesh with a Google Managed Control Plane. Plenty of screenshots and command line exmaples will get you going. Nice one!

[Read More]

Ionic Portals: Introducing the supercharged web view for native apps

Categories

Tags app-development how-to web-development frontend android ios

Ionic Portals offers a supercharged native Web View component for iOS and Android that enables teams to add web-based experiences to native mobile apps. Now, native and web teams can better collaborate while bringing new and existing web experiences to mobile in a safe, controlled way. By Josh Thomas.

The best part? Each Portal can be updated individually. Say you have a web team that manages the FAQ Portal, one that manages the checkout experience, and one that manages a user profile page. The checkout team can safely ship an update to their web experience whenever they want, without disrupting the other web teams or the native release process. Sign up to be notified when this launches this Fall.

The article provides introduction video explaining the concept plus reads about:

  • Why Ionic Portals?
  • How Ionic Portals works
  • iOS configuration
  • Configuring Android
  • Adding a Portal (web app)
  • Beyond the basics

So in essence, this is a huge step in our evolution as a platform and the fulfillment of Ionic’s Web Native vision, one where web developers push the limits of what’s possible with modern web technology and APIs.

[Read More]

AWS WAF and CloudFront: How to use them together

Categories

Tags devops how-to aws infosec cloud

Using AWS CloudFront and AWS WAF together, you can add some security to your sites with less work and focus on making features for your users. By Peter Grainger.

In this post, author will provide some information about AWS CloudFront, AWS WAF, and SQL injection. Then he will provide a step-by-step guide on how to use the two AWS services together to protect against SQL injection.

The main content of the article:

  • Some background information on AWS WAF, CloudFront, and injection
  • Providing protection for the GeoJS API using AWS WAF
  • How to use CloudFront and AWS WAF together to protect against SQL injection attacks

Injection is the number one critical risk for web applications according to OWASP. Setting up AWS WAF with AWS managed rules means you have less maintenance updating the WAF for each new vulnerability. You connect to the AWS WAF in the CloudFront distribution wizard when creating a new distribution. After configuring both services, you can test the WAF will block incoming SQL injection attacks before they reach your application.

You will also get detailed breakdown of steps to set up WAF for this scenario together with screenshots. Nice one!

[Read More]

How we shipped PostgreSQL 14 on Azure within one day of its release

Categories

Tags devops how-to learning azure database cloud

In this blog post, you’ll first get a glimpse into some of our favorite features in Postgres 14. These include connection scaling, faster VACUUM, and improvements to crash recovery times. By Ozgun Erdogan.

With each new PostgreSQL release, there can be breaking changes with any of the above integration points. The process of making extensions compatible with Postgres versions is incorporating changes to these integration points. For example, with PostgreSQL 14, the utility hook’s signature changed to include a new argument. So, we had to incorporate this change, as shown below. You can also read the complete set of changes for Postgres 14 integration in this pull request.

The article the reads about:

  • Favorite new features in PostgreSQL 14
  • Making Citus & other extensions compatible with PostgreSQL 14
  • Hyperscale (Citus) – Releasing a new PostgreSQL version

The first best practice is the separation of responsibilities between Hyperscale (Citus)’s control plane and data plane. In our architecture, the control plane is responsible for the business logic for managing Postgres/Citus databases. This logic includes periodic health checks, high availability and failover, backup and restore, read replicas, regular maintenance operations, and others. The data plane is solely responsible for running the database. As such, the data plane contains almost nothing else other than stock PostgreSQL and its extensions.

In summary, authors were excited to announce general availability (GA) for Postgres 14 on Azure, within one day of the official Postgres 14 release. Good read!

[Read More]

12 critical resources to help you learn DevSecOps

Categories

Tags learning devops infosec containers

Whether you are a seasoned DevOps engineer who wants to branch out to DevSecOps, or you’re just starting your career and defining your direction, there has perhaps never been a better time to enter the DevSecOps career field or learn about cloud native security. By Maor Goldberg.

Indeed, as Gartner notes, there was already a shortage of skilled cybersecurity professionals — a field that includes DevSecOps — as of 2019, and “the global pandemic has further escalated this situation.” That’s due especially to the surge in cybersecurity attacks during 2020 and into 2021, which has made companies more eager than ever to hire engineers who understand DevSecOps and cloud native security.

The article then reads about:

  • How do you learn DevSecOps?
  • Learning resources for DevSecOps and cloud native security
    • Book: Cloud Native Security
    • Book: Hands-On Security in DevOps
    • Book: The DevOps Handbook
    • Book: Container Security
    • Book: Securing DevOps
    • Podcast: “The DevSecOps Talks”
    • Courses: KubeAcademy

… and more. There are many ways to learn DevSecOps and cloud native security. Whether you prefer books, courses, podcasts or a combination thereof, you’ll find a DevSecOps learning resource tailored to your preferences. Good read!

[Read More]

How to handle data duplication in data-heavy Kubernetes environments

Categories

Tags data-science devops how-to learning big-data

It’s convenient to create a copy of your application with a copy of its state for each team. For example, you might want a separate database copy to test some significant schema changes or develop other disruptive operations like bulk insert/delete/update… By Augustinas Stirbis.

Duplicating data takes a lot of time. That’s because you need first to download all the data from a source block storage provider to compute and then send it back to a storage provider again. There’s a lot of network traffic and CPU/RAM used in this process. Hardware acceleration by offloading certain expensive operations to dedicated hardware is always a huge performance boost. It reduces the time required to complete an operation by orders of magnitude.

The article then covers:

  • Volume Snapshots to the rescue
  • Solution? Creating a Golden Snapshot externally
  • High-level plan for preparing the Golden Snapshot
  • High-level plan for cloning data for each team
    • Step 1: Identify disk
    • Step 2: Prepare your golden source
    • Step 3: Get your Disk Snapshot ID
    • Step 4: Create a development environment for each team

You will also get a loads of screenshots and config yaml files to go with this article. At the end of the tutorial you have Golden Snapshot, which is immutable data. Each team will get a copy of this data, and team members can modify it as they see fit, given that a new EBS/persistent disk will be created for each team. Good read!

[Read More]

How to deploy a machine learning model with FastAPI, Docker and Github Actions

Categories

Tags machine-learning learning big-data devops agile

You’ve just trained a model and you’re happy with it because it performs well on your local cross-validation. Now is the time to put this model in production so that other teams within your organization can consume it and embed it in their applications. By Ahmed Besbes.

The tutorial also covers:

  • Introduction to production machine learning and APIs
  • A quick overview of FastAPI features
  • Using FastAPI and SpaCy to build an inference API
  • Packaging the API with Docker and docker-compose
  • Deploying the API to AWS EC2 and automating the process with a Github Actions CI/CD

Broadly speaking and without going into many details, putting a model in production is a process in which a model is integrated into an existing IT environment and made available to other teams to use and consume. You will also get links to further resources and reading. Good read!

[Read More]