Tag: Big data
-
Deconstructing the 'CAP theorem' for CM and DevOps
Posted on January 18, 2026, Level advanced Resource Length long
As software engineering and operations forge a new cultural bond around continuous improvement of applications and infrastructure, the database is something "dev" and "ops" have in common -- and there are things to learn from both perspectives on distributed data. By Mark Burgess.
Tags devops distributed learning database big-data
-
Scientists built an AI co-pilot for prosthetic bionic hands
Posted on December 28, 2025, Level beginner Resource Length short
An AI assistant dramatically improves the usability of bionic hands, boosting success rates in delicate tasks and reducing the cognitive load on users. By Jacek Krywko.
Tags ai software learning cio management big-data data-science
-
How to find and remove unused Azure Data Factory Pipelines
Posted on December 17, 2025, Level intermediate Resource Length long
A 20-line PowerShell snippet scans every subscription, flags ADF factories that haven't executed a pipeline in 30 days, and hands you a clean-up hit-list in seconds. By Dieter Gobeyn.
Tags web-development app-development devops azure big-data
-
Why Apache Flink is not going anywhere
Posted on December 3, 2025, Level intermediate Resource Length long
Flink's complexity stems from supporting a variety of use cases and having a rich set of features, but it can be simplified with proper tooling. By Yaroslav Tkachenko.
Tags apache data-science big-data devops software-architecture
-
Microservices for machine learning
Posted on August 7, 2025, Level intermediate Resource Length medium
Learn how the author scaled an ML-powered finance tracker by breaking a monolithic design into microservices for better performance, maintainability, and deployment. The finance tracker project started with a simple idea: automatically categorize bank transactions using a text classification model. The author trained a basic logistic regression model on their own transaction history, wrapped it in a Flask API, and called it done. By Ramya Boorugula.
Tags microservices machine-learning big-data cloud agile
-
Apache Airflow for MLOPS and ETL - Description, benefits and examples
Posted on July 24, 2025, Level intermediate Resource Length long
Apache Airflow is a leading open-source tool for workflow orchestration, designed to manage complex tasks in Python. Developed by Airbnb and now part of the Apache Software Foundation, it's widely adopted for its flexibility and scalability in data engineering workflows. By Rost Glukhov.
Tags apache open-source analytics big-data data-science
-
How to develop a RAG system using Node.js
Posted on July 20, 2025, Level beginner Resource Length medium
In this blog post, we'll build a simplified but powerful RAG system using Node.js and OpenAI's GPT model, perfect for developers curious to bridge the gap between raw LLM power and domain-specific intelligence. By Deep Panchal.
Tags nodejs app-development frontend web-development big-data
-
Evogene and Google Cloud unveil foundation model for generative molecule design
Posted on June 18, 2025, Level beginner Resource Length long
Evogene and Google Cloud are accelerating life science discovery with ChemPass AI, a generative AI foundation model focused on small-molecule design. Launched in May, this collaboration dramatically reduces the time and cost associated with identifying novel drug candidates and crop protection agents. ChemPass AI's core strength lies in its ability to simultaneously optimize multiple critical properties – potency, toxicity, stability, and bioavailability – within a single molecule generation cycle, surpassing previous approaches. By Antoine Tardif.
Tags cloud data-science gcp big-data google
-
Anonymize RAG data in IBM Granite and Ollama using HCP Vault
Posted on June 8, 2025, Level beginner Resource Length long
In this summary, we explore how IBM Granite and Ollama can be integrated with HCP Vault to handle sensitive data effectively. The method involves configuring Vault to mask or tokenize data before passing it to the LLMs, ensuring that personally identifiable information (PII) is protected. By Rosemary Wang.
Tags ibm big-data ai data-science devops
-
Introducing KBLaM: Bringing plug-and-play external knowledge to LLMs
Posted on May 20, 2025, Level beginner Resource Length medium
Large language models (LLMs) have demonstrated remarkable capabilities in reasoning, language understanding, and even creative tasks. Yet, a key challenge persists: how to efficiently integrate external knowledge. By Taketomo Isazawa.
Tags azure cloud ai cio big-data
-
Retrieval Augmented Generation (RAG) tutorial for beginners
Posted on May 9, 2025, Level beginner Resource Length medium
Retrieval-augmented generation (RAG) is an AI approach that improves machine understanding and response accuracy. By integrating traditional AI language models with real-time retrieval of relevant external data, RAG bridges knowledge gaps, enabling more precise and contextually rich answers. By Vidhi Gupta.
Tags machine-learning data-science big-data ai learning
-
Learnings from a machine learning engineer — data
Posted on May 5, 2025, Level beginner Resource Length long
Practical insights for a data-driven approach to model optimization. By David Martin.
Tags machine-learning data-science big-data how-to learning