Tag: Data science
-
Humanness in the age of AI
Posted on April 7, 2023, Level beginner Resource Length long
A path to an open and permissionless identity protocol. The Worldcoin project is initiating an open and permissionless identity protocol called World ID. It empowers individuals to verify their humanness online while maintaining their anonymity through zero-knowledge proofs. By @worldcoin.org.
Tags crypto big-data cloud cio data-science miscellaneous
-
Mastering weather predictions: AI with LSTM Deep Learning models for accurate temperature forecasts
Posted on April 1, 2023, Level beginner Resource Length long
Predicting temperature trends with advanced deep learning techniques using LSTM. Weather forecasting is one of the most important tools in the modern world and developing a good temperature prediction model can be a huge competitive advantage for many businesses. Ambient temperature measurement is directly linked to several business areas such as agriculture, energy sector, trading, aviation, and many other sectors. By Octavio Santiago.
Tags big-data data-science machine-learning app-development learning
-
From 50 ML projects, 48 made it to production within 2 weeks. How?
Posted on March 20, 2023, Level beginner Resource Length medium
Putting machine learning (ML) models in production is considered an operational challenge that is performed after all the hard work on training and optimizing the model is completed. In contrast, serverless ML starts with a minimal model, including the operational feature pipeline(s) and inference pipeline. By Jim Dowling.
Tags big-data data-science cloud cio devops
-
Pandas 2.0 and its ecosystem (Arrow, Polars, DuckDB)
Posted on March 15, 2023, Level intermediate Resource Length medium
Data manipulation and analysis can be challenging and involve working with large datasets. Thankfully, a widely used Python library known as Pandas has become the go-to tool for processing and manipulating data. Pandas recently got an update, which is version 2.0. This article takes a closer look at what Pandas is, its success, and what the new version brings, including its ecosystem around Arrow, Polars, and DuckDB. By Simon Späti.
Tags big-data data-science python programming
-
A deep dive into AIOps and MLOps
Posted on March 14, 2023, Level intermediate Resource Length medium
Monitoring and managing a DevOps environment is complex. The volume of data generated by new distributed architectures (such as Kubernetes) makes it difficult for DevOps teams to effectively respond to customer requests. By Hicham Bouissoumer, Nicolas Giron.
Tags big-data data-science devops cloud
-
Deploy Apache Flink cluster on Kubernetes
Posted on March 11, 2023, Level intermediate Resource Length medium
When it comes to deploying Apache Flink on Kubernetes, you can do it in two modes, either session cluster or job cluster. A session cluster is a running standalone cluster that can run multiple jobs, while a Job cluster deploys a dedicated cluster for each job. By Elvis David.
Tags apache devops cloud data-science big-data
-
How to orchestrate an ETL Data Pipeline with Apache Airflow
Posted on March 10, 2023, Level intermediate Resource Length medium
Data Orchestration involves using different tools and technologies together to extract, transform, and load (ETL) data from multiple sources into a central repository. By Aviator Ifeanyichukwu.
Tags apache database nosql data-science python big-data
-
How to transform time series for deep learning
Posted on February 26, 2023, Level intermediate Resource Length long
Forecasting with deep neural networks. A time series is a sequence of values ordered in time. So, it needs to be transformed for supervised learning. By Vitor Cerqueira.
Tags machine-learning app-development data-science how-to big-data iot
-
Temporal graph neural networks with Pytorch - How to create a simple recommendation engine on an Amazon dataset
Posted on February 11, 2023, Level beginner Resource Length long
Over the course of the last few months, we at Memgraph have been working on something that we believe could be helpful with classical graph prediction tasks. With our latest newborn query module, you will have the option of performing both label classification and link prediction. By Antonio Filipovic.
Tags cloud big-data devops data-science python
-
Data integration vs. data ingestion: What are the differences?
Posted on February 5, 2023, Level intermediate Resource Length medium
Data integration and data ingestion are two IT disciplines that are often confused with one another. Here's how they differ and the challenges you may encounter. By Aminu Abdullahi.
Tags big-data cio data-science machine-learning
-
Data structures 101: How to use stacks and queues in Java
Posted on January 17, 2023, Level intermediate Resource Length long
Mastering data structures is a non-negotiable skill for success as a developer. Efficient data structures help execute effective programs. Today, many programming roles require great knowledge of data structures. They are also a fundamental part of coding interviews. By The Educative Team.
Tags java app-development programming queues data-science
-
Building more efficient data infrastructure for machine learning
Posted on December 17, 2022, Level intermediate Resource Length medium
The current influx of data — structured, semi-structured, and unstructured — being driven by an array of data sources is fueling opportunities to leverage machine learning to extract insights and accelerate innovations that can transform businesses and industries. As these data volumes continue to rise, companies are struggling with the complicated task of managing this data and figuring out how to harness it for analytics and AI. By Vedant Jain, Denny Lee.
Tags cio open-source big-data data-science machine-learning