Tag: Big data
-
Processing satellite image collections in R with the gdalcubes package
Posted on July 19, 2019, Level intermediate Resource Length long
Marius Appel wrote this piece about processing satellite image collections. This blog post introduces the gdalcubes R package, aiming at making the work with collections and time series of satellite imagery easier and more interactive.
Tags programming data-science big-data
-
Microsoft's Quantum Development Kit goes open source on GitHub
Posted on July 12, 2019, Level beginner Resource Length short
Announcement form Microsoft about open sourcing Quantum Devleopment Kit (QDK). The QDK launched in preview last year, and is Microsoft's attempt to get developers familiar with quantum computing before it goes mainstream. Developers will now be able to contribute to the QDK's development via GitHub. Quantum for everyone.
Tags big-data open-source cloud machine-learning data-science software
-
Python machine learning tutorial: predicting Airbnb prices
Posted on July 11, 2019, Level intermediate Resource Length long
Machine learning is pretty undeniably the hottest topic in data science right now. It's also the basic concept that underpins some of the most exciting areas in technology, like self-driving cars and predictive analytics. An article by dataquest.io in which you will be introduced to the fundamental concepts of machine learning. As you follow along, you'll build your very first model from scratch to make predictions, while understanding exactly how your model works.
Tags big-data machine-learning how-to python
-
Searching for ET using AI on GCP
Posted on July 10, 2019, Level intermediate Resource Length long
Rob Harrand wrote this article about aproject playing with open data from SETI. They say that the best way to learn data science is to create something. Some of the most interesting data publicly available in GitHub repositories is data from the SETI Institute (the Search for Extraterrestial Intelligence).
Tags big-data analytics cloud machine-learning data-science
-
Finding CRAN packages right from the R console
Posted on June 25, 2019, Level intermediate Resource Length short
The article from Joachim Zuckarelli about working woth Rlang. Currently, there are more than 14,000 R package contributions on CRAN providing R with an unparalleled wealth of features. The downside of the large and increasing amount of packages is that it becomes increasingly difficult to find the right tools to tackle a specific problem.
Tags programming big-data data-science
-
Image recognition in Python with TensorFlow and Keras
Posted on June 14, 2019, Level intermediate Resource Length medium
One of the most common utilizations of TensorFlow and Keras is the recognition/classification of images. If you want to learn how to use Keras to classify or recognize images, this article will teach you how.
Tags python big-data data-science
-
How to create histogram in Rlang
Posted on May 22, 2019, Level intermediate Resource Length short
In this article the author will show you how to create histogram in R using ggplot2 package. Written by Data Sharkie. When we get a new dataset for our analysis or research, often we would like to learn about the frequency of occurrence distribution of the variable of interest.
Tags analytics miscellaneous big-data cio data-science
-
Building self-served ETL pipeline for third-party data ingestion
Posted on April 18, 2019, Level intermediate Resource Length medium
An article by Nikolaos Tsipas from Skyscanner with help of colleagues Omar Kooheji and Michael Okarimia about how to solve the puzzle when there is a need to import datasets from external sources, and make them available for querying. Examples of imported data include: analytics metrics, advertising data, and currency exchange rates, all of which are used by Skyscanner engineers and data scientists.
Tags big-data data-science software-architecture
-
Google's EdgeTPU benchmarked vs Intel's Movidius
Posted on March 24, 2019, Level beginner Resource Length short
An article written by Frederik Bode about the first benchmark of Google's EdgeTPU Dev Board is in. Read about comparison is made against Intel's (first generation) Movidius Neural Compute Stick, and Google is the clear winner regarding inference time.
Tags big-data data-science analytics machine-learning
-
The data science behind Natural Language Processing
Posted on March 22, 2019, Level beginner Resource Length medium
John Thuma published this piece about the data science behind Natural Language Processing (NLP). Human communication is one of the most fascinating attributes of being sentient. We communicate in a variety of ways including speech and written symbols.
Tags miscellaneous big-data data-science learning
-
Managing analysis workflows in geospatial data science with GNU Make
Posted on March 3, 2019, Level intermediate Resource Length long
Martà Bosch wrote this guide how to go about using Jupyter Notebooks while using iterative approach to both data analysis and software development. He will also explain how to avoid some bad practices. Many issues can be settled by choosing helpful file names, good organization, documentation and source control of the code.
Tags big-data machine-learning data-science miscellaneous python
-
Understanding stabilising experience replay for deep multi-agent reinforcement learning
Posted on March 1, 2019, Level advanced Resource Length long
An article by Parnian Barekatain in which she describes some basic concepts in Reinforcement Learning. She also provides you with the link to Udacity's free course on Deep Learning with Pytorch.
Tags big-data machine-learning data-science miscellaneous