Tag: Data science

Microsoft's Quantum Development Kit goes open source on GitHub

Posted on July 12, 2019, Level beginner Resource Length short

Announcement form Microsoft about open sourcing Quantum Devleopment Kit (QDK). The QDK launched in preview last year, and is Microsoft's attempt to get developers familiar with quantum computing before it goes mainstream. Developers will now be able to contribute to the QDK's development via GitHub. Quantum for everyone.

Tags big-data open-source cloud machine-learning data-science software
Searching for ET using AI on GCP

Posted on July 10, 2019, Level intermediate Resource Length long

Rob Harrand wrote this article about aproject playing with open data from SETI. They say that the best way to learn data science is to create something. Some of the most interesting data publicly available in GitHub repositories is data from the SETI Institute (the Search for Extraterrestial Intelligence).

Tags big-data analytics cloud machine-learning data-science
The differences between Bitcoin and Libra should matter to policymakers

Posted on July 8, 2019, Level beginner Resource Length long

Peter van Valkenburgh article about differences between Bitcoin and Facebook's Libra. The two have different design goals, work in different ways, and raise different regulatory questions.

Tags crypto fintech data-science
Finding CRAN packages right from the R console

Posted on June 25, 2019, Level intermediate Resource Length short

The article from Joachim Zuckarelli about working woth Rlang. Currently, there are more than 14,000 R package contributions on CRAN providing R with an unparalleled wealth of features. The downside of the large and increasing amount of packages is that it becomes increasingly difficult to find the right tools to tackle a specific problem.

Tags programming big-data data-science
Image recognition in Python with TensorFlow and Keras

Posted on June 14, 2019, Level intermediate Resource Length medium

One of the most common utilizations of TensorFlow and Keras is the recognition/classification of images. If you want to learn how to use Keras to classify or recognize images, this article will teach you how.

Tags python big-data data-science
Look at how Twitter handles its time series data ingestion challenges

Posted on June 11, 2019, Level intermediate Resource Length short

Ram Dagar is author of this overview on the time series topic. The components of time-series are as complex and sophisticated as the data itself. With increasing time, the data obtained increases and it doesn't always mean that more data means more information but, larger sample avoids the error that due to random sampling.

Tags devops database machine-learning data-science software
How we use Apache Kafka and the Confluent Platform

Posted on June 4, 2019, Level intermediate Resource Length short

Jendrik Poloczek from confluent.io published this article about their experience building the core infrastructure to integrate, clean, and analyze blockchain data.

Tags blockchain apache apis data-science scala
Great engineer needs the liberal arts

Posted on May 23, 2019, Level beginner Resource Length medium

Thomas Betts wrote for infoq.com about how liberal arts eductaion can provide new insights and perspectives that shine a light on technical tasks for any software developer. E.g. empathy helps you know your audience and create great software that delights your customers.

Tags miscellaneous data-science learning programming
How to create histogram in Rlang

Posted on May 22, 2019, Level intermediate Resource Length short

In this article the author will show you how to create histogram in R using ggplot2 package. Written by Data Sharkie. When we get a new dataset for our analysis or research, often we would like to learn about the frequency of occurrence distribution of the variable of interest.

Tags analytics miscellaneous big-data cio data-science
Building self-served ETL pipeline for third-party data ingestion

Posted on April 18, 2019, Level intermediate Resource Length medium

An article by Nikolaos Tsipas from Skyscanner with help of colleagues Omar Kooheji and Michael Okarimia about how to solve the puzzle when there is a need to import datasets from external sources, and make them available for querying. Examples of imported data include: analytics metrics, advertising data, and currency exchange rates, all of which are used by Skyscanner engineers and data scientists.

Tags big-data data-science software-architecture
Google's EdgeTPU benchmarked vs Intel's Movidius

Posted on March 24, 2019, Level beginner Resource Length short

An article written by Frederik Bode about the first benchmark of Google's EdgeTPU Dev Board is in. Read about comparison is made against Intel's (first generation) Movidius Neural Compute Stick, and Google is the clear winner regarding inference time.

Tags big-data data-science analytics machine-learning
The data science behind Natural Language Processing

Posted on March 22, 2019, Level beginner Resource Length medium

John Thuma published this piece about the data science behind Natural Language Processing (NLP). Human communication is one of the most fascinating attributes of being sentient. We communicate in a variety of ways including speech and written symbols.

Tags miscellaneous big-data data-science learning

Tag: Data science

Microsoft's Quantum Development Kit goes open source on GitHub

Tags big-data open-source cloud machine-learning data-science software

Searching for ET using AI on GCP

Tags big-data analytics cloud machine-learning data-science

The differences between Bitcoin and Libra should matter to policymakers

Tags crypto fintech data-science

Finding CRAN packages right from the R console

Tags programming big-data data-science

Image recognition in Python with TensorFlow and Keras

Tags python big-data data-science

Look at how Twitter handles its time series data ingestion challenges

Tags devops database machine-learning data-science software

How we use Apache Kafka and the Confluent Platform

Tags blockchain apache apis data-science scala

Great engineer needs the liberal arts

Tags miscellaneous data-science learning programming

How to create histogram in Rlang

Tags analytics miscellaneous big-data cio data-science

Building self-served ETL pipeline for third-party data ingestion

Tags big-data data-science software-architecture

Google's EdgeTPU benchmarked vs Intel's Movidius

Tags big-data data-science analytics machine-learning

The data science behind Natural Language Processing

Tags miscellaneous big-data data-science learning