Tag: Big data
-
Neural networks, manifolds, and topology
Posted on January 22, 2018, Level beginner Resource Length long
Christopher Olah older article about excitement and interest in deep neural networks because they've achieved breakthrough results in areas such as computer vision.
Tags big-data machine-learning
-
Supercharging visualization with Apache Arrow
Posted on January 7, 2018, Level beginner Resource Length medium
Article on KDnuggets™ about how Apache Arrow provides a new way to exchange and visualize data at unprecedented speed and scale. Despite the fact that interactive visualization of large data sets on the web has traditionally been impractical.
Tags big-data analytics data-science big-data
-
A primer on deep learning
Posted on December 29, 2017, Level beginner Resource Length medium
Post written by Jeremy Fain -- the CEO and co-founder of Cognitiv, the first neural network technology. In it he addresses what deep learning, machine learning and artificial intelligence is.
Tags big-data data-science
-
Web scraping with Puppeteer and Chrome Headless
Posted on November 22, 2017, Level beginner Resource Length short
Emad Ehsan put together article about how to get started with Web Scraping in Chrome Headless. Chrome Headless is going to be industry leader in Automated Testing of web applications. Puppeteer is the official tool for Chrome Headless by Google Chrome team.
Tags web-development big-data machine-learning
-
AI turns design sketches into source code
Posted on October 27, 2017, Level beginner Resource Length long
Dimitar Mihov via [tnw](https://thenextweb.com) published article about Artificial Intelligence (AI) implemented and built by Airbnb that turns design sketches into product source code. The company is currently developing a new AI system that will empower its designers and product engineers to literally take ideas from the drawing board and turn them into actual products almost instantaneously.
Tags big-data programming data-science
-
Apache Spark natural language processing library
Posted on October 22, 2017, Level beginner Resource Length long
Excellent community blog and effort from the engineering team at John Snow Labs, explaining their contribution to an open-source Apache Spark Natural Language Processing (NLP) library. Apache Spark is a general-purpose cluster computing framework, with native support for distributed SQL, streaming, graph processing, and machine learning.
Tags big-data data-science
-
Using Machine Learning to Predict Value of Homes On Airbnb
Posted on July 31, 2017, Level beginner Resource Length medium
Robert Chang piece on how data products have always been an instrumental part of Airbnb's service. However, engineers have long recognized that it's costly to make data products.
Tags machine-learning big-data
-
Rearchitecting Airbnb's Frontend
Posted on May 27, 2017, Level intermediate Resource Length long
Adam Neary's neat article about rethought the architecture for the JavaScript side of the codebase at Airbnb.
Tags big-data frontend
-
Testing Machine Learning Algorithms with K-Fold Cross Validation
Posted on May 17, 2017, Level intermediate Resource Length medium
Norbert Krupa wrote blog post on choosing a machine learning algorithm, then using a validation technique. He uses Talend Studio without hand coding.
Tags machine-learning big-data
-
The Algorithms Behind Probabilistic Programming
Posted on February 1, 2017, Level beginner Resource Length medium
This post by Mike gives a feel for the content in our report on probabilistic programming by introducing the algorithms and technology that make probabilistic programming possible.
Tags programming big-data
-
Data Exploration with Python, Part 1
Posted on January 26, 2017, Level intermediate Resource Length long
Tony Ojeda witnessed the lack of structure in conventional approaches in Exploratory data analysis, so he decided to document his own process in an attempt to come up with a framework for data exploration.
Tags big-data data-science
-
Analyzing Big Data with Twitter
Posted on January 25, 2017, Level intermediate Resource Length 15h+
UC Berkeley published their Course Lectures: Analyzing Big Data With Twitter. Bit older but still very good - published and available for free. Over 15+ hours of video lectures. These lecture notes simply summarized the course at a high level.
Tags big-data