Check out this article about how to boost your data munging with R.
Here’s an article from IBM about the power of machine learning in Spark.
Learn more about image processing with Python.
This article from Domino Data Lab compares the performance of feather vs. data.table vs. readr vs. the saveRDS/write_rds functions.
Check out this article about a tool that helps automate the machine learning pipeline.
Learn how Airbnb uses R to scale its data science.
R-Bloggers has some tips on doing principal component analysis (PCA) in R.
Yelp lands its log data in Amazon S3. They’ve open-sourced a couple of tools that help process the data and load it into Amazon Redshift.
Here’s a detailed article from Cloudera about how to use Spark Streaming to build a near-real-time dashboard.