Treasuredata put together a post comparing Presto and Hive.
Airbnb has just open sourced some new tools to use with the Presto database.
Not only does Google BigQuery offer up computing power on-demand, but also public data sets to analyze independently or combine with your own data. Here is Information about Google BigQuery Public Datasets – article by KDNuggets
Greenplum is being open sourced – article from DBMS2. Greenplum is a massively parallel (MPP) relational database for analysis that’s based on PostgreSQL.
OpenStack comes up huge for Walmart zite.to/1E2UDoq
Apache Storm and Kafka Together: A Real-time Data Refinery zite.to/1ClMCpS
Facebook Open Sources deep-learning modules for Torch zite.to/1ESLiAO
Medium open sources a data visualization tool that’s pretty handy zite.to/1xP6PpA
LinkedIn and Twitter Contribute Machine Learning Libraries to Open Source. Check out the InfoQ article.
New open-source Machine Learning Framework written in Java zite.to/1t3YQEc