Yelp lands their log data in Amazon S3. They’ve open sourced a couple tools that help process the data and load it to Amazon Redshift.
10 SQL Articles Everyone Must Read zite.to/19fbHvA
Hadoop’s new role: Adjunct data warehouse – article from GigaOm.
There are two main ways that Hadoop is currently being used within the enterprise:
- as a data lake – a place to drop all your data on the way to somewhere else
- a data warehouse – feeding data to analysts and report consumers
2015: The Year of Agile Data Warehousing zite.to/1zgS9S3
The problem of managing schemas radar.oreilly.com/2014/11/the-pr…
Bezos’s law signals it’s time to ditch the data center zite.to/VuVXgx
It’s great to see how other people are doing it. Check out the Data Warehouse and Analytics Infrastructure at Viki zite.to/Wb0Zjf.
Data Lakes vs Data Warehouses zite.to/1xqoByG