Greenplum is being open sourced – article from DBMS2. Greenplum is a massively parallel (MPP) relational database for analysis that’s based on PostgreSQL.
MADlib goes beta! Serious in-database analytics zite.to/1gmCx7i
Chorus In Action at Data Science London zite.to/14HBR5z
Here are the videos from the GigaOM Big Data Conference that just concluded in New York. One of the talks featured Greenplum founder Luke Lonergan. And here’s another about open source’s role in big data.
I’m interested to hear how this conference compared to the O’Reilly Strata Conference.
Cloudera has formed an integration alliance with Greenplum. Cloudera will integrate its distribution of Hadoop with Greenplum’s Chorus product. Read more at ZDNet.
Greenplum and Aster Data have launched bigdatanews.com as a collaborative site for data warehouse, business intelligence, and analytics industry experts.
Database vendor Greenplum is now offering a free download of the single-node version of its database. It is available for several different operating systems. Here’s a DBMS2 article with some more information.
In another DBMS2 article, there’s some information about the pace of Greenplum’s recent customer acquisitions, which bring its total to 100+ customers as of this quarter.
And finally, here’s some information about Greenplum’s pricing: either subscription or perpetual.
Vertica has been touting the value of the columnar data store over the more traditional row structure of a database. Now Greenplum is offering its own column structure. The twist is that Greenplum allows for both row and column structures within the same database. Check out more information in this DBMS2 article.
Who has the biggest database? Due to the increasing amount of behavioral information tracked during web browsing sessions, some internet properties are racking up pretty hefty databases.
eBay has a 6.5 petabyte Greenplum warehouse and a 2.5 petabyte Teradata warehouse. This system ingests hundreds of billions of new rows of data every day.
Facebook has a 2.5 petabyte Hadoop system.
Yahoo has more than 1 petabyte running on its homemade system.