Greenplum is being open sourced – article from DBMS2. Greenplum is a massively parallel processing (MPP) relational database for analytics that’s based on PostgreSQL.
Tag Archives: greenplum
MADlib goes beta! Serious in-database analytics
MADlib goes beta! Serious in-database analytics zite.to/1gmCx7i
Chorus In Action at Data Science London
Chorus In Action at Data Science London zite.to/14HBR5z
Videos from GigaOm Structure Big Data Conference
Here are the videos from the GigaOm Structure Big Data conference that just concluded in New York. One of the talks featured Greenplum founder Luke Lonergan. And here’s another about open source’s role in big data.
I’m interested to hear how this conference compared to the O’Reilly Strata Conference.
Cloudera and EMC Greenplum form data warehouse alliance
Cloudera has formed an integration alliance with Greenplum. Cloudera will integrate its distribution of Hadoop with Greenplum’s Chorus product. Read more at ZDNet.
New big data site
Greenplum and Aster Data have launched bigdatanews.com as a collaborative site for data warehouse, business intelligence, and analytics industry experts.
Greenplum offers single node edition for free
Database vendor Greenplum is now offering a free download of the single node version of its database. It is available for several different operating systems. Here’s a DBMS2 article with some more information.
In another DBMS2 article, there’s some information about the pace of Greenplum’s recent customer acquisitions, which bring its total to 100+ customers as of this quarter.
And finally, here’s some information about Greenplum’s pricing: either subscription or perpetual.
Column store dbs versus rows
Vertica has been touting the value of the columnar data store over the more traditional row structure of a database. Now Greenplum is offering its own column structure. The twist is that Greenplum allows for both row and column structures within the same database. Check out more information in this DBMS2 article.
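For anyone new to the row versus column distinction, here is a minimal Python sketch of the general idea. It is not Greenplum or Vertica code, and the table, field names, and values are made up for illustration. It only shows why an aggregate over one field can skip the other fields when each column is stored separately.

```python
# Hypothetical three-row table; names and values are illustrative only.
rows = [
    {"user_id": 1, "page": "/home", "ms": 120},
    {"user_id": 2, "page": "/cart", "ms": 340},
    {"user_id": 3, "page": "/home", "ms": 95},
]


def total_ms_row_store(table):
    # Row layout: each record is kept together, so summing one field
    # still walks every full record.
    return sum(record["ms"] for record in table)


def to_column_store(table):
    # Column layout: each field becomes its own list of values.
    return {name: [record[name] for record in table] for name in table[0]}


def total_ms_column_store(columns):
    # Only the "ms" column is read; the other columns are never touched.
    return sum(columns["ms"])


columns = to_column_store(rows)
print(total_ms_row_store(rows), total_ms_column_store(columns))
```

Both functions return the same total. The difference is how much data has to be read to get there, which is the usual argument for column stores on analytic queries and for row stores when whole records are fetched or updated.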
Web analytics databases keep getting bigger
Who has the biggest database? Due to the increasing amount of behavioral information tracked during a web browsing session, some internet properties are starting to rack up some pretty hefty databases.
eBay has a 6.5-petabyte Greenplum warehouse and a 2.5-petabyte Teradata warehouse. This system ingests hundreds of billions of new rows of data every day.
Facebook has a 2.5-petabyte Hadoop system.
Yahoo has more than 1 petabyte running on its homemade system.