Amazon CTO, Werner Vogels, announces the launch of their new cloud based relational database – the Amazon Relational Database Service (RDS).
Database vendor, Greenplum, is now offering a free download of the single node version of its database. It is available for several different operating systems. Here’s a DBMS2 article with some more information.
In another DBMS2 article, there’s some information about the pace of Greenplum’s recent customer acquisitions which bring it to 100+ as of this quarter.
And finally, here’s some information about Greenplum’s pricing: either subscription or perpetual.
Vertica has been touting the values of the columnar data store over the more traditional row structure of a database. Now Greenplum is offering its own column structure. The twist is that Greenplum allows for both row and column structures within the same database. Check out more information in this DBMS2 article.
Who has the biggest database? Due to the increasing amount of behavioral information tracked during a web browsing session, some internet properties are starting to rack up some pretty hefty databases.
Ebay has a 6.5 petabyte Greenplum warehouse and a 2.5 petabyte Teradata warehouse. This system ingests hundreds of billions of new rows of data every day.
Facebook has a 2.5 petabyte Hadoop system
Yahoo has more than 1 petabyte running on their homemade system
Werner Vogels, CTO of Amazon, provides these tips to keep your database simple and running fast on his blog at All Things Distributed.
Here’s a detailed write up on how eGroups (Yahoo Groups) evolved their utilization of MySQL.
Switching to open source can be a great way to reduce your cost and gain some independence. If you are considering migrating from Oracle to MySQL, here are 50 things you might want to consider first.
Netezza opens box, builds developer network for predictive analytics and high-demand, non-SQL analyses.
By Doug Henschen
Appliances are changing the game in data warehousing, and if Netezza has its way, its appliance will also bring big changes to predictive analytics and other forms of high-demand analytic and algorithmic analyses.
Broadening the appeal of its Netezza Performance Server (NPS), the vendor announced this week that it has extended the capabilities of the appliance by opening it up to non-SQL programming, with leading examples including predictive modeling, scoring for marketing segmentation, Monte Carlo simulation, image analysis, geospatial analysis and fuzzy text search.
“Many of these analyses are being done on supercomputers or big grids because of the complexity and performance requirements,” says Justin Lindsey, chief technology officer. “Our appliance is really a high-performance, parallel computing environment, so instead of letting developers just push SQL toward the data, we’re letting them push complex, programmatic functions.” These functions are typically written in fast languages such as C or C++, says Lindsey, but the programming could also be in Java or another language.
Thus, with a new NPS (or a field upgrade of an existing device), customers will be able to handle both warehousing/mart and high-demand analytic/algorithmic activities on a single appliance. And as in warehousing applications, the benefit will be faster processing.
“It took Catalina Marketing about seven hours to handle a particular marketing scoring exercise across billions of records, but by moving it into NPS, they’re going to be able to do it in about half an hour,” says Lindsey.
To accelerate its push into high-end analytics, the vendor has made available a scaled-down development server, a software development kit and partner training. It has also organized and announced a Netezza Developer Network open to customers, systems integrators, complementary technology partners and the academic community. Founding members of the network include SAS, SPSS, Epsilon, Catalina Marketing and Carnegie Mellon University, among others.
“We have granted about 30 partner organizations small development environments so they can open the appliance and do their own programming inside it,” says Ellen Rubin, vice president of marketing. “We’re using the developer network both to drive requirements and to stimulate creativity and innovation.”