The Open Source Analytics Invasion zite.to/1t1Svpn
Tag Archives: open source
PostgreSQL Database Modeler
PostgreSQL Database Modeler zite.to/10qzGAh
Here’s an open source tool to help you build models for PostgreSQL databases.
Pentaho’s new open source BI release
CMS Wire review’s Pentaho’s newest release of their open source BI software.
Rainbird, real-time analytics at Twitter
And here is the Techcrunch commentary.
Mallet, an open source machine learning application
Check out Mallet, a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
Creating visualizations with open source language, Processing
Learn about what you can do with the open source language, Processing, to build visualizations.
Twitter and Cassandra
Cassandra is an open sourced distributed database that’s part of the Apache project. It was originally developed at Facebook. Twitter has announced that they will continue to use MySQL to store tweets but will be using Cassandra to develop a real-time analytics capability. Read the rest in the Techcrunch article.
Is Cloudera the new Red Hat?
As the Open Source software movement continues the strengthen, questions abound about where the opportunities to create commercially viable solutions. Red Hat did it with Linux. Can Cloudera do it with Hadoop? Read this GigaOm article.
Tom White on running Hadoop in the cloud
Here’s a video of Tom White from the Hadoop Summit talking about running Hadoop in the cloud. Tom is the author of Hadoop the Definitive Guide.
The NoSQL movement
NoSQL is about open source, distributed, non-relational databases. At a recent meetup in San Francisco, some of the following new technologies were discussed…
Voldemort
Cassandra
Dynomite
HBase
Hypertable
CouchDB
Here’s a ComputerWorld article with more info.