The coolest open source databases on the planet (NOSQL + RDBMS)

A list of alternative open source databases you might want to consider. Given all the hype about NOSQL databases it’s hard not to have come across databases like MongoDB and Voldemort but there are lots of others.

MongoDB

MongoDB

It is an open source, high-performance, scalable, schema-free & document-oriented (JSON-like data schemas) database.

Used by shutterfly, foursquare, bit.ly, sourceforge, github and the new york times.

Hypertable

Hypertable

Hypertable is a high performance distributed data storage system designed to support applications requiring maximum performance, scalability, and reliability. It is modeled after Google’s BigTable and mostly focuses on large-scale datasets.

Apache CouchDB

Apache CouchDB

A document-oriented database that can be queried and indexed in a MapReduce fashion using JavaScript. CouchDB offers a RESTful JSON API which can be accessed from any environment allowing HTTP requests.

The biggest user I am aware of is the BBC, they gave a good presentation at this years QCon conference in London.

Neo4j

Neo4j Graph Database

It is an embedded, disk-based, and fully transactional Java persistence engine that stores data structured in graphs rather than tables.

Neo4j offer a massive scalability. It can handle graphs of several billion nodes/relationships/properties on a single machine and can be scaled across multiple machines.

Riak

Riak

Riak combines a decentralized key-value store, a flexible map/reduce engine along with a friendly HTTP/JSON query interface.

Oracle Berkeley DB

Oracle Berkeley DB

It is an embeddable database engine that provides developers with fast, reliable, local persistence with zero administration.

Oracle Berkeley DB is a library that links directly into your application & enables you to make simple function calls rather than sending messages to a remote server for a better performance.

Apache Cassandra

Apache Cassandra

Cassandra is a highly scalable second-generation distributed database that is used by Facebook, Digg, Twitter and Cisco.

It aims to provide a consistent, fault-tolerant & highly available environment for storing data.

Memcached

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

It is intended for use in speeding up dynamic web applications by alleviating database load. Memcached is used by almost all the large scale internet sites today in one form or another.

Firebird

Firebird

Firebird is a relational database that can run on Linux, Windows & various UNIX platforms.

It offers high performance and powerful language support for stored procedures and triggers.

Redis

Redis

Redis is an advanced fast key-value database written in C which can be used like memcached, in front of a traditional database, or on its own.

It has support for many programming languages & used by popular projects like GitHub and Engine Yard.

There is also a PHP client named Rediska for managing Redis databases.

HBase

Hadoop HBase

HBase is a distributed & column-oriented store which can also be called as the Hadoop database.

The project aims to host very large tables like “billions of rows, millions of columns”. Yahoo used Hadoop quite a lot to process high volumes of weblogs.

It has a  REST-ful web service gateway that supports XML, Protobuf, and binary data encoding options.

Keyspace

Keyspace

It is a consistently replicated, fault-tolerant key-value store that works in Windows OS.

Keyspace offers high availability by masking server/network failures & appearing as a single, highly available service.

4store

4store

4store is a database storage and query engine that holds RDF data.

It is written in ANSI C99, designed to run on UNIX-like systems & offers a high performance, scalable & stable platform.

MariaDB

MariaDB

MariaDB is a backward compatible, drop-in replacement branch of the MySQL® Database Server.

It includes all major open source storage engines + the Maria storage engine.

Drizzle

Drizzle

It is a fork of MySQL that focuses on being a reliable database optimized for Cloud and Net applications.

HyperSQL

HyperSQL

It is a SQL relational database engine written in Java.

HyperSQL offers a small & fast database engine which has in-memory and disk-based tables, supports embedded/server modes.

Also, it has tools such as a command line SQL tool & GUI query apps.

MonetDB

MonetDB

MonetDB is a  database system for high-performance applications in data mining, OLAP, GIS, XML Query, text & multimedia retrieval.

Persevere

Persevere

It is an object storage engine and application server (running on Java/Rhino) that provides storage of dynamic JSON data for rapidly develop data-driven JavaScript-based rich internet applications.

eXist-db

eXist-db

eXist-db is built using XML technology. It stores XML data according to the XML data model & features efficient, index-based XQuery processing.

Others

Advertisements

About petehanlon

CTO Moneysupermarket.com
This entry was posted in Uncategorized. Bookmark the permalink.

2 Responses to The coolest open source databases on the planet (NOSQL + RDBMS)

  1. mark says:

    Pete I totally agree. I have been using graph/triple datastores for a couple of years now and the absolutely rock.

    • petehanlon says:

      Hey Mark, which ones have you been using? Interested to hear how the different products standup.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s