This is yet another question for the users list.
On Jun 17, 2010 12:50 PM, <altanis@ceid.upatras.gr> wrote:
Hello,
I keep reading everywhere that Cassandra has supported multiple
datacenters from the beginning. I would like to know what Cassandra does
to achieve that. Is it just that the developers have written some code
that supports that scenario, or is there something inherent in Cassandra's
design that is suitable for a multi-DC environment, like minimizing
inter-DC traffic?
I have read about RackAwareStrategy on the wiki, and have also browsed
through some code (DataCenterShardStrategy), but I would like to see what
people have to say about this.
I also read about an implementation of Rack Awareness employing
Zookeeper, but I gather that wasn't released by Facebook and it was more
geared towards single-DC rack awareness because Zookeeper is a bit heavy
on the bandwidth.
Anyway, just to sum it up, my question is this: please explain in brief
the reasons why Cassandra is well suited for multi-DC environments.
Alexander Altanis