Cassandra is a highly-distributable NoSQL database with tunable consistency. What makes it highly distributable makes it also, in part, vulnerable: the whole deployment must run on synchronized clocks. It’s quite surprising that, given how crucial this is, it is not covered
Data tiedowns with reliable time stamping
Management teams are growing more reliant on the ability to immediately access and quickly sort through massive amounts of data to find the information they need – Data Governance for Financial Institution A “data tiedown” is a reliable and cross-checked timestamp
Time maps for visualizing time distribution networks
Without this information, a time distribution network is a black box. TimeKeeper looks at all the sources of time, in any protocol, that it can see around it, and tracks each source back to its source. These maps, especially in
Obama’s big data team
Those applications used a huge chunk of Amazon’s Eastern data center. “We had thousands of nodes,” said VanDenPlas. “We pushed 180TB of traffic with billions and billions of requests. We had 60% of all of Amazon’s medium [instances] in US
Big Data and politics
For the general public, there was no way to know that the idea for the Parker contest had come from a data-mining discovery about some supporters: affection for contests, small dinners and celebrity. But from the beginning, campaign manager Jim
Julia programming language and hadoop
While we’re being demanding, we want something that provides the distributed power of Hadoop — without the kilobytes of boilerplate Java and XML; without being forced to sift through gigabytes of log files on hundreds of machines to find our