comparison

Definition

As you have seen, Big Data is not only about building a data store that can easily accommodate terabytes, petabytes or even more, but is also about the tools that you can use to explore, analyze and feed it.
In this section, we will focus on the ingestion layer or the data pipeline.

These data stores are shemaless or schemafree, meaning that the records in the same logical container (table or collection or ...) can be of a different structure each. In other words, two consecutive records can have different number of columns, each of different type. More, each column can hold another record with its own set of columns, creating nested records.

Apache HBase

Positive

Negative

Storage is a key component of any system. We have heard a lot about storage.
But what is what ? Local storage versus remote storage.
Block storage, file storage, object storage, ... what's the differences ?
Let's try to raise a bit of the curtain on the storage aspect of your infrastructure.

One of the core components in any enteprise is the CMDB (Configuration Management DataBase). The CMDB is not just an inventory tools listing all the elements you have in your infrastructure, it is also a tool showing the dependencies between them.
Even in the case of small infrastructure, it is valuable to have in place a good inventory tools with dependency links between the elements.
In such CMDB tools, each componant is called "Configuration Item" or CI. A CI may be a server or a CPU in a server, a software, ...