Add comparison

Likelihood to Recommend

OpenText Analytics

If you do not have a large budget and are a large organization, I would steer clear of Actuate. If you are looking to do very complex washboarding, I would not use them. Your developers have to be very skilled to work with this. Plan to bring in consultants if necessary to help your process. Adhoc reporting is weak. If your pricing is user based and you expand, this could be very expensive.

Hadoop

Hadoop is very well suited for big data modeling problems in various industries like finance, insurance, healthcare, automobiles, CRM, etc. In every industry where you need data analysis in real time, Hadoop is a perfect fit in terms of storage, analysis, retrieval, and processing. It won't be a very good tool to perform ETL (Extract Transform Load) techniques though.

The distributed replicated HDFS filesystem allows for fault tolerance and the ability to use low cost JBOD arrays for data storage.

Yarn with MapReduce2 gives us a job slot scheduler to fully utilize available compute resources while providing HA and resource management.

The hadoop ecosystem allows for the use of many different technologies all using the same compute resources so that your spark, samza, camus, pig and oozie jobs can happily co-exist on the same infrastructure.

Without Cloudera as a management interface the hadoop components are much harder to manage to ensure consistency across a cluster.

The calculations of hardware resources to job slots/resource management can be quite an exercise in finding that "sweet spot" with your applications, a more transparent way of figuring this out would be welcome.

A lot of the roles and management pieces are written in java, which from an administration perspective can have there own issues with garbage collection and memory management.