Big Data Engineering at the NCSR-Demokritos -- Vangelis Karkaletsis

﻿In this talk we will present our activities, addressing challenges that arise when applying data analytics to heterogeneous and large-scale data. Heterogeneity and transparent distribution are the focus of the SemaGrow project (http://semagrow.eu/), approached as federated querying that is transparently optimized and where semantic transformations are dynamically applied. The outcome is a stack of technologies that simplify both the inclusion of heterogeneous data sources to a federated end-point and the development of client applications for this end-point. The SemaGrow Stack will be integrated in the Big Data Aggregator that is developed in the recently started Big Data Europe project (http://www.big-data-europe.eu/). The Big Data Aggregator will be piloted on diverse and challenging use cases defined by domain experts across the board of data-intensive science and technology.