Welcome to the BigDataDwg web

The purpose of the OGC Big Data DWG is to provide an open forum for work on Big Data interoperability, access, and analytics. To this end, the group will pursue collaborative information collection and liaisons with other Big Data working groups. It will consolidate its findings on this public wiki, and later in a Best Practice paper, to inform OGC and the wider public. This wiki is therefore also meant to encourage and collect feedback.

This TWiki and the corresponding email list provide a public forum for communication. After registering, anyone can contribute to this wiki, provided they do so responsibly. Formatting instructions can be found on the TWiki Text Formatting Rules page.

Background

“Big Data” is an umbrella term coined by Doug Laney and IBM several years ago to denote data that pose problems along several dimensions, summarized as the four Vs:

Volume – the sheer size of “data at rest”

Velocity – the speed of new data arriving (“data in motion”)

Variety – the manifold different kinds of data

Veracity – trustworthiness and issues of provenance

Several additional Vs have been suggested in the meantime, including value, verisimilitude, and visualization. It seems generally accepted that a core challenge is performing rapid, flexible analytics on Big Data.
Major Big Data efforts are under way worldwide, involving and affecting science, industry, government, and citizens alike. A wide range of research and development is in progress, mobilizing massive financial resources.

This development calls for a response from OGC, in particular because location-based and geo data are substantial contributors to the Big Data deluge. Further, with the rise of machine-to-machine communication, interoperability is becoming even more important. OGC should therefore establish a position addressing all levels, including – but not limited to – science, implementation, market value, and societal effects.

BigData.DWG Goals

The OGC Big Data DWG will focus specifically on spatio-temporal data, in line with OGC’s mission. With the same imprecision as the term “Big Data” itself, we give such data the working title “Big Earth Data” for now.

What does Big Earth Data mean in an OGC context? What characterizes such data?

What are the challenges, if any, of Big Earth Data for OGC’s data and service interface specifications?

What is the market value of Big Earth Data, and how can OGC support leveraging it?

Thus, the BigData.DWG will aim to clarify foundational terminology in the context of data analytics, including how it differs from and overlaps with terms such as data analysis and data mining. Further, it will establish a systematic classification of analysis algorithms, analytics tools, data and resource characteristics, and scientific queries.

Key Activities

The following activities of the BigData.DWG have been identified initially:

Establish a working communication infrastructure, including a public wiki.
