Data By the Bay is the first Data Grid conference matrix with 6 vertical application areas spanned by multiple horizontal data pipelines, platforms, and algorithms. We are unifying data science and data engineering, showing what really works to run businesses at scale.

Some technologies for building data visualizations lend themselves to dynamic applications and interactivity (D3, HighCharts). Other technologies offer a lot of flexibility and precision (SQL), ease of use (SAP, Excel), or breadth of visualization types (Tableau, Stata). Exploring data at varying levels of aggregation remains a challenge for all of these tools. This talk will explore use cases that require varying levels of aggregation within a single visualization, along with tools, techniques, and technologies to support those visualizations. Examples will include selection techniques in SQL, scripts that prepare data for D3 visualizations, and using Excel for prototyping and checking conclusions. ClearStory Data has used a combination of Spark, D3, and React to create a web-based application that makes data combination and exploration clear, interactive, and maintainable even for the largest data sets. This talk will also discuss findings specifically relevant to supporting interactivity and clarity when exploring data at varying aggregation levels.
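
As a rough illustration of the kind of data preparation script the abstract mentions, the sketch below computes sums at several aggregation levels at once (grand total, region, city) and emits them as one JSON list that a D3 front end could filter by level. It is an assumption-laden example, not the method presented in the talk: the `rollup` function, the sample `sales` rows, and the `level` and `path` field names are all hypothetical.

```python
import json
from collections import defaultdict


def rollup(rows, dims, measure):
    """Sum `measure` at every prefix of `dims`, from the grand total (level 0)
    down to the finest grouping (level len(dims))."""
    totals = defaultdict(float)
    for row in rows:
        # Each row contributes to one group per level: (), (region,), (region, city)
        for depth in range(len(dims) + 1):
            key = tuple(row[d] for d in dims[:depth])
            totals[key] += row[measure]
    # Sorting the tuple keys places each parent group before its children
    return [
        {"level": len(key), "path": list(key), measure: value}
        for key, value in sorted(totals.items())
    ]


# Hypothetical sample data, invented for illustration only
sales = [
    {"region": "West", "city": "Oakland", "amount": 120.0},
    {"region": "West", "city": "San Jose", "amount": 80.0},
    {"region": "East", "city": "Boston", "amount": 50.0},
]

levels = rollup(sales, ["region", "city"], "amount")
print(json.dumps(levels, indent=2))
```

Keeping every level in one flat list, tagged with its depth, lets a single visualization mix coarse and fine aggregates without re-querying the source. This is similar in spirit to SQL's `GROUP BY ROLLUP`, for databases that support it.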

Katherine Ahern manages the Analysis and Visualizations group at ClearStory Data, where she focuses on usability for complex analytic workflows, including getting accurate results combining diverse data sources. Before coming to ClearStory she worked on a web-based analytics tool...