Sign up or log in to save this to your schedule and see who's attending!

A major financial institute supports its analytic workload via docker containers with access to R, Python, and SAS. This talk discusses how we built and deployed containers with the open source tools needed for their Data Scientists to complete their work. It discusses how containers enable their IT department to meet the ad hoc, compute intensive, and scaling demands of the organization. It also covers how provisioning thin clients via Jupyter Notebooks, R Studio, and SAS Studio, empowers Data Scientists with the tool of their choice. An exciting differentiator for the Data Scientist, is the ability to send a portion of the analytic workload to run inside their Hadoop cluster. Lastly, we discuss extending the container by pushing the analytic workload to run inside the Hadoop cluster; thus enabling the Data Scientists to dive inside the data lake and harness the power of all the data.

Doug Liming is an Enterprise Architect with SAS Institute in Cary, North Carolina. He was a DBA for 16 years before trading hats. He is now focusing on all things Hadoop and Hadoop within the enterprise. How Hadoop is the hub for the entire enterprise and how Open Source is now a major player.