Access Notes

Website

Description

Come join us for Pythonic talks!

Thanks to this month's sponsor, CloudBolt Software.

This month:

ETL testing with Python by Sev Leonard

ETL is the process of Extracting data from a source, Transforming it, and Loading it into storage. This process is commonly used to combine multiple data sources into data warehouses. In this talk Sev will give an overview of ETL with processes he designed for creating a data warehouse for medical data. The bulk of this talk will concern how Python and Pytest were used to test the ETL processes. This will include modeling ETL processes as objects, using test fixtures and markers for dynamic test suite generation, making tests extensible through parameterization, creating test data, and advice on navigating the pytest docs. This talk is targeted to intermediate Pythonistas interested in ETL, testing, or data magic.

Sev is a software developer and cat dad recently departed from his position at OHSU where he developed data management tools for cancer researchers and clinicians. He is coming soon to Nuna, a healthcare data and analytics startup as a Senior Software Engineer.

Looking for a job?
Your friends at CloudBolt want your help developing their state of the art Cloud Management software. Built with Django and ranked as the #1 product of its kind, CloudBolt is looking for talented engineers of all kinds in inner SE Portland.