GuruLink helps you stand out from the crowd.

Data Engineer

San Francisco, California
- Permanent

Job Description

Our client is on a mission to structure and understand the world's medical data. They are starting this process by using terabytes of data of clinician notes which contain Electronic Health Records of some of the world's largest healthcare systems.

We are currently on the lookout for Data Engineers to work on the product that drives the core of our clients business - you are a backend expert able to unify data and build systems that scale from both an organizational and operational perspective.

Some of your responsibilities will include, but not be limited too: -Developing data infrastructure to sanitize, ingest, and normalize a broad range of data, health records, journals, crowd-sourced labeling, medical ontologies, and other human inputs-Building performant and expressive interfaces to data-Building infrastructure to help scale up the data ingest, and build large-scale cloud-based machines

Must Have Skills:

-Building data pipelines from disparate sources-Hands on experience with building and scaling compute clusters-Building and supporting machine learning pipelines that scale, not just computationally, but in ways that are iterative, flexible, and geared for collaborating -Understanding of databases and large-scale data processing frameworks like Hadoop and/or Spark-You know how to pick the right tool for the specific job in front of you and are not afraid to try new things-Combination of creative & analytical skills-Capability to design a system that is able of pulling together, training, and testing dozens of data sources under a unified ontology