Software Developer - Operations and Reliability

Montreal, QC, Canada

Full-time

Company Description

About MPC Film

The Moving Picture Company (MPC) has been a global leader in visual effects (VFX) for over 25 years and counting, with established facilities across the globe. We are renowned for adding visual wonder and creative expertise to the film, advertising and entertainment industries, with some of our most famous projects including the blockbuster movies Blade Runner 2049, The Jungle Book, and The Martian, and advertising campaigns for Channel 4, Sony and IKEA.

MPC is part of Technicolor’s premiere portfolio of VFX brands and services, working alongside The Mill, Mr. X and Mikros to provide offerings tailored to the specific creative and storytelling needs of each project. Within this collective, MPC has built a reputation for industrial-scale production of images, and the software development to enable this.

About The Team and Our Work

Our status as a world-class VFX facility has been achieved through the development of industry-leading software which empowers our artists to create stunning imagery. We have curated a considerable portfolio of off-the-shelf and in-house software to meet these unique requirements, and continue to evolve and improve our technology as new needs emerge. The Core Engineering team operates within a larger R&D devision to provide the business critical infrastructure that enables multiple projects with thousands of shots to be simultaneously completed across the globe efficiently and to the highest quality.

A blend of globally distributed software, systems and operations experts, we are responsible for building and maintaining key infrastructure and services in collaboration with site-local engineering teams and other specialized development teams. With users in all areas of the company, our solution (the Core Platform) is based on a distributed micro-services environment providing capabilities in areas such as compute, storage, and digital asset management. Development and operational support spans the stack from operating system through to desktop/web application front-ends.

In recognition of the flexibility, scalability and maturity of the Core Platform that has been created by the team for MPC, we have been tasked with establishing the Core Platform elsewhere, making it available to other business units that are part of Technicolor’s portfolio.

Job Description

We are hiring for an Integration Engineer to work in the Core Engineering team.

As a member of the Core Engineering team you'll be joining a highly passionate team that is responsible for the efficiency, performance, availability and monitoring of MPC's micro-services framework, forming the backbone of MPC's VFX pipeline, as well as the services that surround it such as the monitoring platform and render farm.

This requires a mix of engineering skills ranging from systems knowledge, software architecture and scalability design. One day you will be designing and provisioning dashboards, the next trying to catch runaway threads with your bare hands and the following day assisting development teams in writing scalable code or improving the performance of the micro-services platform by fine tuning Python code.

The role is in a fast paced environment where at times going from 0 to 100 in a jiffy is expected. This requires a systematic approach to problem solving as well as being able to think several steps ahead. We are often the last line of defense in case of a problem. This means the problems reaching the team are the harder but also more interesting problems that often do not have a single solution.

To be able to diagnose and resolve some of these problems we depend heavily on monitoring and alerting and would like a person that is up to date with today's tooling. We use a variety of products currently (ELK, TICK, Opentracing) to collect metrics from our micro-services as well as our entire render farm, collecting thousands of metrics a second, which brings along its own scalability problems. We we would like someone that is familiar with the various stacks both from a usage perspective as well as a deployment perspective.

Running at this scale means we love automation to reduce the repetitive tasks and automate as much as possible to give the team the time to tackle the more interesting design problems to further our entire stack. We use a mix of Saltstack and Ansible.

The role is aimed at improving the systems already in place at MPC as well as helping with future developments to deploy these systems into more modern environments, such as Kubernetes, and different business units inside Technicolor each with their own challenges.

All in all this role covers a broad range of subjects, challenges and opportunities which requires someone that truly enjoys squeezing the last microsecond out of a thread, doesn't mind getting their hands dirty and take apart an entire system to really get a feel for how it is working and where possible provide more tooling, insight and automation.

If you recognize yourself in this team and are as passionate about efficiency, large scale problems and monitoring as we are and as our artists are about creating award winning pixels we'd truly love to hear from you.