Systems Reliability Engineer

\nQntfy is looking for a talented and motivated SRE to join our ops team. You will be responsible for deploying, configuring, and maintaining the core systems and services that our software and business depends on. We need someone who is interested in designing sustainable, best-in-class infrastructure and reliability processes.\n\nWe move quickly and are not beholden to any single technology but we do have favorites. An ideal candidate will have experience with, or the ability to figure out quickly, tools like Mesos/Marathon, Kubernetes, Ansible, and Docker. As an SRE at Qntfy, you will have the freedom and responsibility to recommend and implement core architectural changes in support of our long-term technological vision. As to our stack, we have both on-premises and AWS deployments to manage and are looking to increase our use of Kubernetes.\n\nResponsibilities:\n\n\n* Help to determine production standards alongside software engineers from day 0.\n\n* Communicate with peers, customers, and partners to foster cooperation and development.\n\n* Design and implement the systems to support major new features for our platform.\n\n* Translate team needs into technical requirements and produce stable solutions.\n\n* Effectively estimate time to implement solutions.\n\n* Plan, execute, maintain and improve infrastructure.\n\n* Debug, automate, and monitor operations.\n\n* Record and make available postmortem records of incident response\n\n\n\n\nQualifications:\n\n\n* BS or Master’s degree in Computer Science/Engineering, related degree, or equivalent experience.\n\n* 3+ years experience with DevOps, SysAdmin, and/or datacenter operations.\n\n* Ability to architect and deploy services to support distributed systems while maintaining flexibility and high-quality documentation.\n\n* Strong work-ethic and passion for problem solving.\n\n\n\n\nPreferred Qualifications:\n\n\n* 3+ years work with Kubernetes, Docker, and/or Mesos/Marathon.\n\n* 3+ years working with public cloud infrastructure and tooling\n\n* Experience provisioning new systems in a reproducible and maintainable fashion (including the use of technologies like Ansible, Terraform, and Kops).\n\n* High level of proficiency with Linux systems and services.\n\n* Strong understanding of security best practices and their implementations\n\n* Experience with scripting languages\n\n\n

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.