Search by Job Title

Site Reliability EngineerAppFolio

THIS JOB HAS EXPIRED

As a Site Reliability Engineer (SRE), you will help us grow, automate, and monitor the infrastructure platform we use to power AppFolio.

Your job will be to ensure that our production environment is secure and available, and that software is released and deployed in an efficient and streamlined manner. We are big believers in automation, and an important part of this job is to develop our suite of automated deployment and management tools. As a key member of our SaaS operations team, you share the responsibility of designing, deploying, and operating multiple data centers and host facilities.

Success in this role requires very strong system administration skills, an aptitude for distributed systems and attention to minute details. You need to have well developed network, systems and code-level troubleshooting abilities. You are expected to analyze complex system behaviors or performance problems, and need to be able to trace issues across multiple systems. The SRE works as a first responder and is ultimately responsible for the availability of AppFolio's systems.

Responsibilities:
Create reliable ways to ensure consistent, high uptime for all of our SaaS offerings
Identify and resolve system weaknesses that could impact our reliability
Lead and assist in troubleshooting system-wide problems
Analyze and advise on system reliability and security impacts
Develop automated tools for deploying our applications and maintaining our system infrastructure
Develop tools for tracking and analyzing system and application logs and events
Share the responsibility of monitoring and incident handling

Knowledge and Skills:
Strong familiarity with operating and troubleshooting distributed systems
Familiarity with running Ruby on Rails applications
Skilled in at least two of the following programing languages: Ruby, Java, C/C++, Perl, Bash, Puppet
Experience with Networking
Very comfortable with Linux
Strong analytical and troubleshooting skills
Enthusiasm for tackling complex problems
Ability to respond quickly to changing conditions under high pressure
Strong ability to multi-task
Ability to work cross-functionally in a dynamic environment
Ability to follow through on tasks until completed
High degree of flexibility
BS, MS or Ph.D in Computer Science or related fields