experience and education
• bachelor’s degree in engineering or equivalent work experience.
• 7+ years of infrastructure and operations management experience at a global scale.
• 7+ years of experience in operations management, including monitoring, configuration management, automation, backup, and recovery.
• broad experience with data center, networking, storage, server, linux, and cloud technologies.
• broad knowledge of release engineering: build, integration, deployment, and provisioning, including familiarity with different upgrade models.
• demonstrable experience executing, or being involved in, a complete end-to-end project lifecycle.
skills
• excellent oral and written communication skills and strong teamwork.
• skilled at collaborating effectively with both operations and engineering teams.
• process and documentation oriented.
• attention to detail.
• excellent problem-solving skills.
• ability to simplify complex situations and lead calmly through periods of crisis.
• experience implementing and optimizing operational processes.
• ability to lead small teams: provide technical direction, prioritize tasks to achieve goals, identify dependencies, report on progress.
technical skills
• strong fluency in linux environments is a must.
• good sql skills.
• demonstrable scripting/programming skills (bash, python, ruby, or go) and the ability to develop custom tool integrations between multiple systems using their published apis and clis.
• l3, load balancer, routing, and vpn configuration.
• kubernetes configuration and management.
• expertise using version control systems such as git.
• configuration and maintenance of database technologies such as cassandra, mariadb, elastic.
• design and configuration of open-source monitoring systems such as nagios, grafana, or prometheus.
• design and configuration of log pipeline technologies such as elk (elasticsearch, logstash, kibana), fluentd, grok, rsyslog, google stackdriver.