Experience and Education
Bachelors degree in engineering or equivalent work experience.
7+ years of infrastructure and operations management experience at a global scale.
7+ years of experience in operations management, including monitoring, configuration management, automation, backup, and recovery.
Broad experience in the data center, networking, storage, server, Linux, and cloud technologies.
Broad knowledge of release engineering: build, integration, deployment, and provisioning, including familiarity with different upgrade models.
Demonstratable experience with executing, or being involved of, a complete end-to-end project lifecycle. Skills
Excellent communication and teamwork skills both oral and written.
Skilled at collaborating effectively with both Operations and Engineering teams.
Process and documentation oriented. Attention to details. Excellent problem-solving skills.
Ability to simplify complex situations and lead calmly through periods of crisis.
Experience implementing and optimizing operational processes.
Ability to lead small teams: provide technical direction, prioritize tasks to achieve goals, identify dependencies, report on progress.
Strong fluency in Linux environments is a must.
Good SQL skills. Demonstratable scripting/programming skills (bash, python, ruby, or go) and the ability to develop custom tool integrations between multiple systems using their published APIs / CLIs.
L3, load balancer, routing, and VPN configuration.
Kubernetes configuration and management.
Expertise using version control systems such as Git.
Configuration and maintenance of database technologies such as Cassandra, MariaDB, Elastic.
Designing and configuration of open-source monitoring systems such as Nagios, Grafana, or Prometheus.
Designing and configuration of log pipeline technologies such as ELK (Elastic Search Logstash Kibana), FluentD, GROK, rsyslog, Google Stackdriver.