this for overseas recruitment. the position is based in yerevan, armenia.
job responsibilities:
• responsible for keeping all customer facing systems running smoothly while applying sound engineering principles, operational discipline, and mature automation
• be on a shift rotation to respond to availability and performance incidents and provide first line support
• use your shift to hunt for potential issues and prevent incidents by debugging production issues across all levels of the application stack
• improve the software deployment process to make it as reliable as possible while engaging with software developers
• continuously enhance monitoring and alerting and focus on symptoms and not on outages
• participate in post incident reviews, document findings and automate self healing jobs to reduce mttr
requirements
• familiar with the linux shell for administration and troubleshooting
• familiar with the usage of configuration management systems like chef, ansible, puppet
• have programming skills - ruby, go, python
• have experience with nginx, haproxy, docker, kubernetes, terraform, or similar technologies
• ability to use gitlab
• familiarity with monitoring tools such as elk, grafana, application performance monitoring and packet trace analysis tools (. wireshark)
• hunting mentality for system uptime and performance - explore edge cases, failure modes, behaviors, specific implementations.
• at least 5 years of experience in it infrastructure or software development
• ba/bs in computer science, engineering or related technology field