Responsibilities
Own the company’s production environment, including a complex architecture with multiple virtual servers, deployments & various cloud technologies
Manage the availability, latency, scalability and efficiency of our network services by engineering reliability into software and systems
Respond to and resolve emergent service problems; build tools and automation to prevent problem recurrence
Review and influence new and evolving design, architecture, standards, and methods for operating services and systems
Participate in software and system performance analysis and tuning, service capacity planning and demand forecasting
Requirements
At least 3 years of experience working with and administering Linux systems, with a deep understanding of systems & networking
Proven programming and scripting experience – e.g. Python, Go, Ruby, Bash
Knowledge of networking and internet technologies – e.g. HTTP servers, DNS, switch/router administration, firewalls, proxies, etc.
Experience with and love for monitoring, logging and metrics – e.g. Sensu, Nagios, InfluxDB, Graphite, Grafana, etc.
Excellent communication and teamwork skills
Ability to work in a dynamic, multitasking environment with frequent context switching
Willingness to participate in on-call support of our 24×7 production environment
Preferred
Knowledge of and a passion for configuration management tools such as Ansible, Chef and Puppet
Experience with cloud IaaS providers such as AWS, Azure and Google Cloud, and with SaaS in general
Experience with Docker & container technology, the ELK stack, Jenkins administration, Git administration, Slack bot development