Key Responsibilities
Operate and manage our large-scale infrastructure in the cloud.
Build cloud automation and internal tools.
Monitor and support the 24/7 operation, build dashboards and reporting metrics.
Continually perform analysis and capacity planning for our clusters.
Troubleshoot, triage staging and address production issues.
Participate in 24/7 on-call rotations.
Experience and knowledge:
Hands-On Experience working with AWS: VPC, EC2, ECS
Experience in supporting 24×7 environment
At least 3 years of relevant work experience in implementing and maintaining highly available enterprise infrastructure solutions and services
Expert knowledge of Windows Systems Administration, best practices, security hardening
Hands-on experience in continuous integration and building tools along with version control systems such as Git.
Hands-on experience in provisioning and configuration management tools (e.g Ansible, Chef, etc.)
Solid understanding of the CI/CD process.
Knowledge in Active Directory, Group Policy
Knowledge in McAfee software (EPO, SIEM)
Experience working with monitoring and log analyze tools
Required Non-Technical Skills and Experience:
Strong professional communication skills in English (written and verbal).
Strong trouble shooting Skills
Team Player. Bring a positive attitude and a very developed customer service orientation
Strong analytical and problem solving skills.
Reliable, resilient, organized, detail-oriented, independent and self-motivated individual
Self-starter, fast learner, able to prioritize and manage time effectively.
Good end-to-end system view
Works well under pressure and deadline