Responsibilities:
• Lead a team of 6 excellent DataInfra Engineers.
• Design and implement BIG DATA CI/CD
• Deep knowledge and understating of the use cases running on our Hadoop clusters
• Design automation needed for the day-to-day teamwork
• Design and implement best practices/Monitoring/capacity planning of our technological stack
• Implement an infrastructure that will allow stakeholders to easily set up, operate and use various distributed systems
• Our tech stake: Hadoop, Vertica, Presto, Airflow, MySql, Presto and more.
• Requirements
• At least 3 years’ experience as a DBA/DevOps team leader – Must
• Good knowledge in Linux
• Experience with Python and/or Bash.
• Experience working with production distributed data pipelines.
• Experience in defining work methodologies with data stakeholders
• Hands-on experience in Linux internals and performance tuning – Linux sysadmin – Advantage
• Knowledge and experience in JVM languages Java/Scala/etc.- Advantage
• Cloudera administrator certification- Advantage