Skip navigation EPAM

Senior Data DevOps Engineer Remote

Senior Data DevOps Engineer Description

We are seeking a highly skilled and experienced Senior Data DevOps Engineer to join our remote team. As an expert in Data DevOps, you will be responsible for installing, monitoring, troubleshooting, and maintaining Kafka platform, ensuring optimal performance and security while developing new features, automation, and integration. The role involves supporting PagerDuty and Uptrends SaaS, including automation of these platforms support.


#LI-DNI#EasyApply

Responsibilities

  • Install and provision new Kafka clusters and supporting components
  • Regularly monitor the health and performance of the Kafka platform and data pipelines
  • Identify and fix issues related to the platform, including data pipelines, network problems, cloud or containerization resources failures, or software bugs
  • Perform regular performance tuning of Kafka platform components
  • Monitor and optimize the cost and performance of Kafka clusters
  • Upgrade the Kafka platform to newer versions, including planning, testing, and implementation
  • Manage the security of the Kafka platform, including access control lists, encryption, and regular security reviews
  • Perform regular backups and disaster recovery procedures
  • Manage the capacity of the Kafka platform, including projecting future growth and scaling needs
  • Document procedures, configurations, and issue resolutions, and share knowledge with the team

Requirements

  • 3+ years of relevant work experience
  • Proven experience in the implementation and maintenance of Confluent Platform
  • Relevant certifications (such as Confluent Certified Developer or Administrator for Apache Kafka) would be an advantage
  • Strong knowledge of HELM
  • Experience in automating processes and maintaining automated scripts
  • Understanding of networking and capability to coordinate with different teams including the Networking team and CICD team, among others
  • Strong written and verbal communication skills
  • Excellent problem-solving skills and the ability to troubleshoot complex issues
  • B2+ English level proficiency

Nice to have

  • Experience working in a large-scale global organization on a cloud environment
  • Familiarity with other cloud platforms, such as AWS or GCP

在亿磐成长

周剑
解决方案架构师
苏州

朱晓华
首席软件测试工程师
苏州

金秋
首席软件工程师
苏州

我们在世界其他地方。。。