Skip navigation EPAM

Senior Data DevOps Engineer Remote

  • hot

Senior Data DevOps Engineer Description

DESCRIPTION



We're seeking a Senior Data DevOps Engineer to join our team and play a crucial role in shaping the future of our CVML (Computer Vision and Machine Learning) platform.
If you're an experienced engineer with a passion for innovation, this could be the perfect opportunity for you.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

#REF_CL23_AR

Responsibilities

  • As a Senior Data DevOps Engineer a EPAM, you'll be tasked with a range of responsibilities that are integral to our platform's success. Your day-to-day tasks will include a
  • Crafting Infrastructure as Code: Develop Terraform and Terragrunt configurations to efficiently manage our infrastructure. Work on GitHub Actions workflows to streamline our development processes
  • Data Access Permissions: Troubleshoot and resolve data access permission issues, particularly within AWS S3 and AWS IAM
  • Kubeflow Mastery: Take ownership of our Kubeflow ML pipelines, resolving issues related to CPU, memory, GPU, and permissions
  • Scripting for Automation: Utilize your programming skills in Python and Golang to develop scripts for various automation tasks, contributing to the efficiency and scalability of our platform

Requirements

  • Terraform and Terragrunt Proficiency: You should be highly skilled in using these tools for infrastructure management
  • Kubernetes Expertise: Deep knowledge of Kubernetes, including experience with AWS EKS and KubeSpray
  • AWS Mastery: A strong command of AWS services, from network management to LoadBalancer and IAM
  • Istio Understanding: Familiarity with Istio, including sidecars, mTLS (mutual TLS), and the ingress gateway
  • Monitoring Skills: Experience with monitoring tools like Prometheus and Grafana
  • Programming Prowess: Strong Python programming skills
  • GitHub Proficiency: Familiarity with GitHub and GitHub Actions

Nice to Have

  • Distributed Tracing: Familiarity with distributed tracing tools such as Zipkin and Istio
  • Golang Proficiency: A background in Golang programming
  • Kubeflow Knowledge: Experience with Kubeflow, a popular ML platform
  • Pulumi Expertise: Familiarity with Pulumi for infrastructure as code

We Offer

  • Health Insurance
  • Life Insurance (SVO)
  • Occupational Risk Insurance (ART)
  • Paid Time Off – Vacations. 14 calendar days a year, the number of days will increase by seniority based on local law rules
  • Sick leave
  • Exceptional Leave. Take paid time off for your major life changes (childbirth, marriage, etc.)
  • Compensation of costs for internet, electricity, and personal laptop usage (if applicable)
  • Stable full-time workload
  • Thousands of projects for top brands
  • Stable income
  • Referral Program
  • Certification opportunities
  • Unlimited access to LinkedIn learning solutions
  • Language courses
  • Relocation Assistance Package

Conditions

  • By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM´s Privacy Notice and Policy

在亿磐成长

周剑
解决方案架构师
苏州

朱晓华
首席软件测试工程师
苏州

金秋
首席软件工程师
苏州

我们在世界其他地方。。。