Site Reliability Engineering Manager, Storage - Apple Cloud Services

Apple Inc., City of Westminster

Salary not available. View on company website.

  • Full time
  • Permanent
  • Onsite working

Posted 3 days ago, 11 Oct

Closing date: not specified

Job ref: a87cf9bd40f94d9fb0393f334564f6c3

Full Job Description

The Storage SRE organization is seeking a strong engineering leader to manage Storage-focused SRE teams, working closely with peer SRE teams and development partners. You'll help build and optimize the Storage stack from bare metal to the top of the application, helping design provisioning systems, code deployment, monitoring, and alerting, and drive performance improvements. Together with the team, you'll help run the storage used by some of Apple's largest teams.

  • Proven experience in a leadership role within an SRE or DevOps team, specifically focused on distributed storage.
  • Strong background in distributed systems, storage architectures, and data management.
  • Deep knowledge of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
  • Lead initiatives to enhance the scalability and performance of distributed storage systems.
  • Collaborate with engineering teams to design and implement robust and scalable storage solutions.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Experience with Kubernetes, Docker, and containerization.
  • Proficient in at least one of these programming languages: Golang, Java, or Rust.
  • Knowledge of distributed storage (block storage) or similar large-scale distributed databases.
  • Familiarity with CI/CD pipelines and infrastructure as code (Terraform, Ansible).
  • Knowledge of security best practices and compliance requirements in storage systems.
  • Understanding of data durability, consistency models, and storage performance optimization techniques.