SRE Engineer, ASE, London

Apple Inc.

Salary Not Specified

Apple Inc., City of Westminster

  • Full time
  • Permanent
  • Onsite working

Posted 3 weeks ago, 30 Aug

Closing date: Not specified

Job Ref: 59794ddf42ea4b12acde50ff520deebb

Full Job Description

As a Site Reliability Engineer, you will be responsible for providing the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence, quality and detail. The SRE will not only support operations, but also work closely with the developers and architects within the team to aid in the design and assist with the implementation to improve stability, security and scalability.

As an SRE at Apple, you will:

  • Operate, monitor, and triage all aspects of our production and non-production environments.
  • Design, build and implement innovative solutions for previous, present and future issues.
  • Prepare alert handling procedures and runbooks, and collaborate with the off-shore SRE teams.
  • Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
  • Actively participate in capacity planning, scale testing, and disaster recovery exercises.
  • Interact with and support partner teams, including engineering, QA, and program management.
  • Cultivate and maintain relationships with internal and external third-party vendors.

  • In-depth experience in a Site Reliability Engineering, DevOps, or infrastructure-focused role

  • Expert-level, in-depth professional experience working with Kubernetes

  • Experience operating large-scale, multi-tenant infrastructure as a managed service

  • Able to troubleshoot issues across the entire infrastructure stack

  • Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and Prometheus

  • Outstanding organizational and communication skills

  • Proficient in Go (Golang)

  • Knowledge of the Linux operating system and its variations

  • Experience with GitOps, deployment strategies, and CI/CD tools such as Spinnaker and Argo