Senior Engineer - Cloud Development

Graphcore

Senior Engineer - Cloud Development

Salary Not Specified

Graphcore, Bristol

  • Full time
  • Permanent
  • Onsite working

Posted 2 days ago, 25 Sep | Get your application in today.

Closing date: Closing date not specified

job Ref: d42d4132e8c042ca81ecba4dc898ce7f

Full Job Description

We are looking for a Senior Engineer to join our Cloud Development Team. Working closely with our colleagues in Platform Engineering, Datacentre Operations and Product Development, you will optimise our fleet of groundbreaking AI systems. As part of our Platform Engineering organisation, you will be involved in the cloud integration, validation, performance benchmarking, optimisation, and development of our high-performance AI solutions. These include in-house AI systems alongside off-the-shelf high-performance servers, switches and storage solutions. This is a hand-on role requiring a proven background in infrastructure, cloud deployment using Infrastructure-as-Code, OpenStack and high-performance networking and storage systems. The successful candidate may have been working in an IT organisation, a datacentre, a cloud provider or as a developer of orchestration or cloud components.
The Platform Engineering team at Graphcore builds Graphcore products into large-scale AI solutions for our customers and within that, this team is responsible for providing such systems to our internal users via private clouds. Often these internal systems will be using and developing pre-release hardware and software.
Responsibilities

Partner with the system architecture and engineering teams to develop complete cloud-ready AI solutions based on Graphcores next-generation AI products.
Work with our Datacentre Operations Engineers to maintain the fleet of AI systems at peak performance in our private clouds.
Operate and extend existing OpenStack cloud services and contribute to the deployment and development of new ones, support internal end-users with application services on our private clouds.
Configure and test new Graphcore AI hardware and systems using Infrastructure-as-code as they are deployed in internal and external datacentres.
Drive corrective actions for systems that are not operating accurately, working with DC operations, Engineering and datacentres as required.
Develop tested and optimised configurations for our AI Cloud Reference Design.

The ideal candidate will bring extensive softwareengineering experience with a proven track record of delivering technicaloutput as an individual contributor. Proven Linux scripting ability and system administration, as well as a hands-on understanding of the technologies underpinning cloud services, virtual networks, resource management and monitoring are essential. They will have experience with OpenStack deployments or the technologies they rely on, as well as container deployment and management using Docker, Podman or Kubernetes. We are looking to find a candidate who is a confident user of version control system, who brings experience with Continuous Integration or testing pipelines and solutions. They should also have some knowledge of continuous system management concepts and tools.
On top of these technical skills, we would like to identify candidates who are able to work independently on critical infrastructure who maintain a focus on end-user availability. They should understand how to prioritise, as well as assess risk, issues, impacts and constraints, and have strong communication and presentation skills.

About Graphcore
How often do you get the chance to build a technology that transforms the future of humanity? Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more sophisticated models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business. We recently joined SoftBank Group, bringing large and ongoing investment from one of the worlds leading backers of innovative AI companies.

In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; were committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.