Research Engineering Specialist

Salary Not Specified

Meta, West Ealing, Ealing

Full time
Permanent
Onsite working

Posted 1 day ago, 30 Oct | Get your application in today.

Closing date: Not specified

Job Ref: 1953e400835648d1ae8cd6974944a4ce

Full Job Description

The Llama, GenAI team is seeking an AI Specialist for a Research Engineering role focused on advancing reasoning capabilities in large language models. In this role, you will work with our state-of-the-art Llama model and implement cutting-edge techniques, including but not limited to Monte Carlo tree search, online reinforcement learning, and advanced efficient inference methods. The focus will range from verifiable reasoning, such as mathematical reasoning, to more complex areas, including legal reasoning, causal analysis, and decision-making.

Research Engineering Specialist - Llama, GenAI Responsibilities

Execute research to push forward state-of-the-art reasoning capabilities in large language models.
Contribute to code health, research experiments, and organization of research insights, and communicate results effectively.
Collaborate with a global team from diverse backgrounds on the entire large language model post-training pipeline.
Thrive in a fast-paced environment with ambitious targets.

Qualifications

At least a Bachelor's degree in Computer Science, Computer Engineering, Artificial Intelligence, Machine Learning, or a related field.
Relevant industry or research experience, including experience in collaborative team environments.
Proficiency in software development and deep learning frameworks such as PyTorch.
Specialised experience in large language model reasoning through post-training and alignment, demonstrated by past projects.
An advanced degree (Master's or PhD) in machine learning or natural language processing is preferred.
Experience in the LLM reasoning space, including math, code, and non-verifiable reasoning.
Experience in reinforcement learning and tree search algorithms.
Experience with various reward modeling approaches in LLM alignment.