Data Scientist

LexisNexis, Faringdon, Oxfordshire

Salary not available. View on company website.

  • Full time
  • Temporary
  • Remote working

Posted 4 days ago, 27 Oct | Get your application in now to be included in the first week's applications.

Closing date: Not specified

Job Ref: 2f96ae13baf646dfa76542cabe8822b6

Full Job Description

We are seeking a GenAI/Data Scientist to join our AI Innovation Team. This role will focus on experimenting with, optimising and applying off-the-shelf Generative AI models to extract valuable insights from large-scale patent datasets and enhance our search and analytics tools. You will collaborate with data scientists, product teams, and stakeholders across different geographies, driving innovation through Large Language Models (LLMs) and advanced AI methodologies.

  • Perform exploratory data analysis (EDA), and design, build, and deploy pipelines that use LLMs to enhance patent search and analytics applications.
  • Stay up to date with the latest research and developments in the field of natural language processing (NLP) and machine learning.
  • Work on GenAI techniques such as Prompt Engineering and RAG (Retrieval-Augmented Generation), and use evaluation frameworks to optimise LLM performance.
  • Develop and implement machine learning workflows, focusing on the integration of GenAI with existing data infrastructure.
  • Collaborate with the team to explore Generative AI use cases, including automated summarisation, natural language understanding, and text generation.
  • Conduct experiments to evaluate model performance, identify areas for improvement, and implement enhancements.
  • Perform continuous evaluations and improvements of models to handle large volumes of patent data.
  • Work with stakeholders across teams to identify key areas for AI-driven innovation and enhancement in data products.
  • Use Python, SQL, PySpark and related technologies to develop scalable solutions, focusing on large-scale data processing.

  • Demonstrate 3+ years of experience in data science, with a focus on NLP, Generative AI and LLMs.
  • Proficiency in Python and experience working with transformer-based LLMs and NLP frameworks (e.g. Hugging Face, spaCy, PyTorch/TensorFlow).
  • Knowledge of Prompt Engineering, RAG techniques and evaluation methodologies for integrating GenAI with search/retrieval systems and measuring output quality.
  • Experience working with cloud platforms like Azure, AWS, or GCP for machine learning workflows.
  • Understanding of data engineering pipelines and distributed data processing (e.g., Databricks, Apache Spark).
  • Strong analytical skills, with the ability to transform raw data into meaningful insights through AI techniques.
  • Experience with LangChain / LlamaIndex, vector databases (e.g., FAISS), and fine-tuning models on domain-specific data would be an advantage.

    The LexisNexis Intellectual Property (IP) division (https://www.lexisnexisip.com) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights and metrics for critical business decisions. Our corporate culture thrives on excellence, innovation, and a strong dedication to our customers, employees, and communities. Working here means joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively.

    We promote a healthy work/life balance across the organisation. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.
  • Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive

    Working for you

    We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
  • Generous holiday allowance with the option to buy additional days
  • Health screening, eye care vouchers and private medical benefits
  • Wellbeing programs
  • Life assurance
  • Access to a competitive contributory pension scheme
  • Save As You Earn share option scheme
  • Travel Season ticket loan
  • Electric Vehicle Scheme
  • Optional Dental Insurance
  • Maternity, paternity and shared parental leave
  • Employee Assistance Programme
  • Access to emergency care for both the elderly and children
  • RECARES days, giving you time to support the charities and causes that matter to you
  • Access to employee resource groups with dedicated time to volunteer
  • Access to extensive learning and development resources
  • Access to employee discounts scheme via Perks at Work
