cover image
PolyAI

Research Scientist - Large Language Model Post-Training

Remote

London, United Kingdom

Full Time

25-03-2025

Job Specifications

PolyAI automates customer service through lifelike voice assistants that let customers lead a conversation. Our voice assistants make it possible for businesses to deliver outstanding customer service that rivals their human agents. Our customers, which include the world’s leading logos, are expanding how they use our platform, driving automation of critical customer service operations and integrating PolyAI into their daily customer service workflows.

As a Research Scientist specialising in large language model post-training, you will play a key role in shaping and implementing strategies for aligning language models for use in our conversational AI platform. Your primary focus will be on post-training techniques such as preference- finetuning, reward modelling, synthetic data generation etc.

Responsibilities

Train models and conduct experiments to assess model performance in live deployments
Work on experimental model architectures, exploring multimodal, efficient long-context etc.
Develop post-training strategies to achieve state of the art performance on domain-specific tasks
Generate, collect, and annotate contact center data from sources such as real customer calls, chats, online open datasets, and synthetic data
Develop robust evaluation benchmarks to track improvements in production models
Collaborate with the legal and compliance team to address any compliance or data privacy-related issues
Work closely with product and engineering teams to ensure alignment with business and production goals
Stay informed about the latest advancements in machine learning, ASR, TTS, and LLM to continuously enhance our technologies

Requirements

A degree in Computer Science, Machine Learning, or a related field, or equivalent industry experience
3+ years of experience working with deep learning and statistical models
Strong knowledge of data quality standards and annotation processes, with the ability to independently evaluate and improve models
Proficiency in Python and familiarity with relevant ML frameworks and libraries (e.g., PyTorch)
Experience with cloud services such as AWS, GCP, or Azure
Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences
A passion for solving technical challenges and driving practical solutions

Preferred Qualifications:

Experience working with LLMs and data preparation pipelines.
Experience with speech models, such as ASR or TTS.

Benefits

Participation in the company’s employee share options plan

25 days holiday, plus bank holidays

Flexible working from home policy

Work from outside of the UK for up to 6 months each year

Enhanced parental leave

Bike2Work scheme

Annual learning and development allowance

One-off WFH allowance when you join

Company-funded fertility and family-forming programmes

Menopause care programme with Maven

Private healthcare and dental cover, discounts on gym members and relaxation apps, and access to a range of mental health programs

At PolyAI, we take great pride in our values—they guide everything we do. We believe that a strong culture leads to meaningful work and lasting impact.

Our Core Values Are

Only the best: We expect the best from our people, we hire people that expect the best from themselves, and we nurture this drive for excellence.
Ownership: We care deeply about what we do. We take ownership of our initiatives, decisions and outcomes.
Relentlessly improve: We demand more from ourselves and are always evolving. Continuous, obsessive improvement is the only way we will transform the world of conversational AI.
Bias for action: Our world moves quickly and so do we. We take calculated risks and we deliver impact fast.
Disagree and commit: We are all working toward the same goal. If we donʼt agree with something, we work hard to understand it and when a decision is made, we accept it and give it our all.
Build for people: We are hyper-focused on delivering the best automated experiences possible so that we can empower people to get exactly what they need, when they need it.

PolyAI is proud to be an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

Kindly find the Privacy Notice for our recruitment process by following the link here. This document provides important information regarding how we handle your personal data throughout the recruitment journey.

About the Company

PolyAI builds enterprise conversational assistants that carry on natural conversations with customers to solve their problems. Our conversational assistants understand customers, regardless of what they say or how they say it. Know more

Related Jobs

Company background Company brand
Company Name
Songtradr
Job Title
Senior Software Engineer
Job Description
Company Description From one tiny Santa Monica office in 2014, we now have offices in 15 major cities worldwide, delivering 100%+ YoY revenue growth in recent years. In this short time span, we have developed some of the most progressive teams and technologies and continue innovating how B2B music is researched, discovered, created, and transacted. With these capabilities, we’ve helped our clients grow their businesses and foster positive brand affinity with their respective audiences – all through the power of music. Our clients include Google, Adidas, Netflix, VRBO, Fenty, and many more. We’re thrilled to celebrate their success and are proud our efforts have been recognized across the creative industry. Our teams regularly receive top awards such as Cannes Gold Lion, D&AD, Clios, Music and Sound Awards, London International Awards, Transform, and more. And yet, our work is far from done. We will continually challenge ourselves, question, and strive for excellence…. All done with a shared love of music and technology and changing the industry for good. About Bandcamp Bandcamp (a Songtadr company) is the world’s largest online record store and music community where passionate fans discover, connect with and directly support the artists they love. With the majority of revenue going directly to artists, Bandcamp fosters a communal experience with music discovery, where artists and fans support each other in a vibrant ecosystem. Bandcamp artists and labels release and sell a broad range of music products, including digital and physical records, vinyl, apparel and merchandise. Learn more at https://bandcamp.com/ and follow on LinkedIn, Instagram, Facebook and Tiktok. Job description Bandcamp is looking for a generalist senior software engineer with a focus on front-end technology to join the feature engineering team. As a senior software engineer, you will design, develop and maintain software applications for Bandcamp’s seller tools team. As a member of Bandcamp’s feature engineering team, you will collaborate with other engineers, product management and design to clarify specifications and build high quality, moderately complex, well-tested software that delivers an excellent user experience. You will also participate in code reviews, troubleshooting and bug fixes. This role requires a balance of teamwork and independent initiative. You thrive in cross-functional team settings while also demonstrating the ability to own projects from kickoff to deployment with little supervision. You will be expected to communicate effectively with teammates while making autonomous decisions when needed. You combine technical expertise with a self-motivated approach and a genuine passion for Bandcamp's mission of supporting independent music. Required experience 6+ years of front- and back-end programming experience. Programming experience in Ruby or a similar high-level language like Python, C# or Java. Front-end development experience using Vue or similar technologies like React. Relational database experience, including writing SQL directly in online, high-performance, transactional systems. Other requirements Your working hours must have 4 hours of overlap with US central time. You must be able to attend and participate in a daily standup meeting at 11:00am US central time. You must be willing and able to learn and work with a large legacy codebase. Nice to have Linux or macOS command-line expertise, including use of zshell or bash. Prior experience building consumer software products. An eye for—and focus on—user experience. Personal Attributes You are passionate about independent music and artist empowerment. You take complete ownership and accountability for owned tasks and are comfortable with self-direction and independence. You are passionate about finding the right solution and getting the details right. Perks of the Job Flexible remote/hybrid work Health care benefits Paid vacation time
United Kingdom
Remote
Full Time
08-04-2025
Company background Company brand
Company Name
Accuris
Job Title
Senior ML Engineer
Job Description
About us: Accuris, a company long-known for accelerating innovation in engineering workflows and supporting the vibrancy of the engineering community, launched in May 2023 as a standalone company. Accuris was formerly known as S&P Global’s Engineering Solutions division. The Company is valued for its standards content and workflow solutions like Engineering Workbench, Goldfire, Haystack and Parts Management Solutions. Under its previous owners, including S&P Global, IHS and IHS Markit, Accuris has been an integral part of the engineering ecosystem for more than 60 years. In the Accuris we think differently, combining the knowledge and resources of an established company with the unapologetic boldness of a startup. We build software solutions fueled by trusted data that connect to engineering workflows in revolutionary ways, illuminating answers that previously were impossible to find and empowering our clients to envision the future, so they can identify the best course of action in the present. We’re disrupting the current digital transformation landscape with state-of-the-art AI developed by a passionate team with a bias to action. Are you ready to join us in building the software that will power the most impactful companies in the world? Our AI team is looking for new talents for the role of Senior ML/AI Engineer critical for success of projects related to Natural Language Processing, Data Capturing, Content Understanding and Information Retrieval domains. You will be responsible for design and engineering aspects of innovative projects related to search, automatic structuring and understanding of unstructured content. Your role is needed to ensure efficient architectures and low-level design of deep learning-based and LLM-based solutions for high-quality deliverables into Accuris products with focus on innovation, scientific research, experimentation, and optimal design. Your duties will include: Building and optimization of LLM-based solutions. Building and optimization of ML/DL training, testing and inference pipelines for the latest GPU and CPU hardware and cloud environments. Design and building of DL/ML-based systems optimized to meet production requirements. Development and optimization of production inference and data pipelines. Delivering AI/ML solutions to production. Analysis, transformation and bootstrapping of datasets. Requirements: BS degree in math and computer science, or related field. 4+ years of experience as machine learning engineer or software engineer in ML-related projects. Experience in Natural Language Processing and/or Information Retrieval. Solid understanding of deep learning methods. Experience in ML/DL modeling frameworks like PyTorch and/or other DL frameworks. Experience with popular modern NLP libraries/frameworks including Hugging Face Transformers. Algorithmic skills Developed programming skills (Python) English language (B1+) The following will hugely increase our interest: MS or PhD Degree. Publications in related domain. Experience with inference and serving engines. Experience in building LLM-based solutions. Skills in algorithms and data structures design. Linux user experience About Company Statement: Accuris delivers essential intelligence that powers decision making. We provide the world’s leading organizations with the right data, connected technologies and expertise they need to move ahead. We think differently, combining the knowledge and resources of an established company with the unapologetic boldness of a startup. (https://accuristech.com/) Our mission: build an evolvable knowledge and data platform that enables STEM professionals to unlock and deliver innovation to the world’s most complex problems. Accuris provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.
United Kingdom
Remote
Full Time
08-04-2025
Company background Company brand
Company Name
AndrofitAI
Job Title
Senior Computer Vision Developer (Path to CTO)
Job Description
We’re Hiring: Senior Computer Vision Developer (Path to CTO) - Equity Position Location: Remote Industry: Wellness, Fitness, AI, ML, Computer Vision Equity: Offering in lieu of initial salary (details below) Commitment: Minimum 1-2 days per week About AndroFit Ai: AndroFit Ai is a pioneering London-based startup at the intersection of wellness, fitness, and cutting-edge technology, including AI, ML, and computer vision. We are developing revolutionary AI-driven personal trainers aimed at enhancing physical fitness through precise body posture and movement analysis along with nutrition, mental health, welness, physio therapy, etc. With a potential market size of £200 billion in the UK alone and plans to expand into other Western markets, we are poised for significant growth. We have successfully secured funding from angel investors, developed a ready prototype, engaged interested B2B clients, and crafted a clear plan for revenue generation. We are gearing up to raise upwards of £500,000 in our next funding round with strong interest already from potential investors. Role Overview: Senior Computer Vision Developer (Path to CTO) What You’ll Do: Design and implement computer vision algorithms for body posture analysis using real-time video data. Build and iterate on our prototype to create a functional, user-friendly AI fitness tool. Collaborate closely with the founding team to shape the product and technical roadmap. Lay the groundwork for scalable architecture as we prepare for growth. What We’re Looking For: Hands-on experience with computer vision (e.g., OpenCV, TensorFlow, PyTorch) and body posture analysis (e.g., pose estimation models like OpenPose, MediaPipe, or custom solutions). Strong coding skills in Python or similar languages, with a knack for rapid prototyping. Passion for fitness, AI, and startups—bonus if you’ve worked on AI-driven consumer products. Comfortable working in a fast-paced, equity-driven environment with big upside potential. What We Offer: Equity compensation until our next funding round (in progress), with competitive salary post-funding. A chance to join as a founding team member and shape our tech vision. Potential to step into the CTO role if we’re a great fit and the prototype succeeds. Remote work flexibility (add location preferences if applicable, e.g., "Remote – US preferred"). How to Apply: Submit your resume, GitHub/portfolio link, and complete our short Google Form: https://shorturl.at/EbFqN to tell us about your experience. We’re moving fast—apply by 10th April 2025
United Kingdom
Remote
Full Time
07-04-2025
Company background Company brand
Company Name
Mach42
Job Title
Senior/Principal Analog Verification Engineer
Job Description
Join our team developing cutting-edge deep learning technology accelerating the verification of analog circuits. The successful candidate will have an important role in future product development and the company's long-term vision for its technology. The role comes with a competitive compensation package including early equity options in a fast-growing company. Work will be remote initially (based in the UK), with an option for hybrid working soon. Mach42 is a spin-out company from the University of Oxford, founded by world-class scientists and engineers. Responsibilities As a Senior/Principal Analog Verification Engineer, the successful candidate will be expected to: Create and validate behavioural models using Verilog-A for analog and mixed signal designs. Drive the creation of an industry-leading machine-learning based analog verification flow as part of our product development. Debug and resolve complex simulation and modelling issues. Support our product development team by creating crisp, actionable user requirements. Engage with our strategic partners to identify solutions leveraging our AI technology. Key requirements Master’s degree or bachelor’s degree in computer or electronics engineering with 5 years or more of relevant industry experience Expertise in Spectre® and Verilog-A model development at both block and subsystem/system level. Strong understanding of analog design concepts (e.g. PLLs, LDOs, ADCs, DACs). Excellent problem-solving skills and attention to detail. Strong written and oral communication skills. Outstanding teamworking/collaboration skills. Preferred qualifications PhD degree on quantitative fields Knowledge of the EDA / Semiconductor industry, in particular analog design experience Passion for exploring new applications of deep learning in industry Benefits Compensation: £60k - £80k base salary (Senior)/ £75k - £100k base salary (Principal) + annual bonus Flexible working Opportunity to work with a world-class team Equity allocation
United Kingdom
Remote
Full Time
08-04-2025