cover image
Microsoft

Member of Technical Staff, AI Data

On site

London, United Kingdom

Full Time

11-04-2025

Job Specifications

Help build the world’s most advanced multimodal dataset at Microsoft AI

We are on a mission to create the largest and most advanced multimodal dataset in the world. This dataset, spanning all modalities from across the web and beyond, will power the training of the world’s most capable AI frontier models, pushing the boundaries of scale, performance, and product deployment.

The AI Data team at Microsoft AI is responsible for all aspects of data preparation to support our model pre-training operations, including collecting data from the source, extracting and transforming the most useful data, and understanding the impact of changes to data by training and evaluating new models. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience.

About

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. In particular, we are looking for candidates who:

Are passionate about the role of data in large-scale AI model training
Will thrive in a highly collaborative, fast-paced environment
Have a high degree of craftsmanship and pay close attention to details
Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies
Effectively manage multiple responsibilities and can adjust to shifting priorities.

Responsibilities

Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video).
Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models.
Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation.
Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models.
Embody our culture and values.

Required/Minimum Qualifications

Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
OR equivalent experience.
Expertise in large scale data engineering ideally applied to AI
Expertise in Spark, Kubernetes or similar.

#copilot #microsoftAI

About the Company

Every company has a mission. What's ours? To empower every person and every organization to achieve more. We believe technology can and should be a force for good and that meaningful innovation contributes to a brighter world in the future and today. Our culture doesn’t just encourage curiosity; it embraces it. Each day we make progress together by showing up as our authentic selves. We show up with a learn-it-all mentality. We show up cheering on others, knowing their success doesn't diminish our own. We show up every day o... Know more

Related Jobs

Company background Company brand
Company Name
SentinelOne
Job Title
Staff AI Platform Engineer
Job Description
About Us At SentinelOne, we’re redefining cybersecurity by pushing the limits of what’s possible—leveraging AI-powered, data-driven innovation to stay ahead of tomorrow’s threats. From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We’re looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you’re excited about solving complex challenges in bold, innovative ways, we’d love to connect with you. What are we looking for? Join our dynamic team of AI engineers and researchers from around the world! We’re passionate about bringing the latest breakthroughs in Artificial Intelligence and Machine Learning to our customers, and we need someone like you to help make it happen. We are looking for an AI Platform Engineer who will be directly involved in developing our core AI technology and products. We are the team behind PurpleAI (released for general availability in April 2024, see also the latest demo), awarded one of the Top10 hottest cyber products of 2023. What will you do? You'll be part of a team of globally distributed AI engineers and researchers, developing features for SentinelOne products. As a Senior Platform Engineer on our team, you will develop and ship features for AI team products and services. You will implement features in Python, write robust tests, tackle bugs with finesse, and ensure top-notch security and quality in our platforms. You will collaborate across multiple engineering teams to design and build new capabilities that will be used across SentinelOne’s industry-leading platform. We are a team with a broad set of responsibilities, and a diverse set of backgrounds. Any of the following skills would signify a excellent fit with our team: Developing complex applications leveraging API-based LLMs Training and deploying modern machine learning pipelines Experience in data science methodologies and technologies Computer security and security research Experience in Site Reliability Engineering (SRE) or other operational roles Experience in Spark, Databricks, Weights and Biases, Airflow, MLFlow, or other MLOps/AIOps technologies Modern Python libraries, coding practices, and tooling (asyncio, pydantic, pandas, ruff, mypy) Experience with GPU technologies and compute frameworks (Morpheus, RAPIDS, JAX, Pytorch, CUDA) Familiarity with Google Cloud Platform and/or Microsoft Azure What skills and experience should you bring? We are looking for exceptional engineers who are passionate about delivering great AI products. To that end, we are looking for engineers with the following experience: A degree in computer science or software engineering. Additional or directly relevant experience will be considered in lieu of a degree. 5+ years of experience in Python development and shipping Python code in production environments 3+ years of experience solving complex problems using modern AI/ML techniques Experience in software engineering and with data structures and algorithms Comfortable working cross-functionally across both research and product teams Experience with large-scale high-load distributed systems Excellent communication skills and ability to work asynchronously Experience with Docker, Kubernetes, Terraform, ArgoCD or other similar technologies Familiarity with Amazon Web Services Why us? You will work on real-world problems and make an impact by protecting our customers from cyber threats. You will be building the next phase of growth of SentinelOne with our incredible platform & best tech in the industry! You will tackle challenges leading from the front and work with the very BEST in the industry. Private medical care, accident cover and life insurance Restricted Stock Units with annual refreshers Employee Stock Purchase Programme Flexible working hours and access to several co-working spaces High-end MacBook or Windows laptop and home-office-setup gear Volunteering day off and 4_ Wellness Days per year (ad-hoc days off for self care) Global gender-neutral parental leave and grandparent leave Global Employee Assistance Programme offering confidential counseling Full access to LinkedIn Learning, an e-learning platform Full access to Wellness Coach, a mental well-being and fitness application Company inclusion networks and Mentorship programme SentinelOne is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. SentinelOne participates in the E-Verify Program for all U.S. based roles.
United Kingdom
Remote
Full Time
23-04-2025
Company background Company brand
Company Name
Barrington James
Job Title
Production Data Scientist
Job Description
Barrington James are recruiting for a Production Performance specialist for our Client who is well known within their field and rapidly growing. Role Purpose: As a Production Performance Specialist, you will analyze production targets, volumes, and yields to ensure alignment with organizational goals. You will focus on optimizing production processes by conducting a comprehensive analysis of production flows and outputs, providing insights to drive continuous improvements. Key Responsibilities: Analyze production targets, volumes, and yields against organizational objectives. Conduct in-depth analysis of production flows and outputs to assess performance. Identify and analyze performance indicators to recommend process improvements. Collaborate with cross-functional teams to gather data and insights. Develop and maintain reports and dashboards to monitor production performance. Provide actionable recommendations based on data insights. Work with production teams to implement and evaluate process changes. Stay current on industry trends and best practices in production analysis. Qualifications and Skills: Bachelor’s degree in business administration, economics, engineering, or a related field. Strong analytical skills and attention to detail. Proficiency in data analysis tools and techniques. Excellent communication and interpersonal skills. Ability to work both independently and collaboratively in a fast-paced environment. Strong problem-solving and solution-oriented mindset.
Worthing, United Kingdom
On site
Full Time
23-04-2025
Company background Company brand
Company Name
Sanderson
Job Title
Data Engineer
Job Description
Data Engineer Location: Edinburgh (Hybrid Working) Salary: Up to £41,300 Working Hours: Monday to Friday, core office hours Outsource UK is proud to be recruiting on behalf of a respected retail banking group for a Data Engineer to join their growing team in Edinburgh. This is an excellent opportunity for a data-driven professional who’s passionate about building and maintaining high-quality data solutions in a fast-paced, customer-focused environment. You’ll work with modern tools and collaborate with teams across the business to enable smarter decision-making and improved service delivery. What You’ll Do Develop, maintain, and optimise data pipelines and ETL processes Collaborate with analysts, engineers, and stakeholders to understand data requirements Support the delivery of data solutions aligned with business objectives Ensure data integrity, quality, and compliance with governance policies Contribute to continuous improvement of data platforms and workflows What We’re Looking For: Experience working as a Data Engineer or in a similar role within financial services or a regulated industry Proficiency in SQL, data modelling, and ETL tools Familiarity with cloud-based data platforms (e.g., Azure, AWS, or GCP) Strong problem-solving skills and attention to detail Excellent communication and collaboration abilities If you are an experienced Data Engineer and interested in the above role then apply today or email astafford@outsource-uk.co.uk for further information
Edinburgh, United Kingdom
On site
Full Time
23-04-2025
Company background Company brand
Company Name
Oracle
Job Title
Autonomous Database Engineer (Cloud DBA & DevOps) - UK
Job Description
Job Description The Autonomous Database Team is responsible for building the Cloud service framework powering various Oracle Autonomous Database cloud services, including Autonomous Data Warehouse (ADW) and Autonomous Transaction Processing (ATP). The framework automates deployment, scaling and management of databases in the cloud. It is built on top of Oracle's Cloud Infrastructure (OCI) Layer. The Autonomous Database cloud service framework features APIs to handle all lifecycle management operations of databases. It also performs operations autonomously based on internal and external events. The team has the unique opportunity to make significant contributions to the full stack of Oracle technology, from database kernel to cloud service platform and to customer-facing portals. You will be responsible for deploying fixes for the Autonomous Database service with a deep focus on architecture, production operations, performance management, deployment and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our stakeholders while ensuring reliability and performance. The successful candidate will be expected to interface with both senior management and the various delivery teams across OCI and database security and compliance architecture team to define and maintain our compliance posture in an environment of increasing regulatory expectations and growth. We are a dynamic and enthusiastic team with great emphasis on go-getters and proactive individuals. Overview from oracle.com with links to other collateral: https://www.oracle.com/autonomous-database/ In this role you will need to: Operate and performs maintenance to cloud database services running within the region. Assess, prioritize, and communicate risks and urgency to leadership and engineering teams Take ownership of the implementation and production operations of a wide array of core system platform solutions React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems Ensure thorough documentation of incidents through company-standard reporting methods. Stay informed of cloud infrastructure stacks Preferred Qualifications: At least a Bachelor’s degree, in Computer Science, MIS or another technical field, or equivalent work experience. Solid experience with Linux. Experience troubleshooting complex software and/or networking issues. Knowledge on Cloud computing architecture, primarily Identity, Compute and Networking Expert level experience, understanding, implementation and troubleshooting of Oracle Database technology including RAC, Dataguard, ASM, RMAN preferred. Development skills utilizing Python, shell, SQL, Terraform Prior DevOps or continuous delivery and deployment experience preferred Strong communication and analytical skills Proven ability to quickly learn new technical domains and then train others. This is full-remote role but you are required to obtain Gov Security Clearance. Career Level - IC3 Qualifications Career Level - IC3 About Us As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all. Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs. We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States. Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Reading, United Kingdom
Remote
Full Time
23-04-2025