MLSys 2025 Career Opportunities
Here we highlight career opportunities submitted by our Exhibitors, and other top industry, academic, and non-profit leaders. We would like to thank each of our exhibitors for supporting MLSys 2025.
New York
Quantitative Strategies
Overview
Technology is integral to virtually everything the D. E. Shaw group does, which is why we seek exceptional software developers with a range of quantitative and programming abilities. Members of our technical staff collaborate on challenging problems that directly impact the firm’s continued success, utilizing their excellent analytical, mathematical, and software design skills as well as some of the most advanced computing resources in the world. Software developers have the opportunity to be part of an inclusive, collaborative, and engaging working environment.
What you’ll do day-to-day
Specific responsibilities may include formulating statistical models for our computerized trading strategies, developing distributed systems to analyze and react to incoming data in real time, and creating tools for advanced mathematical modeling.
Who we’re looking for
- Successful developers have traditionally been the top students in their programs and have extensive software development experience.
- We welcome outstanding candidates at all experience levels.
- The expected annual base salary for this position is USD 200,000. Our compensation and benefits package includes substantial variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, a sign-on bonus, a relocation bonus, and benefits including medical and prescription drug coverage, 401(k) contribution matching, wellness reimbursement, family building benefits, and a charitable gift match program.
San Francisco, California & Seattle, Washington
Are you a charismatic data engineering Developer Advocate at heart, someone who excels in the spotlight, loves presenting and hosting events, and possesses a deep knowledge of the Databricks Intelligence Platform?
As a Databricks Developer Advocate, you’ll be a crucial link between our engineering teams and the broad community of data professionals building their careers on Databricks technologies. This role will leverage your strong expertise in Databricks to educate, inspire, and support our growing user base.
Your responsibilities will encompass a wide range of knowledge-sharing activities. You’ll deliver engaging talks, host insightful panels and meetups, create informative blogs and video content, develop comprehensive courseware, provide expert answers in community forums, and conduct one-on-one sessions with influential data engineers and scientists. These efforts will help disseminate best practices and foster a thriving Databricks community.
Community engagement will be at the heart of your role. You’ll work closely with the Databricks community and the product and engineering teams, ensuring that user needs and product development align seamlessly. Reporting directly to the Head of Developer Relations, you’ll collaborate with fellow developer advocates and program managers to create and execute a cohesive and impactful developer relations strategy.
The ideal candidate for this position will embody the values of our Developer Relations team: a deep passion for data and AI, genuine empathy and technical understanding of developers’ needs, and a strong commitment to effectively explaining our products.
You’ll use your diverse Databricks skill set to educate the community about Databricks technologies and continuously gather and utilize community feedback to improve the developer experience.
The impact you will have:
- Create compelling content to inspire data practitioners, helping them discover new features and run projects at scale.
- Drive awareness and adoption of Databricks technologies through speaking engagements at industry events.
- Create high-quality educational content like videos, sample notebooks, datasets, tutorials, courseware, and blog posts.
- Expand and nurture the Databricks user community by organizing and growing meetups and user groups and providing support to data scientists, engineers, and analysts in online communities.
- Collaborate with product and engineering teams to share community learnings and influence product direction.
- Create and manage developer-focused programs to foster engagement and loyalty, creating a positive developer experience for data practitioners.
- Gather and analyze feedback from the community to drive continuous improvement of Databricks products and services.
What we look for:
- 5+ years of experience in DevRel and as a software developer, solutions architect, or related role
- Subject matter knowledge of solving data engineering problems at all scales
- Active participation and recognized leadership in Databricks community forums, chats, and meetups
- Comprehensive understanding of the Databricks products and ecosystem
- Proven track record in nurturing developer communities, organizing user groups, and facilitating global meetups
- Exceptional communication skills, with a talent for articulating complex concepts through writing, teaching, videos, and public speaking
- Deep empathy for developer needs, with the ability to craft engaging experiences across tutorials, notebooks, and community platforms
- Adept at collaborating with cross-functional stakeholders to align community initiatives with product objectives
- Demonstrated ability to catalyze growth in both digital and in-person developer communities
The Department of Computer Science wishes to appoint up to five academics in Artificial Intelligence and Machine Learning to strengthen our rapidly growing AI & Machine Learning Research Group.
We are a highly collaborative team, working not only with other researchers in our department but across the university and beyond. The Department of Computer Science hosts the UKRI Centre for Doctoral Training in Accountable, Responsible and Transparent Artificial Intelligence (ART-AI), which brings together AI challenges across science, engineering, humanities and social sciences. Also hosted by the Department of Computer Science is the Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA), offering a range of exciting opportunities for AI research around virtual production, motion capture and volumetric capture. In addition to university and external HPC resources, the department has its own cloud service with over 80 GPUs.
As a department, we will offer you support and growth opportunities, including career mentoring, and opportunities to progress, manage and lead. We will help you to develop collaborations with regional, national and international application sector partners, for example in entertainment, sport, defence and healthcare. We will support the deepening of existing academic collaborations and the development of new ones. You will offer us a strong academic record and the ability and enthusiasm to create an engaging experience for our excellent students.
The University of Bath is based on an attractive, single-site campus that facilitates interdisciplinary research. It is located on the edge of the World Heritage City of Bath and offers the lifestyle advantages of working and living in one of the most beautiful areas in the UK.
For more information, please view the full advert on our website.
New York
Quantitative Strategies / Technology
Overview
At the D. E. Shaw group, technology is integral to virtually everything we do. We’re seeking exceptional software developers with expertise in generative AI (GAI) to join our team. As a lead software developer in GAI, you’ll lead innovative projects and teams, leveraging your extensive experience and leadership skills to advance our GAI initiatives. By making GAI more accessible for both technical and non-technical users across the firm, you’ll drive substantial business impact.
What you’ll do day-to-day
You’ll join a dynamic environment, leading efforts in advancing GAI capabilities. Depending on your skills and interests, potential areas of focus may include:
- Leading the development and maintenance of shared GAI infrastructure and applications, ensuring data is prepared and integrated for effective use in GAI initiatives, and enhancing software development team productivity through GAI.
- Building sophisticated retrieval-augmented generation (RAG) pipelines over large document sets to improve data utility and accessibility across the firm.
- Managing collaboration with internal groups and end users, accelerating AI product development and deployment, and customizing solutions to their needs.
- Leading experimentation with new AI-driven tools and applications, integrating them into various platforms, and fostering collaboration to enhance AI effectiveness.
- Driving greenfield projects, which offer significant opportunities for ownership and growth in a rapidly expanding GAI landscape.
Who we’re looking for
- We’re looking for candidates who have a strong background in software development and a solid understanding of GAI technologies.
- Successful developers have traditionally been top performers in their academic programs and possess a strong foundation in AI-related projects.
- We’re particularly interested in outstanding candidates who have 6+ years of overall experience; who are eager to thrive in an inclusive, collaborative, and fast-paced environment; and who have a proven track record of leading projects and successfully leading or managing teams.
- The expected annual base salary for this position is USD 275,000 to USD 350,000. Our compensation and benefits package includes substantial variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, and benefits including medical and prescription drug coverage, 401(k) contribution matching, wellness reimbursement, family building benefits, and a charitable gift match program.
New York
Quantitative Strategies
Overview
Machine learning developers at the D. E. Shaw group work closely with researchers to creatively apply their knowledge of machine learning and software engineering to design, build, and maintain systems for high-performance, large-scale knowledge discovery in financial data. Machine learning developers have the opportunity to be part of an inclusive, collaborative, and engaging working environment.
What you’ll do day-to-day
Specific responsibilities include designing, implementing, testing, and documenting modules for all stages of the pipeline from data to predictions, assembling these modules into end-to-end systems, and interacting with researchers to achieve highly productive experimentation, model construction, and validation.
Who we’re looking for
- Successful candidates will have a strong knowledge of software engineering, machine learning, and open-source machine learning ecosystems. A track record of building and applying high-performance machine learning systems is desired. While an impressive record of academic achievement is a plus, we welcome outstanding candidates from diverse academic disciplines and backgrounds.
- The expected annual base salary for this position is USD 250,000 to USD 350,000. Our compensation and benefits package includes substantial variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, a sign-on bonus, and benefits including medical and prescription drug coverage, 401(k) contribution matching, wellness reimbursement, family building benefits, and a charitable gift match program.
San Francisco, California
Description
Founded in late 2020 by a small group of machine learning engineers and researchers, MosaicML enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maximum security and control. Compatible with all major cloud providers, the MosaicML platform provides maximum flexibility for AI development. Introduced in 2023, MosaicML’s pretrained transformer models have established a new standard for open source, commercially usable LLMs and have been downloaded over 3 million times. MosaicML is committed to the belief that a company’s AI models are just as valuable as any other core IP, and that high-quality AI models should be available to all.
Part of Databricks since July 2023, we are passionate about enabling our customers to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI platform so our customers can use deep data insights to improve their business. We leap at every opportunity to solve technical challenges, striving to empower our customers with the best data and AI capabilities.
You will:
- Design and productionize state of the art tooling and open source technologies to enable the development of frontier foundation models for Databricks customers
- Solve complex problems at scale around data preprocessing, model training, hyperparameter tuning and model evaluation
- Implement advanced optimization techniques to reduce the resource footprint of models while preserving their performance and balancing usability for our developers and customers
- Collaborate with product managers and cross-functional teams to drive technology-first initiatives that enable novel business strategies and product roadmaps
- Facilitate our user community through documentation, talks, tutorials, and collaborations
- Contribute to the broader AI community by publishing research, presenting at conferences, and actively participating in open-source projects, enhancing Databricks' reputation as an industry leader.
Below are some example projects:
- Composer: Large-scale distributed deep learning training library
- Streaming: Library for efficient data loading from cloud object storage
- LLM Foundry: Framework for developing and evaluating Large Language Models
We look for:
- Hands-on experience with the internals of deep learning frameworks (e.g., PyTorch, TensorFlow) and GenAI models (e.g., GPT, Stable Diffusion)
- Experience with large scale, distributed training on GPUs (e.g., Nvidia, AMD) and alternative deep learning accelerators
- Strong sense of design and usability
- Effective communication skills and the ability to articulate complex technical ideas to cross-disciplinary internal and external stakeholders
- Prior history of contributing to or developing open source projects is a bonus but not a requirement
We value candidates who are curious about all parts of the company's success and are willing to learn new technologies along the way.
Bath
Lecturer / Senior Lecturer
Department: Computer Science
Salary: Lecturer (Grade 8), £46,735 rising to £55,755 per annum. Senior Lecturer (Grade 9), £57,422 rising to £66,537 per annum.
The Department of Computer Science wishes to appoint up to seven academics in Artificial Intelligence and Machine Learning.
About the role
You will work with colleagues, students and researchers to develop and publish papers. You will apply for research funding to support your ideas. You will find ways of making your research available to society.
You will design and deliver teaching materials for lectures, tutorials, and labs.
You will have a few internal roles to help the Department run smoothly.
The support and growth opportunities we provide
Training for an HEA fellowship qualification
New Lecturers will be enrolled into the Pathway to HEA Fellowship. Senior Lecturers will have the option to do so.
Mentoring
All of our staff are allocated a mentor when they join. Your mentor will support you in your day-to-day job and help you progress.
About you
Our ideal candidate for both Lecturer and Senior Lecturer positions will hold a PhD or equivalent in a relevant discipline, along with a UG degree or equivalent experience.
- You should demonstrate substantial research experience in your field, with an emerging track record for Lecturers and an established research profile with funding success for Senior Lecturers
- A deep conceptual understanding of your subject, alongside experience teaching at UG/PG levels, is essential
- Strong written, verbal, and interpersonal skills, along with the ability to form positive collaborations, are required
- Senior Lecturers should also exhibit academic leadership and a clear research vision. Both roles demand excellent organisational and administrative skills
- A commitment to excellence in research and teaching, student experience, and ethical professional conduct is essential
SF Bay Area or New York City
About the role
We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.
Responsibilities:
- Provide infrastructure support to our ML research and product
- Build tooling to diagnose cluster issues and hardware failures
- Monitor deployments, manage experiments, and generally support our research
- Maximize GPU allocation and utilization for both serving and training
Requirements:
- 4+ years of experience supporting the infrastructure within an ML environment
- Experience in developing tools used to diagnose ML infrastructure problems and failures
- Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)
- Experience working with GPUs
Nice to have:
- Experience with large GPU clusters and high-performance computing/networking
- Experience with supporting large language model training
- Experience with ML frameworks like PyTorch/TensorFlow/JAX
- Experience with GPU kernel development
New York
Quantitative Strategies / Technology
Overview
At the D. E. Shaw group, technology is integral to virtually everything we do. We’re seeking exceptional software developers with expertise in generative AI (GAI) to join our team. As a software developer in GAI, you’ll work on innovative projects, leveraging your quantitative and programming skills to advance our GAI initiatives. By making GAI more accessible for both technical and non-technical users across the firm, you’ll drive substantial business impact.
What you’ll do day-to-day
You’ll join a dynamic environment, contributing to our efforts in advancing GAI capabilities. Potential areas of focus include:
- Developing and maintaining shared GAI infrastructure and applications, ensuring firmwide data integration and enhancing software development across the firm.
- Working on foundational building blocks, such as vector databases and LLM gateways, to support AI tools and applications.
- Leveraging state-of-the-art cloud models for scalable and high-availability solutions.
- Scaling the adoption of GAI tools, expanding AI models, and integrating them with internal knowledge sources to drive innovation.
- Collaborating with internal groups and end users to accelerate AI product development and deployment, tailoring solutions to their needs.
- Experimenting with new AI-driven tools and applications, integrating them into various platforms, and facilitating collaboration to enhance the effectiveness of AI applications.
- Working on greenfield projects, which offer opportunities to shape the future of GAI at the firm and make a significant impact.
Who we’re looking for
- We’re looking for candidates who have a strong background in software development and a solid understanding of GAI technologies.
- Successful developers have traditionally been top performers in their academic programs and possess a strong foundation in AI-related projects.
- We welcome outstanding candidates at all experience levels who are excited to work in an inclusive, collaborative, and fast-paced environment.
- The expected annual base salary for this position is USD 200,000 to USD 250,000. Our compensation and benefits package includes variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, and benefits including medical and prescription drug coverage, 401(k) contribution matching, wellness reimbursement, family building benefits, and a charitable gift match program.
SF Bay Area or New York City
About the role and team
Joining us as a Research Engineer on the ML Systems team, you’ll be working on cutting-edge ML training and inference systems, optimizing the performance and efficiency of our GPU clusters, and developing new technologies that fine-tune leading consumer AI models with a data flywheel and serve 20K+ QPS in production with LLMs. Your work will directly contribute to our groundbreaking advancements in AI, helping shape an era where technology is not just a tool, but a companion in our daily lives. At Character.AI, your talent, creativity, and expertise will not just be valued; they will be the catalyst for change in an AI-driven future.
What you'll do
The ML Systems team is responsible for the research and deployment of systems that efficiently utilize GPUs for AI-enabled products.
As a research engineer, you will work across teams and our technical stack to improve our training performance and inference runtime. You will get to shape the conversational experience of millions of users per day.
Example projects:
- Write efficient Triton kernels and tune them for our specific models and hardware
- Develop prefix-aware routing algorithms to improve serving cache hit rate
- Train and distill LLMs to improve latency while preserving accuracy and engagement
- Build an efficient and scalable distributed RLHF stack powering the model innovations
- Develop systems for efficient multimodal (image gen/video gen) model training & inference
Who you are
- "All Industry Levels": at least PhD (or equivalent) research experience
- Write clear and clean production system code
- Strong understanding of modern machine learning techniques (reinforcement learning, transformers, etc.)
- Track record of exceptional research or creative ML systems projects
- Comfortable writing model development code (PyTorch) for either training or inference
Nice to Have
- Experience training large models in a distributed setting using PyTorch Distributed, DeepSpeed, or Megatron
- Experience working with GPUs & collectives (training, serving, debugging) and writing kernels (Triton, CUDA, CUTLASS)
- Experience with LLM inference systems and literature such as vLLM and FlashAttention
- Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud)
- Publications in relevant academic journals or conferences in the field of machine learning and systems