MLSys 2025 Career Opportunities
Here we highlight career opportunities submitted by our Exhibitors, and other top industry, academic, and non-profit leaders. We would like to thank each of our exhibitors for supporting MLSys 2025.
Search Opportunities
San Francisco, California
Description Founded in late 2020 by a small group of machine learning engineers and researchers, MosaicML enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maximum security and control. Compatible with all major cloud providers, the MosaicML platform provides maximum flexibility for AI development. Introduced in 2023, MosaicML’s pretrained transformer models have established a new standard for open source, commercially usable LLMs and have been downloaded over 3 million times. MosaicML is committed to the belief that a company’s AI models are just as valuable as any other core IP, and that high-quality AI models should be available to all.
Now part of Databricks since July 2023, we are passionate about enabling our customers to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI platform so our customers can use deep data insights to improve their business. We leap at every opportunity to solve technical challenges, striving to empower our customers with the best data and AI capabilities.
You will: - Design and productionize state of the art tooling and open source technologies to enable the development of frontier foundation models for Databricks customers - Solve complex problems at scale around data preprocessing, model training, hyperparameter tuning and model evaluation Implement advanced optimization techniques to reduce the resource footprint of models while preserving their performance and balancing usability for our developers and customers - Collaborate with product managers and cross-functional teams to drive technology-first initiatives that enable novel business strategies and product roadmaps - Facilitate our user community through documentation, talks, tutorials, and collaborations - Contribute to the broader AI community by publishing research, presenting at conferences, and actively participating in open-source projects, enhancing Databricks' reputation as an industry leader.
Below are some example projects: - Composer: Large-scale distributed deep learning training library - Streaming: Library for efficient data loading from cloud object storage -LLM Foundry: Framework for developing and evaluating Large Language Models
We look for: - Hands on experience with the internals of deep learning frameworks (e.g. PyTorch, TensorFlow) and GenAI models (e.g. GPT, StableDiffusion) - Experience with large scale, distributed training on GPUs (e.g., Nvidia, AMD) and alternative deep learning accelerators - Strong sense of design and usability - Effective communication skills and the ability to articulate complex technical ideas to cross-disciplinary internal and external stakeholders - Prior history of contributing to or developing open source projects is a bonus but not a requirement
We value candidates who are curious about all parts of the company's success and are willing to learn new technologies along the way.
New York
Quantitative Strategies / Technology
Overview
At the D. E. Shaw group, technology is integral to virtually everything we do. We’re seeking exceptional software developers with expertise in generative AI (GAI) to join our team. As a software developer in GAI, you’ll work on innovative projects, leveraging your quantitative and programming skills to advance our GAI initiatives. By making GAI more accessible for both technical and non-technical users across the firm, you’ll drive substantial business impact.
What you’ll do day-to-day
You’ll join a dynamic environment, contributing to our efforts in advancing GAI capabilities. Potential areas of focus include:
- Developing and maintaining shared GAI infrastructure and applications, ensuring firmwide data integration and enhancing software development across the firm.
- Working on foundational building blocks, such as vector databases and LLM gateways, to support AI tools and applications.
- Leveraging state-of-the-art cloud models for scalable and high-availability solutions.
- Scaling the adoption of GAI tools, expanding AI models, and integrating them with internal knowledge sources to drive innovation.
- Collaborating with internal groups and end users to accelerate AI product development and deployment, tailoring solutions to their needs.
- Experimenting with new AI-driven tools and applications, integrating them into various platforms, and facilitating collaboration to enhance the effectiveness of AI applications.
- Working on greenfield projects, which offer opportunities to shape the future of GAI at the firm and make a significant impact.
Who we’re looking for
- We’re looking for candidates who have a strong background in software development and a solid understanding of GAI technologies.
- Successful developers have traditionally been top performers in their academic programs and possess a strong foundation in AI-related projects.
- We welcome outstanding candidates at all experience levels who are excited to work in an inclusive, collaborative, and fast-paced environment.
- The expected annual base salary for this position is USD 200,000 to USD 250,000. Our compensation and benefits package includes variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, and benefits including medical and prescription drug coverage, 401(k) contribution matching, wellness reimbursement, family building benefits, and a charitable gift match program.