Datadog logo

Senior Staff Machine Learning Engineer - BitsAI

Datadog
Full-time
On-site
New York City, New York, United States
Machine Learning

Datadog is building AI-assisted agents to help our users solve operational and security-related problems in an automated way. This ambitious project consumes data about our customers’ systems, discovers potential problems, proposes solutions, and offers those solutions to users through both manual and automated steps. The goal of the team is to reduce the time users spend on DevSecOps tasks, freeing them up to accomplish more.

We’re looking for a Senior Staff Machine Learning Engineer to lead the development of all AI agents at Datadog. We will leverage existing models, fine-tune (or eventually train new systems) where needed, and build a team of end-to-end agentic systems that perform tasks for users on Datadog, just like a human might. Think of this as working to assist and eventually replace the need for a wide range of jobs done on Datadog every day, including Operational Incident Management and Investigation, Security Analyst Triaging, and more. You will work across our science and engineering teams to build applied AI systems at production scale, experiment with and iterate on new ideas in the space, and help Datadog develop the AI agents of the future.

At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them. 

What You’ll Do:

  • Lead technical effort to develop the architecture, patterns, and individual solutions for new LLM driven AI Agent systems

  • Help drive the AI/ML strategy across the company for AI Agent driven systems

  • Enhance the AI/ML competence across an organization of engineers, engineering managers, and product managers.

  • Provide mentorship on the usage of ML

  • Partner and provide technical leadership to other Staff Engineers and Product partners, policy owners and operational teams, to identify and demonstrate AI/ML opportunities. Encourage critical thinking within teams to develop this understanding themselves

  • Collaborate closely with data scientists and engineers to develop the next generation of Datadog’s AI Agent systems 

  • Represent and evangelize AI/ML across Datadog

  • Regularly speak with users to discover their needs and feedback

  • Navigate tradeoffs between thoroughness and time-to-market, understanding how to provide the right level of value to customers at the right time

  • Work with other Datadog teams to integrate their domain expertise into solution pathways

  • Mentor and grow other engineers on the team and beyond

Who You Are:

  • You’re passionate about Generative AI/LLMs and its ability to help software engineers and security engineers be more productive and keep systems more secure

  • You have practical knowledge of the modern machine learning lifecycle

  • You ship fast, route around blockers, and are comfortable working with ambiguity

  • You love talking with users

  • You’re happy to jump into any part of the stack and do whatever’s needed to move a project forward

  • You may have been a founder or early builder at a startup– this is a similar kind of effort

  • You have a BS/MS/PhD in a Computer Science, Engineering or related scientific field or equivalent experience