Job Title: Software Engineer - DevOps
Location: New York, NY (Hybrid)
Type: Full-time
About Us: We are an innovative AI SaaS startup based in New York, focused on building industry-leading, AI agent-based automation solutions that empower businesses, starting with financial services. Our cloud-native platform leverages the latest in AI and machine learning to deliver scalable, intelligent automation. As we rapidly scale, we're looking for a talented Cloud DevOps Engineer to help optimize and secure our infrastructure while ensuring continuous delivery of high-quality software.
Role Overview: As a Cloud DevOps Engineer, you will play a critical role in designing, implementing, and maintaining our cloud infrastructure. You will work closely with the software engineering and AI/ML teams to automate deployments, monitor performance, and ensure security and scalability. You will drive the adoption of DevOps best practices, including CI/CD pipelines, cloud orchestration, and infrastructure as code, ensuring our AI platform is reliable, efficient, and scalable.
Key Responsibilities:
Design, build, and maintain scalable, secure, and high-performance cloud infrastructure (AWS/Azure/GCP).
Automate infrastructure management using Infrastructure-as-Code tools (OpenTofu, Pulumi, etc.).
Develop, implement, and manage CI/CD pipelines to streamline the software development lifecycle.
Monitor, troubleshoot, and optimize system performance, availability, and security, including on-call support.
Collaborate with software engineering and AI/ML teams to align infrastructure with our AI product goals.
Ensure infrastructure meets the highest security standards and compliance requirements.
Establish and maintain logging, monitoring, and alerting systems to support rapid response to issues.
Optimize cloud costs and ensure efficient use of cloud resources.
Enable rapid scaling of infrastructure in response to growing data and user demands.
Contribute to disaster recovery planning, backup strategies, and fault-tolerant designs.
Requirements:
Bachelor's or Master's degree in Computer Science.
5+ years of experience in a Cloud DevOps role, working in cloud environments like AWS, Google Cloud, or Azure.
Expertise in cloud infrastructure management tools like Terraform/OpenTofu, CloudFormation, Ansible, or Pulumi.
Strong experience with CI/CD tools like GitHub Actions, Jenkins, GitLab, or equivalent.
Experience with containerization and orchestration tools (Docker, Kubernetes, ECS, etc.).
Solid understanding of networking, security best practices, and monitoring in cloud environments.
Proficiency in scripting languages like Python, Bash, or PowerShell.
Familiarity with logging/monitoring solutions like Prometheus, Grafana, Datadog, or CloudWatch.
Knowledge of cloud-native technologies, microservices architecture, and serverless computing.
Experience with cloud security management, including IAM policies, VPC configuration, and data encryption.
Excellent problem-solving skills and ability to work in a fast-paced, agile environment.
Preferred Qualifications:
Experience in AI/ML environments, supporting machine learning operations (MLOps/LLMOps) and data pipelines.
AWS/GCP/Azure certifications.
Knowledge of GitOps practices and tools (Flux, Argo CD, etc.).
Understanding of regulatory and compliance standards (GDPR, SOC 2, ISO 27001, etc.).
What We Offer:
Competitive salary and equity options.
Flexible working environment with hybrid options.
Opportunity to work with leading-edge AI technologies in a fast-growing startup.
Collaborative, inclusive, and innovative team culture.
Professional development opportunities, including certifications and training.
Health insurance and other benefits.
How to Apply: If you're passionate about cloud infrastructure and DevOps best practices and want to help shape the future of AI technology, we'd love to hear from you. Please submit your resume and a brief cover letter outlining your experience and interest in the role at https://jobs.ashbyhq.com/artian.