DevOps Engineer – AI Marketing Platform

Remote
United States
Posted 1 week ago

Jasper, the leading AI marketing platform (recognized as one of Fast Company’s Top 15 Most Innovative AI Companies of 2024), is seeking an experienced DevOps Engineer to join its Platform team. This is a highly autonomous, high-impact role focused on infrastructure engineering, reliability, and delivery pipeline optimization for AI-powered products at scale.

This is a Full-time, Fully Remote role open to candidates located anywhere in the continental US. The expected base salary range is $170,000 – $200,000 annually, plus an equity grant.


Role Summary and AI/Kubernetes Mandate

Reporting to the Staff DevOps Engineer, you will drive developer velocity and system reliability. The core responsibilities focus on scaling cloud-native infrastructure, implementing advanced CI/CD techniques, and supporting specialized AI/ML compute requirements.

What You’ll Do:

  • Cloud-Native Infrastructure: Design, implement, and operate cloud-native infrastructure (GCP focus) that scales efficiently, fails gracefully, and optimizes for performance and cost.
  • Infrastructure-as-Code (IaC): Develop IaC solutions using Terraform and Helm to create self-healing, automated, and observable systems.
  • Delivery Pipelines: Build and refine software delivery pipelines to enable safe, fast, and frequent deployments with robust testing, rollback, and progressive release mechanisms.
  • AI/ML Support: Collaborate with ML and product teams to support AI model training and inference through scalable compute and storage infrastructure, including GPU-based compute.
  • Reliability Engineering: Identify and eliminate single points of failure, performance bottlenecks, and scalability limits through proactive monitoring and reliability practices.
  • Security: Implement and enforce security best practices, including secrets management, access control, and compliance across all infrastructure layers.

Required Experience and Technical Qualifications

The ideal candidate possesses expert-level skills in running production Kubernetes clusters, utilizing Terraform for IaC, and implementing robust observability with tools like Datadog.

  • Core Expertise:
    • Deep experience running Kubernetes in production (cluster management, networking, storage, security).
    • Expertise with Terraform, Helm, and configuration management to build reproducible, version-controlled infrastructure.
    • Proven success designing and maintaining CI/CD pipelines (GitHub Actions, Argo CD, Jenkins, etc.) balancing speed and safety.
    • Practical knowledge of Google Cloud Platform (GCP) and cloud-native architectures.
  • Automation & Observability:
    • Strong background in observability (especially Datadog)—skilled at instrumentation, dashboard creation, and intelligent alerting.
    • Solid scripting skills in Python, Go, or Bash, with a focus on automation and operational efficiency.
  • AI/Security Experience:
    • Experience supporting AI/ML workloads, including GPU-based compute and multi-language environments (TypeScript, Python, Go).
    • Familiarity with container security, secrets management, and policy enforcement.
  • (Bonus): History of open source contributions in infrastructure, CI/CD, or observability projects.

Job Features

Job CategoryAI (Artificial Intelligence), DevOps

Apply For This Job

A valid phone number is required.