{"id":535,"date":"2025-11-05T09:26:32","date_gmt":"2025-11-05T09:26:32","guid":{"rendered":"https:\/\/skillbasedmatching.com\/jobs\/?post_type=jobpost&#038;p=535"},"modified":"2025-11-05T09:26:35","modified_gmt":"2025-11-05T09:26:35","slug":"senior-cloud-infrastructure-engineer-multi-cloud-mlops","status":"publish","type":"jobpost","link":"https:\/\/skillbasedmatching.com\/jobs\/current-jobs\/senior-cloud-infrastructure-engineer-multi-cloud-mlops\/","title":{"rendered":"Senior Cloud Infrastructure Engineer \u2013 Multi-Cloud &#038; MLOps"},"content":{"rendered":"\n<p><strong>Hatch<\/strong>, a fast-moving technology company solving real-world business problems with AI, is seeking a <strong>Senior DevOps Engineer<\/strong> (titled Cloud Infrastructure Engineer) to join its high-impact engineering team. This senior role is focused on building the resilient, secure, and scalable infrastructure that powers the company&#8217;s core platform and AI product lines.<\/p>\n\n\n\n<p>This is a <strong>Full-time, Hybrid<\/strong> position based in <strong>SOHO, New York City<\/strong>. Candidates <strong>must be based in NYC<\/strong>, and <strong>visa sponsorship is not available<\/strong>.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Core Mandate: Infrastructure, MLOps, and Reliability<\/h3>\n\n\n\n<p>You will own the infrastructure that enables the company&#8217;s velocity, focusing on the specialized compute and data needs of machine learning workflows.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Infrastructure at Scale:<\/strong>\n<ul class=\"wp-block-list\">\n<li>Evolve the <strong>multi-cloud infrastructure (AWS &amp; GCP)<\/strong> using <strong>Infrastructure-as-Code (Terraform or Ansible)<\/strong>.<\/li>\n\n\n\n<li>Manage scalable, secure, and cost-efficient environments across all stages (dev, staging, production).<\/li>\n\n\n\n<li>Implement systems that support the <strong>compute-heavy and storage-intensive needs<\/strong> of machine learning and data processing pipelines.<\/li>\n\n\n\n<li>Participate in an <strong>on-call rotation<\/strong>.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>ML Platform Support (MLOps):<\/strong>\n<ul class=\"wp-block-list\">\n<li>Collaborate with ML engineers to <strong>productionize models<\/strong> and manage workflows across training, testing, and deployment.<\/li>\n\n\n\n<li>Implement infrastructure to support <strong>versioning, orchestration, and monitoring of ML models<\/strong> (using tools like Kubeflow, SageMaker, or VertexAI).<\/li>\n\n\n\n<li>Optimize data pipelines and model serving for low-latency and high-throughput performance.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Reliability &amp; Observability:<\/strong>\n<ul class=\"wp-block-list\">\n<li>Drive the strategy for <strong>observability, logging, and alerting<\/strong> across distributed systems.<\/li>\n\n\n\n<li>Lead incident response, <strong>root cause analysis (RCA)<\/strong>, and system hardening.<\/li>\n\n\n\n<li>Implement best practices for infrastructure security and container hardening.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Required Experience and Technical Stack<\/h3>\n\n\n\n<p>The role requires a senior engineer with deep AWS experience, IaC expertise, and specialized knowledge of the MLOps lifecycle.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Experience:<\/strong> <strong>3+ years<\/strong> of experience in DevOps, SRE, or platform engineering in high-growth environments.<\/li>\n\n\n\n<li><strong>Cloud Expertise:<\/strong> <strong>3+ years of experience with AWS infrastructure and services<\/strong>, including networking, IAM, ECS\/EKS, and serverless computing.<\/li>\n\n\n\n<li><strong>MLOps Experience:<\/strong> <strong>Experience supporting machine learning teams or MLOps platforms<\/strong> (e.g., model training pipelines, feature stores, online inference).<\/li>\n\n\n\n<li><strong>IaC &amp; CI\/CD:<\/strong> Strong experience with <strong>Terraform or Ansible<\/strong> and CI\/CD tooling (GitHub Actions, ArgoCD, etc.).<\/li>\n\n\n\n<li><strong>Containerization:<\/strong> Strong knowledge of container orchestration (<strong>Kubernetes preferred<\/strong>).<\/li>\n\n\n\n<li><strong>Observability:<\/strong> Strong knowledge of observability stacks (Prometheus, Grafana, Sentry, DataDog, etc.).<\/li>\n\n\n\n<li><strong>Programming:<\/strong> Familiarity with at least one programming language (<strong>Python, Go, Erlang, Rust, etc.<\/strong>).<\/li>\n\n\n\n<li><strong>Preferred:<\/strong> Exposure to <strong>agentic programming workflows<\/strong> and RHCE\/RHCSA or equivalent certifications.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><\/h3>\n","protected":false},"excerpt":{"rendered":"<p>Hatch, a fast-moving technology company solving real-world business problems with AI, is seeking a Senior DevOps Engineer (titled Cloud Infrastructure Engineer) to join its high-impact engineering team. This senior role is focused on building the resilient, secure, and scalable infrastructure that powers the company&#8217;s core platform and AI product lines. This is a Full-time, Hybrid [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"menu_order":0,"template":"","jobpost_category":[1294,46],"jobpost_job_type":[40],"jobpost_location":[469],"jobpost_tag":[2569,1769,2570,1232,2571,1334,2573,1002,2572,2568,1004],"class_list":["post-535","jobpost","type-jobpost","status-publish","hentry","jobpost_category-cloud-engineering","jobpost_category-data","jobpost_job_type-hybrid","jobpost_location-new-york-ny","jobpost_tag-aws-gcp-multi-cloud","jobpost_tag-cloud-infrastructure-engineer","jobpost_tag-hybrid-nyc","jobpost_tag-kubernetes","jobpost_tag-ml-platform-support","jobpost_tag-mlops","jobpost_tag-no-sponsorship","jobpost_tag-observability","jobpost_tag-python-go","jobpost_tag-senior-devops","jobpost_tag-terraform"],"_links":{"self":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost\/535","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost"}],"about":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/types\/jobpost"}],"author":[{"embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/users\/1"}],"wp:attachment":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/media?parent=535"}],"wp:term":[{"taxonomy":"jobpost_category","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_category?post=535"},{"taxonomy":"jobpost_job_type","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_job_type?post=535"},{"taxonomy":"jobpost_location","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_location?post=535"},{"taxonomy":"jobpost_tag","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_tag?post=535"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}