{"id":457,"date":"2025-10-27T20:02:55","date_gmt":"2025-10-27T20:02:55","guid":{"rendered":"https:\/\/skillbasedmatching.com\/jobs\/?post_type=jobpost&#038;p=457"},"modified":"2025-10-27T20:02:58","modified_gmt":"2025-10-27T20:02:58","slug":"site-reliability-engineer-sre-health-tech-operations","status":"publish","type":"jobpost","link":"https:\/\/skillbasedmatching.com\/jobs\/current-jobs\/site-reliability-engineer-sre-health-tech-operations\/","title":{"rendered":"Site Reliability Engineer (SRE) \u2013 Health Tech Operations"},"content":{"rendered":"\n<p><strong>SafeRide Health<\/strong> is seeking a <strong>Site Reliability Engineer (SRE)<\/strong> to join their IT Infrastructure team. This critical role is responsible for ensuring that user-facing services and production systems remain <strong>highly available, reliable, and scalable<\/strong> by developing and implementing new processes that support software delivery excellence and operational discipline.<\/p>\n\n\n\n<p>This is a <strong>Full-time, Remote<\/strong> position in the United States.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Core Responsibilities and Operational Discipline Mandate<\/h3>\n\n\n\n<p>The SRE will focus on minimizing downtime, automating tasks, and proactively managing system health and capacity. A key component involves defining and monitoring <strong>Service Level Objectives (SLOs)<\/strong> and collaborating closely with development teams.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Reliability &amp; Availability:<\/strong> Focus on availability, reliability, and scalability to keep systems and services running smoothly with minimal downtime.<\/li>\n\n\n\n<li><strong>Incident Management:<\/strong> Define and monitor <strong>SLOs<\/strong>, respond to and diagnose system incidents, and conduct <strong>post-mortems<\/strong> to prevent future occurrences.<\/li>\n\n\n\n<li><strong>Automation:<\/strong> Develop and maintain tools and scripts to <strong>automate repetitive tasks<\/strong> such as deployments, configuration management, and monitoring.<\/li>\n\n\n\n<li><strong>Monitoring &amp; Alerting:<\/strong> Implement and manage monitoring and alerting systems (<strong>Prometheus, DataDog, New Relic, Grafana, Splunk<\/strong>) to provide visibility and quickly detect potential issues.<\/li>\n\n\n\n<li><strong>Capacity &amp; Risk Mitigation:<\/strong> Perform <strong>capacity planning<\/strong> by monitoring resource usage to forecast future needs, and collaborate with stakeholders to identify and mitigate operational risks.<\/li>\n\n\n\n<li><strong>Optimization:<\/strong> Analyze metrics from operating systems and applications to identify areas for performance improvement.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Required Experience and Technical Qualifications<\/h3>\n\n\n\n<p>The ideal candidate has progressive experience in technology operations, with hands-on proficiency in production monitoring, containerized cloud infrastructure, and automation scripting.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Minimum Experience (Mandatory):<\/strong>\n<ul class=\"wp-block-list\">\n<li>Minimum of <strong>5 years<\/strong> progressive experience in an IT, Software Engineering, Technology Operations, or Business Continuity role.<\/li>\n\n\n\n<li>Minimum of <strong>2 years of hands-on experience<\/strong> in a <strong>Site Reliability, DevOps, or IT Observability role<\/strong>.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Technical Proficiency:<\/strong>\n<ul class=\"wp-block-list\">\n<li><strong>Basic proficiency in an AWS containerized environment<\/strong> running infrastructure as code.<\/li>\n\n\n\n<li>Demonstrated proficiency with <strong>production monitoring and alerting tools (DataDog is a major plus!)<\/strong>.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Key Skills:<\/strong>\n<ul class=\"wp-block-list\">\n<li><strong>Cloud Technologies:<\/strong> Expertise in major cloud platforms such as <strong>AWS and Azure<\/strong>.<\/li>\n\n\n\n<li><strong>Tools &amp; Technologies:<\/strong> Experience with tools for <strong>Infrastructure as Code (Terraform)<\/strong> and <strong>containerization (Docker)<\/strong>.<\/li>\n\n\n\n<li><strong>Systems Engineering:<\/strong> Deep knowledge of operating systems, networking, storage, and distributed systems.<\/li>\n\n\n\n<li><strong>Programming &amp; Scripting:<\/strong> Proficiency in coding languages like <strong>Python, Ruby, and JavaScript<\/strong> for automation and infrastructure management.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>SafeRide Health is seeking a Site Reliability Engineer (SRE) to join their IT Infrastructure team. This critical role is responsible for ensuring that user-facing services and production systems remain highly available, reliable, and scalable by developing and implementing new processes that support software delivery excellence and operational discipline. This is a Full-time, Remote position in [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"menu_order":0,"template":"","jobpost_category":[648],"jobpost_job_type":[39],"jobpost_location":[1041],"jobpost_tag":[397,188,126,2059,997,1019,1264,1197,1949,2058,1625,2057,1259,24,2014,1018,1623,1020,1004],"class_list":["post-457","jobpost","type-jobpost","status-publish","hentry","jobpost_category-information-technology","jobpost_job_type-remote","jobpost_location-united-states","jobpost_tag-automation","jobpost_tag-aws","jobpost_tag-azure","jobpost_tag-capacity-planning","jobpost_tag-datadog","jobpost_tag-devops","jobpost_tag-distributed-systems","jobpost_tag-docker","jobpost_tag-grafana","jobpost_tag-health-tech","jobpost_tag-incident-management","jobpost_tag-it-observability","jobpost_tag-prometheus","jobpost_tag-python","jobpost_tag-remote-us","jobpost_tag-site-reliability-engineer","jobpost_tag-slos","jobpost_tag-sre","jobpost_tag-terraform"],"_links":{"self":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost\/457","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost"}],"about":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/types\/jobpost"}],"author":[{"embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/users\/1"}],"wp:attachment":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/media?parent=457"}],"wp:term":[{"taxonomy":"jobpost_category","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_category?post=457"},{"taxonomy":"jobpost_job_type","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_job_type?post=457"},{"taxonomy":"jobpost_location","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_location?post=457"},{"taxonomy":"jobpost_tag","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_tag?post=457"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}