{"id":295,"date":"2025-09-29T15:47:03","date_gmt":"2025-09-29T15:47:03","guid":{"rendered":"https:\/\/skillbasedmatching.com\/jobs\/?post_type=jobpost&#038;p=295"},"modified":"2025-09-29T15:47:06","modified_gmt":"2025-09-29T15:47:06","slug":"site-reliability-engineer-remote-us","status":"publish","type":"jobpost","link":"https:\/\/skillbasedmatching.com\/jobs\/current-jobs\/site-reliability-engineer-remote-us\/","title":{"rendered":"Site Reliability Engineer (Remote, US)"},"content":{"rendered":"\n<p>Red Hat is seeking a <strong>Site Reliability Engineer (SRE)<\/strong> to join the team responsible for developing, scaling, and operating its <strong>OpenShift<\/strong> managed cloud services. This is a crucial role focused on running Red Hat&#8217;s enterprise Kubernetes distribution at scale, demanding expertise in coding, operations, and large-scale distributed system design. The position is <strong>fully remote<\/strong> within the US, with specific locations noted as Remote US CO, WA, and CA.<\/p>\n\n\n\n<p>The salary range for this role is highly competitive, spanning <strong>$94,550.00 to $191,840.00<\/strong> annually, with the final offer based on qualifications, experience, and location.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Key Responsibilities and Contributions<\/h3>\n\n\n\n<p>As an SRE, you will be a core contributor to the service&#8217;s reliability and scalability, working within a small, agile, global team that practices continuous improvement and blameless postmortems.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Code and Development:<\/strong> Contribute code to increase the <strong>scalability and reliability<\/strong> of the service. You will also contribute software tests and participate in peer reviews to ensure code quality.<\/li>\n\n\n\n<li><strong>Automation and Efficiency:<\/strong> Focus on <strong>eliminating work through automation<\/strong> and making the monitoring system more sustainable.<\/li>\n\n\n\n<li><strong>Operational Excellence:<\/strong> Participate in a regular <strong>on-call schedule<\/strong> (including occasional paid weekends and holidays) and practice sustainable <strong>incident response and blameless postmortems<\/strong>.<\/li>\n\n\n\n<li><strong>Support and Mentoring:<\/strong> Resolve customer issues escalated from the Global Support team and help develop peers&#8217; capabilities through knowledge sharing, mentoring, and collaboration.<\/li>\n\n\n\n<li><strong>Agile Work:<\/strong> Work within a small agile team to develop and improve SRE software, plan, and self-improve.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Required Experience and Technical Skills<\/h3>\n\n\n\n<p>The ideal candidate will have a strong blend of software engineering and cloud operations expertise.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Education &amp; Experience:<\/strong> BS in Computer Science or a related technical field, or equivalent experience.<\/li>\n\n\n\n<li><strong>Software Engineering:<\/strong> <strong>3+ years<\/strong> of software engineering experience with at least one object-oriented language (<strong>Python, Golang, Java, C, C++<\/strong>). <strong>Golang is preferred<\/strong>.<\/li>\n\n\n\n<li><strong>Cloud Operations:<\/strong> <strong>3+ years<\/strong> of experience managing <strong>Linux-based systems<\/strong> in a public cloud (<strong>AWS, GCP, or Azure<\/strong>).<\/li>\n\n\n\n<li><strong>Monitoring:<\/strong> <strong>3+ years<\/strong> of experience with enterprise systems monitoring; knowledge of <strong>Prometheus is preferred<\/strong>.<\/li>\n\n\n\n<li><strong>Cloud &amp; Containers:<\/strong>\n<ul class=\"wp-block-list\">\n<li><strong>1+ year<\/strong> experience delivering hosted cloud services.<\/li>\n\n\n\n<li><strong>1+ year<\/strong> experience with <strong>Kubernetes<\/strong>.<\/li>\n\n\n\n<li><strong>1+ year<\/strong> experience with <strong>containers on Linux<\/strong>.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Technical Fundamentals:<\/strong> Solid understanding of standard <strong>TCP\/IP networking<\/strong> and common protocols like <strong>DNS and HTTP<\/strong>.<\/li>\n\n\n\n<li><strong>Soft Skills:<\/strong> Excellent communication skills in a global team environment and a demonstrated ability to quickly and accurately <strong>troubleshoot systems issues<\/strong>.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Benefits and Company Culture<\/h3>\n\n\n\n<p>Red Hat offers a comprehensive benefits package applicable to full-time, permanent US associates, including medical, dental, and vision coverage, a <strong>401(k) with employer match<\/strong>, paid time off and holidays, and paid parental leave.<\/p>\n\n\n\n<p>Red Hat emphasizes an inclusive culture built on open source principles, encouraging associates from diverse backgrounds to share ideas and challenge the status quo. The company is an equal opportunity and affirmative action employer.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Red Hat is seeking a Site Reliability Engineer (SRE) to join the team responsible for developing, scaling, and operating its OpenShift managed cloud services. This is a crucial role focused on running Red Hat&#8217;s enterprise Kubernetes distribution at scale, demanding expertise in coding, operations, and large-scale distributed system design. The position is fully remote within [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"menu_order":0,"template":"","jobpost_category":[1224],"jobpost_job_type":[39],"jobpost_location":[1041],"jobpost_tag":[188,126,1262,1260,614,267,1261,991,231,1232,183,168,1257,1259,24,1258,81,1018,1020],"class_list":["post-295","jobpost","type-jobpost","status-publish","hentry","jobpost_category-cloud-engineer","jobpost_job_type-remote","jobpost_location-united-states","jobpost_tag-aws","jobpost_tag-azure","jobpost_tag-cloud-services","jobpost_tag-containers","jobpost_tag-gcp","jobpost_tag-golang","jobpost_tag-incident-response","jobpost_tag-information-technology","jobpost_tag-java","jobpost_tag-kubernetes","jobpost_tag-linux","jobpost_tag-networking","jobpost_tag-openshift","jobpost_tag-prometheus","jobpost_tag-python","jobpost_tag-red-hat","jobpost_tag-remote","jobpost_tag-site-reliability-engineer","jobpost_tag-sre"],"_links":{"self":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost\/295","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost"}],"about":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/types\/jobpost"}],"author":[{"embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/users\/1"}],"wp:attachment":[{"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/media?parent=295"}],"wp:term":[{"taxonomy":"jobpost_category","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_category?post=295"},{"taxonomy":"jobpost_job_type","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_job_type?post=295"},{"taxonomy":"jobpost_location","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_location?post=295"},{"taxonomy":"jobpost_tag","embeddable":true,"href":"https:\/\/skillbasedmatching.com\/jobs\/wp-json\/wp\/v2\/jobpost_tag?post=295"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}