Associate Site Reliability Engineer

  • Full-Time
  • Remote
  • Red Hat Software
  • Posted 3 years ago – Accepting applications
Job Description
Job summary: The Red Hat OpenShift Site Reliability Engineering Platform team (SRE-P) is seeking an Associate Site Reliability Engineer (SRE) to join our team. OpenShift is Enterprise Kubernetes, and SRE-P delivers OpenShift Dedicated (OSD) as a cloud service. In this entry-level SRE role, you’ll participate in both the development and operations of OSD. You’ll interact with other SREs and product engineers around the world to deliver a large cloud-based container orchestration platform for sophisticated enterprise IT customers. You’ll write Kubernetes Operators and other software to autonomously manage the environment, and you’ll resolve faults by analyzing distributed systems and data when issues arise. As an Associate Site Reliability Engineer, you’ll work in a fast-paced, globally distributed team while quickly learning new skills and creating ways to consistently meet service-level agreements (SLAs) for our global OpenShift Dedicated cloud service.
OpenShift SRE-P is a growing, sophisticated, global, fast-paced team inside the world's Open Source leader with constant opportunities to learn new skills and innovate new solutions to meet our customers' demands. As an SRE on this team, you'll directly contribute to Red Hat's success in the rapidly growing Kubernetes-as-a-service market.
Primary job responsibilities:
  • Design, write, operate, and debug Kubernetes Operators and other software to provision, upgrade, monitor, and heal a large global fleet of OpenShift clusters deployed across multiple public clouds
  • Support the operations of OpenShift Dedicated by responding to and troubleshooting system alerts
  • Provide engineering support to Red Hat's global technical support team to resolve customer issues
  • Participate in the development of new features and capabilities for OpenShift Dedicated
  • Participate in a follow-the-sun on-call rotation, including periodic weekend and holiday on-call duties
Required skills:
  • Software development experience using a general-purpose language; Go is preferred
  • Linux administration experience; Red Hat Enterprise Linux (RHEL), CentOS, or Fedora is preferred
  • Basic knowledge of software development lifecycle tools such as GitHub and Jenkins
  • Basic knowledge of monitoring systems; Prometheus is preferred
  • Basic experience with public cloud platforms such as Amazon Web Services (AWS), Google Cloud Platform, or Microsoft Azure
  • Experience supporting a live product
  • Passion to learn new technologies
  • Passion to build elegant software systems
  • Passion to troubleshoot complex technical issues
  • Passion for automation
About Red Hat: At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.