• Software Engineer - Site Reliability, OpenShift

    Location DE-Berlin
    Posting date 1 month ago (1/15/2020 6:11 AM)
    Job ID 75863
    Category Software Engineering
  • Company Description

    At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

    Job summary

    The Red Hat OpenShift Site Reliability Engineering (SRE) team is looking for a Software Engineer to join our global team in Germany. In this role, you will work on Red Hat OpenShift, which is enterprise Kubernetes, as part of a team that develops and operates Red Hat OpenShift Dedicated, a public cloud service based on Red Hat OpenShift for large enterprise customers. You’ll play a key role in contributing to solutions that make Red Hat OpenShift Dedicated scalable, featureful, resilient, and secure while maintaining a balance between development and operations work. You’ll contribute to the design and development of automation software to provision, upgrade, monitor, and heal a large global fleet of Red Hat OpenShift clusters deployed across multiple public clouds. You'll participate in a global on-call rotation and help lead incident management, root cause analysis, and continuous improvement activities, managing engineering efforts against a service-level agreement (SLA) and error budget. OpenShift SRE is a sophisticated, global, fast-paced team inside the world's open source leader with constant opportunities to learn new skills and innovate new solutions to meet our customers' demands. As a Software Engineer on this team, you will directly contribute to Red Hat's success in the rapidly growing Kubernetes as a Service (KaaS) market.

    Primary job responsibilities

    • Design and write automation software to provision, upgrade, monitor, and heal a large global fleet of Red Hat OpenShift clusters deployed across multiple public clouds
    • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions
    • Participate in the release cycles of our offerings, deploying code to integration, staging, and production environments, integrating with continuous integration (CI) and continuous delivery (CD) tooling, monitoring, and change management
    • Perform software updates, peer code reviews, testing, and Common Vulnerabilities and Exposures (CVE) analysis; respond to security threats
    • Interact with automated monitoring and healing infrastructure to ensure healthy environments
    • Provide engineering support to Red Hat's global technical support team to resolve customer issues
    • Create and maintain standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes, and remediating problems in our environment
    • Participate in a global on-call rotation, including periodic weekend and holiday on-call duties

    Required skills

    • 5+ years of software engineering experience using object-oriented languages; Golang, Python, and Java are a plus
    • 3+ years of experience managing Linux-based systems in a public cloud like Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure
    • 3+ years of experience with enterprise system monitoring; knowledge of Prometheus is a plus
    • 3+ years of experience with enterprise configuration management tools like Red Hat Ansible Automation, Puppet, or Chef
    • 1+ year(s) of experience delivering hosted cloud services
    • 1+ year(s) of experience with Kubernetes
    • 1+ year(s) of experience with containers on Linux
    • Superior communication skills and experience working directly with and presenting to customers
    • Demonstrated ability to quickly and accurately troubleshoot systems issues
    • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP


    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


    Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

     
