Lead Site Reliability Engineer (SRE)

Distributed

Fully Remote

3 months initially (to extend) 

Who are we?

We’re a software development company building the world’s Elastic Workforce, reinventing work and challenging the assumption that a local team = the best team.

We help businesses deliver technical projects better than ever before through our platform and on-demand Elastic Teams™. 

What’s in it for you? Our mission is to create freelance jobs with more benefits than permanent roles.

Want to know more? Read: https://distributed.co/about

About this role 

We’re working with a key customer to enhance their operational efficiency by assessing their current SRE processes. You’ll be working closely with the individual business units to understand what operational processes are currently in place and how these can be improved. 

Your Responsibilities

  • Assess current operational procedures and processes 
  • Engage with key technical and non-technical stakeholders to understand requirements for observability, backup and efficiency 
  • Work alongside a Business Architect to provide a report of recommended next steps based on current and future state 
  • Create an action plan for the implementation of these improved SRE processes 

    About You

    We’re looking for passionate technologists who enjoy working in collaborative agile teams. You’ll need to be a clear, concise & engaging communicator with people on your team. We enjoy the big picture and the detail; we want people who excel at both.

    You’ll have experience across the software development lifecycle, including design, development, testing, packaging, deployment, upgrade and support.

    • OpenStack cloud infrastructure experience
    • OpenStack development 
      • Familiarity with OpenStack components such as Keystone, Nova, Neutron and Glance
    • Python experience 
    • Ability to write patches for OpenStack in Python and contribute to the community
    • Experience supporting software-defined storage with Ceph or other cloud-based storage
    • Hypervisor technologies including KVM
    • Red Hat Enterprise Linux and/or CentOS build, development, and operations experience
    • Experience in building and maintaining code distribution through automated pipelines
    • Experience with Ansible or Puppet for configuration management
    • Software-defined networking technologies including OVS, OVN and NFV
    • IaC experience: Terraform, Ansible, Git, GitLab, Jenkins, Helm, ArgoCD, Conjur/Vault

        About us

        Distributed is proud to be an equal opportunities employer. Employees and contractors, as well as prospective employees and contractors, will all be treated equally and fairly. Distributed is committed to ensuring no less favourable treatment is experienced by any current or prospective employee because of any of the protected characteristics under the UK Equality Act 2010 or equivalent local equality legislation.

        By submitting your application you give us permission to store and use the information from your CV and your answers to application questions.

        Source
        remotive.com