Hexagon

Cloud Platform SRE Team Manager - Anywhere in Europe

Job Locations UK-ENG-Swindon
Req# ID
2024-11610
# of Openings
1
Job Posting Category
Product Management
Travel %
5
Job Location : Location
UK-ENG-Swindon

Responsibilities

Hexagon’s Asset Lifecycle Intelligence division(Hexagon) is seeking a Cloud Platform SRE Team Manager in the United Kingdom or anywhere in Europe, who can lead a small team of highly skilled engineers in a fast-paced, rapidly evolving environment.

 

You'll have the opportunity to shape and define the DevOps and SRE practice moving forward, helping us build out our processes, working with your team and other stakeholders to drive quality, reliability & process improvements throughout the development teams.

 

You and your team will be central to the success of our new cloud SaaS platform and stakeholders will rely on your knowledge and expertise.

 

 

  • Lead and manage a small team of Site Reliability Engineers (SREs) responsible for building and maintaining our platform infrastructure.
  • Develop, mentor, and empower your team members to enhance their technical skills, expand their knowledge base, and achieve their professional goals. We encourage a culture of learning and collaboration, where team members can thrive and excel.
  • Drive the team's efforts to ensure the availability, scalability and performance of our applications by implementing best practices for site reliability engineering.
  • Collaborate closely with development teams to implement efficient code and deployment strategies using GitOps principles.
  • Continuously monitor logs and telemetry to spot potential issues, weaknesses or failures.
  • Identify areas for improvement and implement appropriate optimizations within your own team or with partner teams.
  • Work on incidents in conjunction with team members and coordinating with wider stakeholders to resolve production issues promptly.
  • Maintain and evolve our infrastructure-as-code configurations, CI/CD & automation scripts and be responsible for ensuring that our cloud platforms are secure, resilient, and scalable.
  • Stay up-to-date with the latest industry trends, tools, and practices related to site reliability engineering and cloud technologies.

Qualifications

Essential Experience:

 

  • A track record of building and managing SRE / DevOps or App Support teams in a large, fast-moving organisation.
  • 4-6 years experience in software development.
  • 3+ years of commercial experience in an SRE (or similar) role.
  • Degree preferred or equivalent years of practical job experience in a similar function or role.
  • Hands on experience of building and maintaining enterprise grade platforms using a major public cloud platform (Azure, AWS or GCP).
  • In-depth knowledge of commercial logging and telemetry platforms, ideally App Insights, Grafana & Prometheus.
  • Demonstrable experience of deploying, patching and maintaining Kubernetes clusters and associated toolsets.
  • Experience of authoring and maintaining infrastructure-as-code configurations, ideally in Terraform.
  • Experience of building CI/CD pipelines in a relevant framework (preferably Azure DevOps).
  • Experienced in authoring automation scripts using PowerShell and Bash.
  • Experience of large scale identity management and implementing Role-Based Access Controls (RBAC), ideally with Azure Active Directory.
  • Experience of authoring and maintaining Helm charts.
  • Familiarity with GitOps principles. Hands-on experience of ArgoCD or Flux is desirable.
  • Confidence in analyzing and questioning the assumptions and justifications behind technical specifications/implementations, with an open mind to considering alternative approaches which may be a better fit.
  • Willingness to share out-of-hours on-call responsibilities with the team.
  • The successful candidate will be a champion of DevOps culture and SRE principles in the division.

 

Benefits:

 

  • Competitive salary and benefits package.
  • Opportunities for professional growth and advancement within a dynamic and exciting environment.
  • Work with a talented, global team of experts in cutting-edge technologies.
  • Collaborative and inclusive work environment.
  • Flexible work schedule, with the choice to work from an office, from home, or on a hybrid.

 

#LI-AW1

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed

Connect With Us!