Hexagon’s Asset Lifecycle Intelligence division(Hexagon) is seeking a Cloud Platform SRE Team Manager in the United Kingdom or anywhere in Europe, who can lead a small team of highly skilled engineers in a fast-paced, rapidly evolving environment.
You'll have the opportunity to shape and define the DevOps and SRE practice moving forward, helping us build out our processes, working with your team and other stakeholders to drive quality, reliability & process improvements throughout the development teams.
You and your team will be central to the success of our new cloud SaaS platform and stakeholders will rely on your knowledge and expertise.
- Lead and manage a small team of Site Reliability Engineers (SREs) responsible for building and maintaining our platform infrastructure.
- Develop, mentor, and empower your team members to enhance their technical skills, expand their knowledge base, and achieve their professional goals. We encourage a culture of learning and collaboration, where team members can thrive and excel.
- Drive the team's efforts to ensure the availability, scalability and performance of our applications by implementing best practices for site reliability engineering.
- Collaborate closely with development teams to implement efficient code and deployment strategies using GitOps principles.
- Continuously monitor logs and telemetry to spot potential issues, weaknesses or failures.
- Identify areas for improvement and implement appropriate optimizations within your own team or with partner teams.
- Work on incidents in conjunction with team members and coordinating with wider stakeholders to resolve production issues promptly.
- Maintain and evolve our infrastructure-as-code configurations, CI/CD & automation scripts and be responsible for ensuring that our cloud platforms are secure, resilient, and scalable.
- Stay up-to-date with the latest industry trends, tools, and practices related to site reliability engineering and cloud technologies.