Site Reliability Engineer II

Reed Business Information Limited

Site Reliability Engineer II

Salary Not Specified

Reed Business Information Limited, Bedford Place, City of Southampton

  • Full time
  • Permanent
  • Onsite working

Posted 2 weeks ago, 24 Apr | Get your application in now before you miss out!

Closing date: Closing date not specified

job Ref: efdc92d2127e447b97e82b520f0569cf

Full Job Description

As a Site Reliability Engineer II, you will ensure the reliability and scalability of the company's systems and applications by designing, implementing, and maintaining infrastructure, automation, and monitoring solutions.,

  • Delivering resilient application stacks via "Infrastructure as Code" and other DevOps practices

  • Monitoring system performance and availability, proactively identifying and resolving issues.

  • Automating infrastructure deployment and configuration management using modern tools and techniques.

  • Writing and maintaining systems / application documentation for technical and non-technical audiences.

  • Managing Customer Reliability Engineering activities driving Application Monitoring, Metrics, Incident Reviews and Long-Term Actions

  • Collaborating with cross-functional teams to improve the reliability and performance of systems., We are very supportive of women in Technology and has been a founding signature for the Tech Talent Charter. Currently 27% of our Technology workforce are women which is much higher than the UK average of 17%. We have the following initiatives in place to support women in technology:

  • Mentoring scheme for women in technology

  • Women's network forum

  • Regularly run events for schools girl about careers in technology to inspire the next generation of girls in tech.

    Experience working in a Site Reliability or DevOps related capacity

  • Have experience with configuration management tools like Ansible, Puppet, or Chef.

  • Experience of Developing and maintain automation tools for infrastructure provisioning, configuration management, and monitoring.

  • Have experience with monitoring and logging tools such as Prometheus, Grafana, or ELK.

  • Be able to collaborate effectively with cross-functional teams and communicate technical concepts to non-technical stakeholders.

  • Have expertise in utilizing infrastructure automation tools such as Terraform and Ansible.

  • Have experience of Kubernetes ideally.

    At Cirium, our goal is to keep the world connected. We are the industry leader in aviation analytics; helping our customers understand the past, present, and predicting what will happen tomorrow. Our mission is to transform the aviation industry by enabling airlines, airports, travel companies, tech giants, aircraft manufacturers, financial institutions and many more accelerate their own digital transformation. You can learn more about Cirium at the link below. https://www.cirium.com


  • About our Team

    You will be joining a collaborative, curious, team of Site Reliability Engineers at all different levels. By joining us you will have the opportunity to share ownership in solving this problem end to end. From exploring new data sources for building features, to design and put in production predictive models and make sure they perform consistently over time.