Site Reliability Engineer

 

Description:

The role will be in charge of defining, documenting and operating production environments. Some of the main responsibilities include the creation and setup of new product/service environments, DevOps provisioning, development and adoption of new OPS tools and methodologies, day-to-day governance of the production offerings, continuous improvement of the OPS process, collaboration on pipeline automation with the development DevOps engineers, periodic maintenance, and day-to-day troubleshooting of support / helpdesk / business escalations


We are looking for a Devops expert with strong Operations related experience.

What you will do:

  • Documentation of processes in OPS production environments
  • Continuous improvement of the OPS production processes and increasing the performance of OPS KPIs, including the development and/or adoption of tools and technologies
  • Collaborating with the development teams on DevOps Automation development and adoption of pipelines for OPS production environments
  • Provisioning of pre-prod (with client data) and production environments. Collaborating with security and architecture teams
  • Continuous proactive governance, and governance processes in production environments
  • Troubleshooting of support / helpdesk / business escalations
  • Periodic maintenance

What you bring to this role:

  • Bachelor Degree in Computing Science or a related field or equivalent Technical Diploma combined with relevant experience.
  • At least 3 years of experience in IT Operations
    • Working in large and complex IT environments.
    • Working with UNIX/Linux/BSD systems and/or Windows server systems as an application or database administrator.
    • Proven experience administering large applications in production grade cloud, with emphasis on Azure (other cloud: AWS, GCP).
    • Experience with configuration management (Chef, Ansible, Puppet).
    • Experience with proactive governance, monitoring, and a continuous operational improvement processes.
    • At least 3 years of experience with DevOps CI/CD development
    • Automating builds, releases, and pipelines, with advantage to knowledge with GitHub Actions (other tools: Jenkins, TravisCI, Azure ADO).
    • Experience with infrastructure as code with, with advantage to knowledge with Terraform (other tools: AWS CloudFormation).
    • Other software development experience, including scripting with Ruby, Python, Bash, PowerShell, and Java.
    • Experience with Git branching and source code management within enterprise team setup.
  • At least 2 years of experience managing teams in the IT and/or software development space.

Organization KPMG
Industry Engineering Jobs
Occupational Category Site Reliability Engineer
Job Location Toronto,Canada
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Intermediate
Experience 3 Years
Posted at 2024-07-08 6:53 am
Expires on 2024-10-16