Site Reliability Engineer

 

Description:

The role of the Cloud Application Engineer/Site Reliability Engineer is to build solutions that enhance the availability, performance, and stability of OpenText services, and to automate away repetitive work, as part of a cloud DevOps organization.

This role would be a great fit for someone with creative and innovative problem-solving skills. You will develop and implement solutions that operate at scale. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers.

 

You are great at:

  • Collaborate with Agile squads and developers, sustain teams, and business partners, contributing significantly to specifications that resolve problems and address enhancement needs, with a focus on logging, monitoring, and metrics for operational readiness.
  • Use technical knowledge, creativity, and company practices to reduce the occurrence of incidents by developing proactive monitoring and alerting.
  • Respond to incidents in accordance with Service Level Agreements (SLAs).
  • Provide continuous feedback to development teams on system stability, defect analysis, and system enhancements.
  • Develop runbooks and patterns to sustain applications in a production environment.
  • Participate in technical discussions and drive the transition to sustain activities with the development teams.
  • Work with IT business and development partners to gather input for new capabilities to display, monitor, and alert on key performance indicators (KPIs) by tracking business transactions (BT) in real time.
  • Partner with application owners to develop creative and effective solutions that mitigate risk and remediate audit issues, providing high-quality, timely responses.
  • Take ownership and accountability for the incident resolution process, participating in RCA and SWAT investigations.
  • Plan the validation and verification of changes deployed by infrastructure and development teams.
  • Participate in day-to-day, real-time, advanced technical support and troubleshooting of issues reported by the user/customer base.
  • Provide guidance in resolving performance-related issues and design solutions for technical issues faced by the application.
  • Establish and maintain good relationships with team members, Product Development, Product Management, Customer Service, Client Management, and other cross-functional teams.
  • Participate in training and information sharing activities.
  • Act as backup for other team members when necessary.
  • Work rotating shifts as needed.
  • Participate in the on-call rotation, as 7x24x365 support is required.

Organization: OpenText
Industry: Engineering Jobs
Occupational Category: Site Reliability Engineer
Job Location: Mississauga, Canada
Shift Type: Morning
Job Type: Full Time
Gender: No Preference
Career Level: Intermediate
Experience: 2 Years
Posted at: 2022-11-13 2:29 pm
Expires on: Expired