Reliability Production Engineer

September 5, 2025

Job Description

The Reliability Production Engineer (RPE) provides production support services within the RPE organization. The role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability, for example by reducing manual toil, optimizing monitoring, and making alerting more efficient. The RPE collaborates with global teams to maintain and improve production systems.

Key Responsibilities

  • Provide production support for in-scope systems under the RPE organization
  • Develop automation and tooling to improve reliability and reduce manual tasks
  • Monitor databases and perform performance tuning across platforms including DB2, Greenplum, MongoDB, and Snowflake
  • Create and maintain database scripts (stored procedures, complex SQL) for data analysis and operations
  • Develop and maintain Python and Linux Shell scripts for operational support
  • Troubleshoot containerized environments using Docker and Kubernetes
  • Analyze system metrics and trends using observability tools
  • Collaborate effectively with global teams and communicate clearly in both verbal and written forms
  • Support shift work and participate in on-call rotations to ensure continuous system availability
  • Work from the office at least three days per week, in line with current policy

Required Qualifications

  • Bachelor’s degree in Computer Science or related field
  • 4–5 years of experience with database scripting, monitoring, and performance tuning (DB2, Greenplum, MongoDB, Snowflake)
  • Proficiency in Linux operating systems
  • Experience with Python and Linux Shell scripting
  • Hands-on experience with Docker and Kubernetes, including troubleshooting and use of observability tooling
  • Strong verbal and written communication skills for global collaboration
  • Flexibility to work shifts and fulfill on-call responsibilities

Preferred Qualifications

  • Experience in financial services or investment banking environments
  • Familiarity with advanced monitoring and alerting tools such as Splunk, AppDynamics, or Elasticsearch
  • Knowledge of development tools including Git and Jenkins
  • Agile, DevOps, or SRE mindset and related tooling experience
  • Understanding of cloud technologies and their applications in reliability engineering

Certifications (if any)

  • No specific certifications required, though relevant certifications in DevOps, Cloud, or SRE are a plus

Company

Compunnel, Inc.

Location

Montreal

Country

Canada

Salary

100.000

URL

https://en-ca.whatjobs.com/coopob__cpl___291_2609573__3337?utm_source=3337&utm_medium=feed&keyword=Reliability-Production-Engineer&location=Montreal&geoID=3824