Job Description
Description
Get AI-powered advice on this job and more exclusive features.
We are seeking an experienced Site Reliability Engineer (SRE) / Infrastructure Engineer to join our Platform Engineering team. This role requires a hands-on technologist with deep expertise in cloud infrastructure, Kubernetes, DevOps, and SRE practices to ensure the performance, availability, scalability, and security of mission-critical platforms.
About the Role
We are seeking an experienced Site Reliability Engineer (SRE) / Infrastructure Engineer to join our Platform Engineering team. This role requires a hands-on technologist with deep expertise in cloud infrastructure, Kubernetes, DevOps, and SRE practices to ensure the performance, availability, scalability, and security of mission-critical platforms.
Key Responsibilities
- Design, implement, and maintain highly available, scalable, and secure infrastructure across AWS, Azure, and GCP.
- Build and automate CI/CD pipelines using Azure DevOps, Jenkins, Ansible Tower, and Terraform .
- Manage containerized applications using Kubernetes, Docker, AKS, EKS, and GKE
- Develop and enforce SRE best practices including monitoring, incident response, capacity planning, and reliability automation.
- Implement Infrastructure as Code (IaC) using Terraform, Bicep, ARM templates, and CloudFormation.
- Collaborate with development, QA, and security teams to integrate DevSecOps pipelines.
- Use observability tools (e.g., ELK, Kibana, ) for metrics, logging, and alerting.
- Manage machine identity and key lifecycle with Venafi , TLS, and PKI-based automation.
- Lead root cause analysis and provide reliable fixes for complex infrastructure issues.
- Participate in architectural reviews, security audits, and disaster recovery planning.
Qualifications
Must-Have:
- 10+ years in infrastructure, DevOps, or SRE roles within enterprise-grade environments.
- Proven experience with AWS, Azure, and GCP cloud services.
- Hands-on expertise in Kubernetes (AKS/EKS/GKE) , Helm, Docker.
- Strong scripting skills in Python, Bash, PowerShell .
- Experience with Terraform, Ansible .
- Familiarity with CI/CD tools : Jenkins, Azure DevOps, Octopus, GitHub Actions.
- In-depth knowledge of Linux, Windows Server , and hybrid cloud environments.
- Solid understanding of networking, load balancing (NGINX, F5, ELB), and firewalls .
- Knowledge of security best practices and tools (e.g., IAM, TLS, PKI, SIEM, WAF, DAST/SAST).
Nice-to-Have:
- Experience with Apache airflow, snowflake , and big data pipelines.
- Familiarity with SRE maturity models and service level objectives (SLOs, SLIs, SLAs).
Seniority level
-
Seniority level
Mid-Senior level
Employment type
-
Employment type
Full-time
Job function
-
Job function
Engineering and Information Technology
-
Industries
IT Services and IT Consulting
Referrals increase your chances of interviewing at iVedha Inc. by 2x
Get notified about new Site Reliability Engineer jobs in Canada .
Milton, Ontario, Canada $120,000.00-$120,000.00 2 weeks ago
Canada $133,900.00-$173,000.00 2 weeks ago
Site Reliability Engineer | North America | Canada | Europe | Fully Remote
Canada CA$60,500.00-CA$108,100.00 1 week ago
Greater Calgary Metropolitan Area 2 hours ago
Senior Site Reliability Engineer (Remote)
Canada $180,000.00-$230,000.00 9 months ago
Sr. Site Reliability Engineer (Remote NA – East)
Full-Stack Software Engineer (New graduates: Canada)
Canada CA$80,000.00-CA$120,000.00 1 month ago
Canada CA$107,000.00-CA$122,000.00 9 hours ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Company
iVedha Inc.
Location
, , Canada
Country
Canada
Salary
100.000
URL