Intermediate Service Reliability Engineer (DevOps)

Ontario - Permanent



Job Description

The Service Reliability Engineer works closely with many of the digital teams to automate their processes and keep production environments in top shape. This position is responsible for code deployments and for ensuring that these deployments remain healthy throughout the development lifecycle.

KEY PERFORMANCE METRICS
•Measure and maintain error budgets and uptime
•Achieve goals set out by the departmental director
•Achieve 100% automated environments


KEY ACCOUNTABILITIES
Functional
•Design, implement and deploy complex cloud-based workloads from initial architecture and design through development, testing, and deployment
•Participate in architectural discussions & requirements gathering to ensure customer success on the cloud platform
•Implement and improve our continuous integration and continuous delivery pipeline
•Expand monitoring coverage and improve alerting and communication practices for system and environment issues
•Keep the team current on SRE tools and practices
•Drive automation and scripting to reduce repeated manual tasks and human error
•Highlight issues/risks to project leads and management team
•Analyze functional, technical and business requirements for projects
•Maintain a supportive, positive, open and honest IT culture
•Build strong and positive relationships with other teams and peers
•Maintain the entire CI suite of tools (Jira, Jira Service Desk, Confluence, Bitbucket, TeamCity, Octopus Deploy, Package Hosts)
•Act as an advocate for the customer by placing them at the forefront of all decision-making and design processes
•Proactively identify and anticipate customer expectations and needs
•Embrace and seek out technology that creates high-tech, high-touch solutions for customers
•Challenge the status quo by consistently identifying areas for improvement, diagnosing issues and working to resolve them
•Participate in on-call rotation


Must Have Skills:

Work Experience / Education / Certifications
•Computer Science degree, other related diploma, or equivalent experience accompanied by formal computer training
•2-5 years of work experience in an IT development operations related field, with an emphasis on web-centric or e-commerce environments preferred

Competencies / Skills / Attributes
•Experience in 24/7 production operations, preferably supporting a highly available environment for an e-commerce, SaaS or cloud service provider
•Systems administration and configuration management automation expertise is a must (for example, Chef)
•Experience with Continuous Integration (TeamCity, Jenkins, Bamboo, …)
•Experience with scripting (PowerShell, Bash, Tcl, …)
•Strong understanding of system and networking concepts and troubleshooting techniques
•Experience with configuration management (Chef, Puppet)
•Experience with deployment automation (Octopus Deploy, Rundeck, …)
•Experience with monitoring, dashboarding and alerting systems; experience with New Relic and PagerDuty is nice to have
•Experience managing and configuring highly available server environments on Linux or Windows
•Detail-oriented, able to concentrate and work quickly
•Ability and enthusiasm to learn new technologies
•Analytical and problem-solving skills
•Excellent communication skills


Details:
Starting: ASAP