May 9
🔄 Hybrid – Ottawa
• Responsible for planning, implementing, directing, and administering all aspects of the cloud development services in an Agile DevOps environment. • Partner with Lightspeed product development teams to help build, deploy, and maintain tools, platforms, and software services. • Leverage understanding of public cloud architecture to design and implement secure, scalable, and operationally sound solutions. • Form part of the core group of SREs that will build and operate the cloud infrastructure for Telesat's low-earth-orbit broadband service for enterprises.
• A Diploma or Degree in a relevant area of study with a preference for Computer Science together with demonstrated operational network-related experience. • Minimum of 4 years in supporting Development teams in an Engineering environment. • Industry certifications in Azure, Agile and/or Security are an asset. • In-depth knowledge of Kubernetes deployment and support. • Significant experience in scripting (e.g., PowerShell, bash). • Significant experience in automation tooling (e.g., Terraform, Ansible). • Significant experience in code and artifact repos (e.g., Gitlab, GitHub, Artifactory, ACR). • Significant experience in the deployment and support of Linux and Microsoft Windows. • Experience in performance tuning of Linux kernel, memory and filesystem parameters. • Experience in coding and programming (e.g., Python). • Experience in the deployment of environments in Azure. • Working technical knowledge of network systems. • Working technical knowledge of systems software, protocols and standards including Active Directory. • Working knowledge of security principles and securing systems. • Excellent written and oral communication skills. • Excellent problem-solving skills. • Strong interpersonal and organizational skills. • Ability to speak effectively before groups of internal employees, communicate technical information, create, and deliver presentations and information sessions to both technical and nontechnical personnel. • Demonstrated experience in applying technical expertise and in-depth evaluation to solve complex problems in own area of expertise. • Bilingual (English/French) is an asset.
• Testing, selection and implementation of technology and tools for SaaS, IaaS and on-prem systems to support the secure development, test, and release of internal code. • A part of Agile development teams to deliver an end-to-end automation of deployment, monitoring, and infrastructure management in both cloud and on-prem environments. • Responsible for defining and implementing the build, deployment, and monitoring standards. • Build and deployment of security utilities and tools for internal use. • Automation of DevOps processes, tooling and test suites as required. • Monitor system operations and act to troubleshoot, diagnose and provide solutions to ensure availability, capacity, and optimal performance. • Embed and intensely collaborate with Development teams on DevSecOps processes and best practices. • Collaborate with the Cyber team to ensure adherence to best practices with overall code and systems security, as well as document risks to tools and systems. • Work closely with product owners to align product security needs and remediate security flaws. • Collaborate with the QA and IVV teams to ensure test cases and metrics are automated and provide appropriate metrics and logging. • Respond to complex issues and major incidents. • Work with other operational teams to ensure issues are tracked and closed in a timely manner. • Mentor and train other staff on more complex and/or critical procedures and items pertaining to DevOps and SRE. • Document, track, and monitor problems to ensure timely resolution. • Document operating processes, runbooks and build books. • Automate routine operational items. • Installation and customization of operating systems and tools to support the business. • Provide support and guidance to development teams and related lab environments. • Assist in audits and forensics on artifacts collected during a security incident response. • Stay up to date with current security trends and evolution of cloud systems and tooling. • Perform occasional after-hours maintenance on production systems. • Incident on-call rotation as required. • Day-to-day operational support.
Apply Now