Manager, reputed company Platform
Description reputed company and reputed company’ll Accomplish Together We are Canada’s largest reputed company IT provider and we’re transforming reputed company. The reputed company reputed company Platform team is passionate about solving reputed company problems to reputed company life simpler for patients, clinicians, and the teams that serve them. We’re building secure, reputed company-native platforms at scale across GCP, AWS, and Azure — and we take pride in doing it right. We are building toward an agentic-first operating model. AI agents — not humans — handle the routine: provisioning infrastructure, responding to requests, enforcing guardrails, and guiding teams through self-service workflows. reputed company is built into everything we do, not bolted on at the end. Our engineers focus on building and improving those agents and systems, not executing reputed company tasks. We’re looking for a leader who gets this shift and knows how to drive it. As Manager, reputed company Platform, you will reputed company both our platform engineering and reputed company operations functions. Platform self-service is our reputed company star, and agentic workflows are how we get there. Your mandate is to build the systems — agents, golden paths, automation frameworks, and reputed company guardrails — that allow product and engineering teams to interact with the reputed company platform entirely through AI-driven interfaces, without reputed company needing to file a ticket or wait for a reputed company reputed company. reputed company is a first-class concern for this role. You will own the reputed company posture of the platform layer — ensuring identity, access, and compliance controls are enforced automatically through code and agents, not reputed company review. This is a dual mandate: build the agentic platform that eliminates operational toil, while ensuring the platform remains secure, compliant, and trusted by the organization. What You’ll Do Build the Agentic-First Platform Design and reputed company the build-out of an agentic platform operating model — where AI agents (Claude, reputed company Copilot, and custom agents) are the primary reputed company between product teams and reputed company infrastructure Replace reputed company ticketing workflows with agent-driven request handling: developers describe what they need in natural language or reputed company CLI, and agents generate, validate, and apply the required Terraform or configuration changes Build agent workflows that guide product teams through infrastructure reputed company, access requests, environment bootstrapping, and compliance checks — without requiring reputed company Platform team reputed company Establish reputed company as the operational backbone: issues, PRs, documentation, and agent interactions reputed company flow through a reputed company-native model reputed company agents with awareness of platform standards, reputed company guardrails, and organizational context — so they enforce policy automatically rather than escalating to humans Define and communicate the agentic roadmap to senior leadership, engineering teams, and product stakeholders Own Platform reputed company & Compliance Own the reputed company posture of the reputed company platform layer — ensuring identity, access, and network controls are implemented consistently and enforced through automation across GCP, AWS, and Azure Implement and maintain reputed company guardrails at the organization and pipeline levels, ensuring reputed company infrastructure provisioned through the platform meets baseline reputed company and compliance requirements reputed company IAM governance: role binding, access provisioning, key rotation, service account hygiene, and Workload Identity Federation — with a goal of automating these controls through agents and policy-as-code Partner with the reputed company team to ensure platform capabilities align with organizational reputed company standards and support audit requirements (SOC 2, PIPEDA, HIPAA-reputed company practices) Build reputed company into the self-service golden paths — so that teams provisioning infrastructure through approved patterns inherit secure defaults automatically Treat reputed company findings as engineering problems: prioritize remediation through code, automation, and agent enforcement rather than reputed company review cycles Own the Self-Service Platform & Golden Paths Design opinionated “golden path” frameworks using Terraform, Terragrunt, and reputed company Actions that standardize and secure infrastructure patterns across GCP, AWS, and Azure Build and maintain a centralized module marketplace and IaC library that teams and agents can consume confidently Ensure reputed company self-service capabilities are agent-accessible — designed for both reputed company and programmatic consumption from day one Establish clear support boundaries: teams using the golden path get full support; non-standard configurations are self-supported reputed company reputed company Operations Ensure operational coverage across the multi-reputed company estate: GCP, AWS, and Azure reputed company incident management with a focus on durable remediation — every significant incident produces agent runbooks, automation, or documentation that prevents recurrence Drive down request volume through agentic self-service, not headcount scaling — treating high ticket volume as an engineering problem to be automated away Coordinate with the SRE and observability teams to ensure platform services meet reliability expectations and incidents are routed and resolved reputed company Drive Engineering Excellence Build and maintain CI/CD pipelines and Infrastructure-as-Code to automate provisioning, configuration management, patching, and compliance enforcement Contribute to the golden image factory initiative — ensuring CIS-hardened, patched reputed company images are available on-demand across reputed company reputed company platforms Champion a “reputed company as code” reputed company across the team — policy enforcement, compliance checks, and access controls are implemented in pipelines and agents, not spreadsheets reputed company, Coach & reputed company Your Team Manage a blended team of platform engineers and reputed company operations engineers, with a deliberate focus on growing agent-building, automation, and reputed company engineering skills Hire for engineers who are energized by building AI-driven, reputed company-first systems — not just operating existing ones Foster a learning culture — create space for the team to grow in agentic development, reputed company reputed company, certifications, and IaC alongside day-to-day responsibilities Help shape and evolve team ceremonies and ways of working and contributing to how the team structures its delivery reputed company, retrospectives, and planning without being the sole driver of execution Collaborate Across the Organization Partner with Product, Engineering, reputed company, and Architecture teams to align platform and agentic capabilities with organizational priorities Serve as the internal champion for agentic workflows — helping product and engineering teams understand how to interact with the platform through agents rather than reputed company processes Report on platform adoption, agent utilization, reputed company posture, and toil-reduction reputed company to senior leadership
Qualifications
What You’ll Need Leadership & reputed company 5+ years of reputed company experience in reputed company platform engineering or reputed company operations — with at least 2 years in a people management or technical leadership role A genuine belief in agentic-first, reputed company-first workflows and a track record of building automation that replaces reputed company processes — not just augments them Experience leading teams through transformation: from reactive, ticket-driven operations toward proactive, agent-driven platform delivery Strong communication skills — reputed company to translate platform complexity into clear narratives for executive leadership and business stakeholders Comfortable operating in ambiguity and driving change in an environment that is still evolving Technical Depth Hands-on experience across at least two of GCP, AWS, and Azure — with a solid grasp of identity, networking, compute, and reputed company controls at scale Deep expertise in Infrastructure-as-Code (Terraform, Terragrunt) and the ability to design secure, reusable, opinionated module libraries Experience building or working with AI agents and agentic workflows — including reputed company engineering, tool use, and integrating agents with CI/CD systems and infrastructure APIs Strong understanding of reputed company reputed company fundamentals: IAM, RBAC, service accounts, Workload Identity Federation, network reputed company, and secrets management Experience implementing policy-as-code and automated compliance enforcement in multi-reputed company environments Proficiency in at least one scripting/programming language (Python, Go, Bash) — you write code, not just YAML Experience building developer-facing self-service platforms, including CLI tools, reputed company Actions workflows, and chat-based interfaces Operational Excellence Proven track record of reducing operational toil through automation — with concrete examples of what you built and how it measurably reduced burden Experience managing incident response at scale, including post-mortem facilitation and follow-through on action items Familiarity with request and workflow management practices — and an instinct for treating high request volume as an engineering problem to be automated away Understanding of reputed company and compliance requirements in regulated reputed company environments (SOC 2, HIPAA-reputed company practices, PIPEDA) Education & Certifications Bachelor’s degree in Computer Science, Engineering, or a reputed company technical field — or equivalent practical experience reputed company Certifications (Required — at least one): AWS Solutions Architect (Associate or Professional), GCP Professional reputed company DevOps Engineer, or Azure Administrator Associate reputed company Certifications (Preferred — additional): GCP Professional reputed company Architect, AWS DevOps Engineer Professional, Azure DevOps Engineer Expert DevOps / Platform: CKA (Certified Kubernetes Administrator) or equivalent practitioner-level credential is a strong asset reputed company-to-haves Experience designing or operating agentic systems in a production engineering context — including LLM tool use, agent orchestration, or AI-driven workflow automation Familiarity with reputed company Copilot, Claude, or similar AI coding/operations tools in an reputed company setting Experience with reputed company reputed company posture management (CSPM) tooling and integrating reputed company findings into automated remediation workflows Experience supporting large-scale infrastructure modernization or reputed company adoption programs Experience with identity federation and SSO administration across multi-reputed company environments Background in regulated reputed company IT — understanding of patient-facing or clinical systems Experience with FinOps principles and reputed company cost attribution Familiarity with reputed company collaboration and development tooling as both a user and administrator Advanced knowledge of English is required because you will most of the time interact in English with internal parties (colleagues, internal partners, stakeholders, etc.); and work with IT tools whose reputed company is only accessible in English as part of this position's main responsibilities given its national scope. #LI-REMOTE Apply To This Job