Reposting this.
Published January 26, 2026
Hands‑On Senior Specialty Engineering Manager – GenAI Platform Core Services + GPU Infrastructure

I’m hiring a hands-on Generative AI Specialty Senior Engineering Manager to lead GenAI platform core services and GPU infrastructure across our hybrid cloud stack (GCP Vertex AI + Azure ML + on‑prem).

This is a rare role: you’ll own end‑to‑end strategy, engineering execution, and production delivery for secure, scalable, enterprise‑grade GenAI platform capabilities. You’ll also build real production GenAI platforms and operate GPU fleets at scale.

🔥 What you’ll lead

GenAI Platform Core Services
• AI Gateway / API / SDK platform (authN/Z, rate limits, quotas, SLAs, versioning, deprecation)
• Foundational services powering apps, agents, and enterprise workflows
• Multi‑cloud + hybrid runtime consistency (artifacts, toolchains, governance)

GPU Infrastructure (H100/H200)
• Day‑2 ops for GPU environments (capacity, scheduling, isolation, autoscaling, failover)
• Fleet readiness, telemetry, instrumentation, and reliability engineering
• Scaling new clusters and building the roadmap for future GPU estate growth

Observability + Evaluations
• Platforms for tracing, evaluations, signal collection, dashboards, retention/export pipelines
• Unified observability across on‑prem + cloud

Agentic AI Capabilities
• Tool/agent marketplace, workflow designer, execution flows
• Governance metadata (provenance, versioning, registry)
• Scaling adoption across teams + enterprise programs

🎯 What you’ll do
• Manage + grow a team of engineers
• Coach teams directly in the codebase (Python, GenAI infra, MLOps)
• Partner with architects on target GenAI architecture, data strategy, and cloud alignment
• Drive security, stability, and NFR excellence in every release
• Act as escalation partner for high-risk delivery, removing friction + enabling velocity
• Influence roadmap, requirements, and product direction
• Own platform reliability, production readiness, and cross-team technical alignment

🧠 Who thrives here

Leaders who have actually:
• Built and run GPU-backed GenAI platforms
• Operated multi‑cloud AI stacks
• Designed and governed platform services at enterprise scale
• Led senior engineers through complexity, ambiguity, and rapid innovation
Originally posted on LinkedIn