Langsung ke konten utama
Kembali ke Lowongan

Senior Product Architect (Reliability & Ops)

Bangun platform operasi canggih untuk infrastruktur digital global

Tentukan visi dan strategi untuk domain operasional Equinix, termasuk observasi, manajemen insiden, dan sistem AI untuk memastikan operasional yang andal dan skalabel

Kenapa Menarik?

Bekerja di tim yang membentuk masa depan infrastruktur digital global

Skills Wajib

Product ArchitectureReliability EngineeringAI OpsObservability

Keywords

Senior Product ArchitectReliability EngineeringAI OpsObservabilityEquinixDigital Infrastructure
Lihat Deskripsi Asli dari The Muse

Deskripsi asli dari The Muse

Who are we? Equinix is the world's digital infrastructure company®, shortening the path to connectivity to enable the innovations that enrich our work, life and planet. A place where bold ideas are welcomed, human connection is valued, and everyone has the opportunity to shape their future. A career at Equinix means being at the center of shaping what comes next and amplifying customer value through innovation and impact. You'll work across teams, influence key decisions, and help shape the path forward. You'll find belonging, purpose, and a team that welcomes you-because when you feel valued, you're empowered to do your best work. Job Summary Leads the vision, strategy, and execution for the Runtime, Reliability & Operations capability domain within Equinix Engineering Excellence (E3). Owns the product portfolio and long-term roadmap for the operational platforms, reliability engineering capabilities, observability systems, and AI-assisted operational workflows that ensure resilient, scalable, and self-healing service operations across Equinix environments. This leader is responsible for transforming fragmented operational tooling and reactive support models into a unified, intelligent operational platform that improves system reliability, accelerates incident response, reduces operational toil, and enables autonomous operations at scale. The Runtime, Reliability & Operations domain is responsible for capabilities spanning observability, incident management, operational telemetry, reliability automation, service health intelligence, operational workflows, resilience engineering, AI Ops, and self-healing operational systems. Acts as the single-threaded product owner for the capability domain strategy, executive inspection narrative, investment priorities, operational maturity roadmap, and adoption outcomes across engineering, SRE, infrastructure, operations, and support organizations. The role requires balancing operational rigor and reliability engineering discipline with developer productivity, automation, scalability, and AI-native operational transformation. Responsibilities Capability Domain Strategy & Vision Defines and evolves the long-term vision, operating model, and roadmap for the Runtime, Reliability & Operations capability domain, including: Observability platforms and telemetry pipelines Incident, problem, and operational workflow automation Service health intelligence and operational analytics Reliability engineering capabilities and resilience frameworks AI Ops and event correlation systems Automated remediation and self-healing operations Operational runbooks, diagnostics, and recovery orchestration Integrated alerting, ownership, and escalation systems Synthetic monitoring and behavioral validation frameworks Runtime operational governance and operational readiness standards Reliability telemetry, SLO/SLA management, and operational reporting Establishes strategic direction aligned to Equinix reliability, operational scalability, resiliency, customer experience, and engineering productivity goals Product Portfolio Ownership Owns a portfolio of operational and reliability platform products and capabilities, including roadmap prioritization, sequencing, dependency management, and adoption strategy. Ensures operational capabilities are reusable, scalable, and integrated into engineering and support workflows across the enterprise Executive Inspection Leadership Partners with engineering, infrastructure, SRE, and operations leaders to shape and govern the executive inspection process for the Runtime, Reliability & Operations domain. Drives alignment between operational performance, engineering practices, infrastructure reliability, and business continuity objectives Reliability & Operational Experience Leadership Represents the needs of developers, SREs, operations teams, infrastructure engineers, incident responders, and engineering managers. Partners with Voice of Developer and operational stakeholders to conti

Lamar gratis

Akun gratis · tanpa kartu kredit · Masuk

Pro Rp39rb/bln · lamar tanpa batas + resume AI

Perusahaan
Equinix, Inc
Sumber
The Muse
Tipe Pekerjaan
full time
Lokasi
Worldwide Remote · Remote
Kategori
Engineering
Level
senior
Diposting
20 Mei 2026

Bagikan lowongan ini

Bantu temanmu nemu kerja remote berikutnya.

Lamar gratis

Akun gratis · tanpa kartu kredit · Masuk

Pro Rp39rb/bln · lamar tanpa batas + resume AI