In this position:
- You own the reliable day-to-day operation of platform infrastructure services within the AI Business Unit.
- You drive automation across operational workflows, using modern tooling and AI to improve efficiency, scalability, maintainability, and security.
- You support and contribute to the migration of internal applications from Nomad to Kubernetes.
- You collaborate with Infrastructure teams to continuously improve Kubernetes environments.
- You manage and support core technologies such as Qdrant, PostgreSQL, ClickHouse, Redis, Dify, Langfuse, KubeBlocks, and the CloudNativePG (cnpg.io) Postgres operator.
- You build and improve monitoring, alerting, and operational observability standards.
- You enable development teams to design, ship, and operate production-grade, Kubernetes-ready services.
- You improve system reliability, scalability, security, and operational excellence across the AI landscape.
