Jobbeschreibung
For our client we are looking for an Storage Operations Specialist (f/m/d).
Objective 1: Provide Tier-3 operational ownership for Storage Products for Local Production (DE)
Objective 2: Ensure operational readiness for deployments
Objective 3: Monitoring, Incident, Problem and Change Management/ Ensure operational stability and responsiveness for the managed Kubernetes platform
Objective 4: Automation/Reduce operational toil and improve service reliability
Objective 5: Ensure platform operations adhere to security and compliance standards
The contractor must be a senior level professional with proven experience in operations management of private cloud solutions, proficiency in managing storage operations on the platform:
Skills (must-have):
- 5+ years in IT storage operations / service delivery / platform operations with demonstrated leadership in mission-critical environments
- Proven experience implementing/leading Incident, Problem, Change, Release governance in production.
- Experience supporting platform workloads that rely on shared storage services.
- Expertise with storage types: File Storage, Block Storage, Object Storage.
- Expertise with protocols/services: NFS; object storage operations (S3-like concepts).
- Experience with kubernetes storage integration: CSI driver concepts and troubleshooting (PV/PVC lifecycle understanding).
- Virtualization (Storage): Experience operating storage virtualization in enterprise environments.
- Expertise within ITSM: Jira Service Management (JSM), Jira, Confluence.
- Fundamental understanding of core operations processes (incident management, change management, problem management, IT Service Management) as well as SRE concepts
- Experience in gathering operational insights from monitoring or observability including SLI/SLA/SLO management and tracking.
- Hand-on experience in documenting procedures properly and enforcing clear runbooks or playbooks.
- Observability Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Mimir, Loki).
- Familiarity with enterprise DevOps toolchains is a plus (GitLab, JFrog Artifactory, Backstage, Harness).
- Strong understanding of modern platform operations (Kubernetes/containers, automation, observability), sufficient to govern specialists.
- Platform delivery concepts: GitOps and IaC awareness (Terraform/OpenTofu, ArgoCD, Helm) to govern deployment/readiness standards.
Skills (should-have):
- Experience operating in regulated / high-availability industries (banking, telco, public sector, healthcare).
- Experience with SRE practices (SLOs/SLIs, error budgets) and reliability management.
- Experience operating storage services that integrate with Kubernetes platforms.
- Familiarity with IaC-based provisioning and GitOps-driven operational patterns.
|
Must have skills |
Nice to have skills |
|
Startdatum |
Laufzeit |
|
Auslastung |
Remote |
|
Erforderliche Sprachkenntnisse |
Budget |
|
Einsatzorte |