Tasks
The Site Reliability Engineering department operates all IONOS Cloud IaaS and PaaS services. We are a team of highly skilled engineers dedicated to solving complex problems.
- Monitor system performance (uptime, latency, error rates) and lead 24/7 incident response, aiming for 85% first-time resolution.
- Plan and execute seamless software/hardware deployments across multiple datacenters.
- Conduct regular disaster recovery drills and improve runbooks, alerts, and monitoring thresholds.
- Research, evaluate, and recommend solutions for improving reliability, availability, performance, and security.
- Automate repetitive tasks to improve efficiency.
- Provide level 2 support and direct customer contact.
Qualifications
- Proficient in Linux system administration, with strong troubleshooting skills.
- Experienced with virtualized environments, including Qemu/KVM, OpenStack, Proxmox, and Kubernetes (K8s).
- Knowledge of configuration management tools such as SaltStack or Ansible, and monitoring tools like Prometheus, Loki, and Grafana.
- Experience with code management, including merge conflicts, feature branches, merge requests, and CI/CD processes.
- Preferred: experience with Ceph and software-defined networking.
- Proficiency in English and German (B2+ level).
Nice to have:
- Experience with software engineering best practices, including code reviews, build processes, packaging, and testing.
- Familiarity with the ITIL framework.
Location: Berlin
Note: Candidates must undergo a security check at the end of the application process. Your consent will be requested during the process.
Benefits
- Hybrid working model with home office options.
- Flexible working hours through trust-based scheduling.
- Subsidized canteen and free drinks at some locations.
- Modern office space with excellent transport connections.
- Employee discounts for activities and products.
- Employee events such as summer and winter parties, workshops.
- Training and development opportunities.
- Health offers, including sports and health courses.
#J-18808-Ljbffr
Site Reliability Engineer (f/m/d) Arbeitgeber: 1&1 IONOS SE
IONOS ist ein hervorragender Arbeitgeber, der seinen Mitarbeitern in Berlin ein dynamisches und unterstützendes Arbeitsumfeld bietet. Mit einem hybriden Arbeitsmodell, flexiblen Arbeitszeiten und umfangreichen Schulungs- und Entwicklungsmöglichkeiten fördert das Unternehmen die persönliche und berufliche Weiterentwicklung seiner Mitarbeiter. Zudem profitieren die Angestellten von modernen Büros, einer subventionierten Kantine und regelmäßigen Teamevents, die eine positive Unternehmenskultur und ein starkes Gemeinschaftsgefühl schaffen.

Kontaktperson:
1&1 IONOS SE HR Team