At a glance
- Tasks: Design and maintain reliable infrastructure for scalable services using cloud and on-premise solutions.
- Employer: Join FACT-Finder, the European leader in eCommerce product discovery and search.
- Benefits: Enjoy flexible working hours, state-of-the-art equipment, and personal development support.
- Why this job: Be part of a diverse team in a scale-up company with a market-leading product.
- Desired qualifications: 5+ years in IT, expertise in Linux, scripting, and containerization technologies required.
- Other information: Remote work available within Germany; regular feedback rounds to support your growth.
The expected salary is between €43,200 and €72,000 per year.
Your mission
Design, build, and maintain our infrastructure and tools to allow for the highly reliable and scalable deployment of services and applications, incorporating both cloud-based and on-premise solutions.
Implement comprehensive monitoring and observability frameworks to detect and resolve issues proactively, using tools like Prometheus, Grafana, and Zabbix for system health and performance metrics.
Develop and manage incident response protocols, including on-call rotations, incident analysis, and postmortems, to ensure continuous improvement in system reliability and performance.
Automate infrastructure and workflows using Infrastructure as Code (IaC) tools like Ansible.
Optimize system performance through regular performance tuning, capacity planning, and reliability experiments that identify and mitigate potential points of failure.
Collaborate with development teams to advocate for reliable and scalable practices throughout the software development life cycle, and assist in the design and review of new systems and major changes.
Your profile
- 5+ years of experience in IT with a focus on system administration and automation.
- Expertise in Linux system administration and in using Infrastructure-as-Code tools like Ansible.
- Strong knowledge of scripting and programming in Bash and Python.
- Experience with containerization technologies (Docker) and orchestration tools (e.g., Docker Swarm or Kubernetes).
- Experience running demanding Java applications in production, with an understanding of the JVM and Java memory management.
- Hands-on data center experience, from cabling and server racking up to and including data center design.
- Strong analytical and problem-solving skills, with experience troubleshooting complex issues detected and diagnosed with monitoring tools.
- Effective communication and collaboration abilities, essential for working across teams and with stakeholders.
- Fluent in English and German.
THE JOY OF WORKING WITH US
- Scale-up company with a market-leading product.
- Open culture with diverse international teams.
- Flexible working hours.
- State-of-the-art equipment.
- Personal development support, e.g. access to the learning platform Udemy.
- Regular feedback rounds.
Job Location
Remote within Germany.
About us
FactFinder is the European leader in eCommerce product discovery and search. Using authentic intelligence, the unique combination of artificial and human intelligence, FactFinder understands every shopper's intent from the first click, increasing conversions and boosting revenues by over 30%. For over two decades, FactFinder has been trusted to support billions of search queries a year for thousands of B2B and B2C brands including Intersport, White Stuff, OBI, Stihl, and MyTheresa. FactFinder is headquartered in Germany, with offices in Berlin, London and Stockholm. Visit www.fact-finder.com for more information.
Site Reliability Engineer (m/f/d) Employer: FACT GmbH
Contact person:
FACT GmbH HR Team
StudySmarter application tips 🤫
How to get the job: Site Reliability Engineer (m/f/d)
✨Tip Number 1
Make sure to showcase your experience with Infrastructure as Code tools like Ansible during any discussions. Highlight specific projects where you've successfully automated workflows, as this is a key requirement for the role.
✨Tip Number 2
Familiarize yourself with the monitoring tools mentioned in the job description, such as Prometheus and Grafana. Being able to discuss how you've used these tools to proactively resolve issues will set you apart from other candidates.
✨Tip Number 3
Prepare to talk about your experience with containerization technologies like Docker and orchestration tools. Share examples of how you've managed demanding applications in production, especially focusing on Java applications and JVM management.
✨Tip Number 4
Since effective communication is crucial for this role, think of instances where you've collaborated with development teams to improve system reliability. Be ready to discuss how you advocate for best practices in a cross-functional environment.
These skills make you a top candidate for the position: Site Reliability Engineer (m/f/d)
Tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your 5+ years of experience in IT, focusing on system administration and automation. Emphasize your expertise in Linux, Infrastructure as Code tools like Ansible, and any relevant scripting or programming skills.
Craft a Strong Cover Letter: In your cover letter, express your passion for site reliability engineering. Mention specific experiences where you've implemented monitoring frameworks or automated workflows, and how these contributed to system reliability.
Showcase Relevant Projects: If you have worked on projects involving containerization technologies like Docker or orchestration tools such as Kubernetes, be sure to include these in your application. Highlight your role and the impact of your contributions.
Highlight Communication Skills: Since effective communication and collaboration are key for this role, provide examples in your application that demonstrate your ability to work across teams and with stakeholders, especially in troubleshooting complex issues.
How to prepare for an interview at FACT GmbH
✨Showcase Your Technical Skills
Be prepared to discuss your experience with Linux system administration, Infrastructure as Code tools like Ansible, and containerization technologies such as Docker. Highlight specific projects where you've implemented these skills.
✨Demonstrate Problem-Solving Abilities
Expect questions that assess your analytical and troubleshooting skills. Prepare examples of complex issues you've resolved, particularly those involving monitoring tools like Prometheus or Grafana.
✨Communicate Effectively
Since collaboration is key in this role, practice articulating your thoughts clearly. Be ready to discuss how you've worked with development teams to advocate for reliable and scalable practices.
✨Prepare for Scenario-Based Questions
Anticipate scenario-based questions related to incident response protocols and performance tuning. Think about how you would handle specific situations and be ready to explain your thought process.