
Site Reliability Engineer @ Antal
1 day ago
- 5+ years of experience in supporting or developing distributed systems (Java-based environments preferred).
- Hands-on experience with monitoring and logging tools: Grafana, Prometheus, Loki, Splunk, etc.
- Solid understanding of Unix/Linux systems, cloud infrastructure (GCP preferred), and databases (RDBMS).
- Experience with CI/CD tooling, such as Ansible, Jenkins, GitHub Actions, and vulnerability management.
- Familiarity with job scheduling tools (e.g., Control-M or equivalent).
- Strong communication skills and ability to drive technical discussions with multiple support teams.
- Experience working in Agile/Scrum teams.
Site Reliability Engineer
📍 Kraków (Hybrid – minimum 2 days/week in the office)
💼 Employment type: B2B
Are you looking for an opportunity to join a high-impact project in a global financial institution that invests heavily in cloud, AI, and DevOps? We're building a new Site Reliability Engineering (SRE) team in Kraków to support a mission-critical Counterparty Credit Risk (CCR) platform, and we're looking for experienced engineers to join the journey.
As part of this role, you'll contribute to the stability, scalability, and observability of a high-volume, distributed platform operating on both Google Cloud Platform and on-prem infrastructure.
What we offer:
- The chance to build and shape a new SRE team supporting a critical platform for global risk management.
- Work in a modern technology stack: Java, GCP, Apache Beam, Spring Boot, DevOps tooling.
- Hybrid working model with at least 2 days/week in our Kraków office.
- Stable, long-term project with excellent opportunities for growth and learning.
📩 Interested? Apply now and take the next step in your career with a team that’s redefining reliability at a global scale.
Your responsibilities:
- Ensure the reliability and high availability of production systems used in global credit risk management.
- Monitor, detect, and troubleshoot incidents in distributed systems running in cloud and hybrid environments.
- Implement observability tools (Grafana, Prometheus, Loki, etc.) and improve monitoring and alerting strategies.
- Lead root cause analysis (RCA) and post-incident reviews to improve resilience and operational efficiency.
- Collaborate with developers, DevOps engineers, and global support teams to implement SRE best practices.
- Contribute to CI/CD automation, deployment pipelines, and security/vulnerability remediation.
Requirements: Java, Grafana, Prometheus, Loki, Splunk, Unix, Linux, Cloud infrastructure, RDBMS, CI/CD, Ansible, Jenkins, GitHub Actions, GCP, Control-M
-
Site Reliability Engineer @
2 weeks ago
Kraków, Lesser Poland, Poland | Antal | Full time
What you need to succeed in this role: 5+ years of experience in supporting or developing distributed systems (Java-based environments preferred). Hands-on experience with monitoring and logging tools: Grafana, Prometheus, Loki, Splunk, etc. Solid understanding of Unix/Linux systems, cloud infrastructure (GCP preferred), and databases (RDBMS). Experience with...
-
Kubernetes Site Reliability Engineer @
2 weeks ago
Kraków, Lesser Poland, Poland | ITDS | Full time
You're ideal for this role if you have: 7-10 years of experience working on Kubernetes environments; strong understanding of containerization and orchestration concepts; proficiency in observability and logging tools; familiarity with infrastructure as code practices; strong analytical and problem-solving skills; experience with Unix/Linux system administration; ability...
-
Kraków, Poland | ITDS | Full time
You're ideal for this role if you have: 7-10 years of experience working on Kubernetes environments; strong understanding of containerization and orchestration concepts; proficiency in observability and logging tools; familiarity with infrastructure as code practices; strong analytical and problem-solving skills; experience with Unix/Linux system...
-
Site Reliability Engineering Lead @
1 week ago
Kraków, Lesser Poland, Poland | HSBC Technology Poland | Full time
What you need to have to succeed in this role: Experience in service reliability, production support, or platform operations at scale. Strong analytical mindset and ability to interpret trends across complex data sets. Familiarity with automation/orchestration platforms such as Control-M or IBM SFG. Understanding of ITIL principles (esp. incident, problem, and...
-
Highly Skilled Reliability Engineer Wanted
1 week ago
Kraków, Lesser Poland, Poland | beBeeReliability | Full time | €62,152 - €84,454
Site Reliability Engineering Lead
Key Challenges and Objectives: The role of Site Reliability Engineering Lead involves driving reliability engineering efforts to enhance platform stability and identify patterns of failure. This requires a strong analytical mindset and the ability to interpret complex data sets.
Responsibilities: Lead data-driven reliability...
-
Cloud Engineer @ Antal
1 day ago
Kraków, Poland | Antal | Full time
Bachelor's degree or higher in Computer Science, Engineering, or a related field; 2+ years of hands-on experience in infrastructure/system support, operations, or cloud engineering; strong command of Linux OS, with scripting experience in shell or Python; solid understanding of networking and firewall configurations; excellent troubleshooting, communication and...
-
GCP DevOps Engineer @ Antal
1 day ago
Remote, Kraków, Poland | Antal | Full time
3+ years of experience in DevOps or Cloud Engineering roles; practical experience with CI/CD tools such as Jenkins, GitHub Actions, Nexus and Ansible; hands-on experience with cloud platforms (GCP preferred, AWS or Azure also considered); proficiency in Terraform and Infrastructure as Code (IaC) approaches; strong scripting skills in Bash and Python; very...
-
Site Reliability Engineer Tech Lead @
1 week ago
Warsaw, Kraków, Poland | Link Group | Full time
6+ years of professional experience in DevOps or SRE roles. 1+ year of experience in a technical leadership or team management role. Solid hands-on experience with Amazon Web Services (AWS), Kubernetes, Terraform, and CI/CD tools (GitHub Actions, Jenkins, etc.). Strong scripting skills (e.g., Bash, Python). Proficient in working in Linux-based...
-
Warsaw, Kraków, Poland | beBeeDevOps | Full time | €70,000 - €95,000
Technical Leadership in DevOps and SRE
Overview: The role of a DevOps / Site Reliability Engineering (SRE) Team Lead involves overseeing both the technical development and team leadership aspects of embedded SRE initiatives. In this hybrid position, you will play a key part in driving technology transformation and ensuring the scalability, reliability, and...
-
Senior Software Java Engineer @ Antal
1 day ago
Kraków, Poland | Antal | Full time
Requirements: Strong hands-on experience with Java (core Java, multithreading, performance optimization). Solid knowledge of Linux environments and distributed systems. Background in building low-latency, high-availability platforms. Familiarity with frameworks such as Spring or Apache Storm is a plus. Curiosity and willingness to learn financial...