Cloud Reliability Architect

21 hours ago


Remote Kraków, Czech Republic Kontakt Full time

Kontakt.io is revolutionizing the way care operations are run.

We use AI, RTLS, and EHR data to automate workflows, adapt in real-time, and orchestrate all of care delivery operations. Our platform gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience.

Reliability Engineer Leader Job Description

We're looking for a highly experienced SRE leader to own the reliability, performance, and automation of our cloud-based, real-time platform.

  • Simplify high-traffic, mission-critical platforms with 10+ years of experience in Site Reliability Engineering or Cloud Infrastructure.
  • Proven success scaling complex platforms in SaaS, IoT, or healthcare.
  • Expertise in cloud platforms (AWS), Kubernetes, and distributed systems.
  • Strong background in monitoring, logging, and observability with Prometheus, OpenTelemetry, or similar tools.
  • Hands-on experience with incident management, postmortems, and building resilient systems.
  • Deep knowledge of CI/CD automation, GitOps, and infrastructure as code (Terraform).
  • Mature leadership approach, driving technical strategy while growing and mentoring a high-performance SRE team.
  • Strong understanding of network security, access management, and compliance frameworks (HIPAA, SOC 2).
Responsibilities:
  • Ensure 99.99% uptime across our cloud platform, meeting strict SLAs for healthcare customers.
  • Design and implement self-healing, fault-tolerant systems to prevent failures before they happen.
  • Define SLIs, SLOs, and SLAs, ensuring proactive performance monitoring and incident resolution.
  • Architect and manage scalable cloud infrastructure (AWS) for massive real-time data processing.
  • Optimize containerized environments (Kubernetes, Docker) to support multi-region deployments.
  • Lead the adoption of infrastructure as code (Terraform) to fully automate infrastructure management.
  • Build and refine a world-class monitoring, alerting, and logging system using Prometheus, Grafana, OpenTelemetry, and Datadog.
  • Lead incident response and on-call operations, reducing mean time to detection (MTTD) and mean time to resolution (MTTR).
  • Conduct blameless postmortems and continuously improve system resilience.
  • Reduce manual intervention through automated deployment, scaling, and failover mechanisms.
  • Partner with Security & Compliance teams to ensure infrastructure meets HIPAA and SOC 2 standards.
  • Lead disaster recovery and business continuity planning to ensure critical healthcare services are always available.
  • Drive technical strategy and roadmap for scalability, monitoring, and reliability engineering.
  • Collaborate with Product, Engineering, and Infrastructure teams to align SRE initiatives with business priorities.
Requirements:
  • Python expertise.
  • AWS experience.
  • Kubernetes knowledge.
  • SaaS and IoT experience.
  • Terraform skills.
  • Prometheus proficiency.
  • CI/CD understanding.
  • IoC familiarity.
  • GitOps knowledge.


  • Remote, Kraków, Czech Republic Kontakt Full time

    About Us:Kontakt.io is revolutionizing care operations with its cutting-edge platform.Our innovative solution reduces waste, cuts costs, and improves revenue by optimizing throughput, asset utilization, and staff productivity.We harness AI, RTLS, and EHR data to enable self-learning agents that automate workflows, adapt in real-time, and orchestrate all care...


  • Remote, Czech Republic Chorus One Full time

    We are looking for a highly skilled Cloud Infrastructure Architect to join our team at Chorus One. Our company is one of the leading operators of infrastructure for Proof-of-Stake networks and decentralized protocols.About UsAt Chorus One, we value radical transparency, striving for excellence and improvement while treating each other with kindness and...


  • Remote, Kraków, Czech Republic Kontakt Full time

    About Kontakt.io Kontakt.io is revolutionizing the care operations landscape with its innovative AI-driven healthcare platform. The company’s mission is to reduce waste, cut costs, and improve revenue by enhancing throughput, asset utilization, and staff productivity. By leveraging real-time data from AI, RTLS, and EHR, Kontakt.io enables self-learning...


  • Remote, Warszawa, Cracow, Wrocław, Czech Republic N-iX Full time

    Job Description:At N-iX, we're seeking a skilled Cloud Data Solutions Architect to join our team. As a seasoned professional in data engineering, you'll be responsible for designing and implementing robust cloud-based data solutions using Azure ecosystems and modern data engineering practices.Key Responsibilities:Develop, optimize, and maintain complex data...


  • Remote, Kraków, Czech Republic Kontakt Full time

    10+ years of experience in Site Reliability Engineering or Cloud Infrastructure.Proven success scaling high-traffic, mission-critical platforms in SaaS, IoT, or healthcare.Deep expertise in cloud platforms (AWS), Kubernetes, and distributed systems.Strong background in monitoring, logging, and observability with Prometheus, OpenTelemetry, or similar...


  • Remote, Kraków, Czech Republic Kontakt.io Full time

    10+ years of experience in Site Reliability Engineering or Cloud Infrastructure. Proven success scaling high-traffic, mission-critical platforms in SaaS, IoT, or healthcare. Deep expertise in cloud platforms (AWS), Kubernetes, and distributed systems. Strong background in monitoring, logging, and observability with Prometheus, OpenTelemetry, or similar...


  • Remote, Czech Republic Link Group Full time

    Company OverviewWe are a leading provider of cloud-based solutions for the industrial insurance industry, utilizing cutting-edge Java and cloud technologies in an international environment.Job DescriptionWe are seeking an experienced Java Developer with German proficiency (C1) to join our team in developing innovative cloud-based solutions. As a key member...


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic Xebia sp. z o.o. Full time

    Company OverviewXebia sp. z o.o. is a renowned digital solutions provider with nearly two decades of experience serving clients from diverse industries globally. Our partnership with leading cloud providers such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP) enables us to deliver innovative, cloud-based solutions.Skill...


  • Remote, Warszawa, Kraków, Wrocław, Poznań, Katowice, Czech Republic Deloitte Full time

    Key Requirements: Bachelor's degree in Computer Science, Software Engineering, or a related field Minimum of 5 years of professional experience in cloud solutions architecture, At least 3 years focused on Microsoft Azure Strong knowledge and hands-on experience with Azure services, including PaaS, IaaS, and SaaS solutions Extensive experience in developing...


  • Remote, Kraków, Wrocław, Warszawa, Czech Republic N-iX Full time

    Job DescriptionThe role of a Senior Data Solutions Architect at N-iX involves designing and implementing scalable data pipelines using Python.In this position, you will work collaboratively with an international team to develop efficient data solutions.Your primary responsibilities will include:Developing and maintaining smart documentation for process...


  • Kraków, Czech Republic Splunk Services Sp. z o.o. Full time

    Must-have Qualifications Cloud experience. Knowledge of instance management and storage, as well as an understanding of regional centers, availability zones, and HA strategies. Proven experience in at least one of the major CSPs (AWS, GCP, Azure) is required. Infrastructure as code experience. You are proficient with infrastructure as code solutions, such...


  • Remote, Warsaw, Czech Republic Ework Group Full time

    Higher education level within IT or related is preferred  Min. 6 years of experience in IT sector specializing in cloud architecture based on Azure in the large, enterprise environment  Hands-on experience with automation and IaC tools: Powershell, Azure CLI, Azure Bicep  Working on migration workloads between VMWare and Azure Native VMs  Ability to be...


  • Remote, Kraków, Czech Republic Kontakt Full time

    3+ years of experience as an SREStrong expertise in Kubernetes, Docker, and container orchestration.Experience managing cloud-native environments (AWS).Experience with event-driven architectures, Kafka, or real-time data streaming.Knowledge of machine learning infrastructure.Previous experience in healthcare, compliance (HIPAA), and highly regulated...


  • Remote, Czech Republic Inhabit Polska Full time

    Skills & Knowledge Ability to multi-task and complete regular duties in a time-efficient manner. Strong skills in Linux Server systems and some Windows experience. Experience with MS SQL/Postgres SQL applications. Experience with Azure and Amazon Cloud Services Experience with Python and PowerShell scripts Experience with deploying, supporting, and...


  • Kraków, Czech Republic HSBC Technology Poland Full time

    What you need to have to succeed in this role Experience with SRE and Azure DevOps Ability to script (Bash/PowerShell, Azure CLI), code (Python, C#, Java), query (SQL, Kusto query language) coupled with experience with software versioning control systems (e.g., GitHub) and CI/CD systems. Programming experience in the following languages: PowerShell,...


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic Xebia sp. z o.o. Full time

    Xebia sp. z o.o. is a leading digital solutions provider with a passion for cloud-based technology.We partner with top cloud providers like Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP) to deliver innovative projects globally.Our mission is to be recognized as the authority in our field of expertise, driven by innovation,...


  • Remote, Warsaw, Czech Republic Ework Group Full time

    Higher education level within IT or similar fields are preferred Over 5 years of experience as SAP Developer with focus on BTP platform Hand-on experience with SAP Fiori, CAP Node framework, Fiori Launchpad, Cloud Foundry and JavaScript/prescriptpt Nice to have experience with OData API for Success Factor Fluency in English – speaking and writing  For...


  • Remote, Warsaw, Czech Republic Ework Group Full time

    Higher education level within IT or similar fields are preferred  Over 5 years of experience as SAP Developer with focus on BTP platform  Hand-on experience with SAP Fiori, CAP Node framework, Fiori Launchpad, Cloud Foundry and JavaScript/prescriptpt  Nice to have experience with OData API for Success Factor  Fluency in English – speaking and...


  • Remote, Czech Republic Link Group Full time

    What You Bring ✅ 6+ years of experience in Oracle ERP Fusion Finance ✅ Lead experience in at least 2 full life cycle implementations + participation in 4 others as a team member ✅ Strong background in functional design, financial processes, and solution architecture ✅ Prior experience with Oracle EBS Finance is a plus ✅ Experience working in...

  • Cloud DevOps Expert

    5 days ago


    Remote, Kraków, Czech Republic Infoniqa Poland Sp. z o.o. Full time

    Job DescriptionWe are seeking a highly motivated and experienced Cloud DevOps Team Lead to drive and optimize our DevOps practices, oversee cloud infrastructure and operations, and lead a team of engineers.About the RoleThe ideal candidate will have a strong hands-on background in DevOps, experience in leading people, and fluency in English. As a Cloud...