Principal Site Reliability Engineer

3 days ago


Remote, Czech Republic Akamai Full time
  • 12 years of relevant experience and a Bachelor's degree or its equivalent in work experience
  • Possess expertise in Linux internals, deep understanding of hardware and best practices enabling HW features in Linux
  • Possess advanced level experience with the Linux kernel, OS, and optimization of their configurations for KVM/QEMU virtualization
  • Possess expert level experience with designing, developing, and deploying software and infrastructure at scale
  • Have expertise in a DevOps, Development, or SysAdmin role, working with large scale distributed systems
  • Have experience with tools like SaltStack and Ansible for managing infrastructure at scale
  • Have excellent communication and interpersonal skills

Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the forefront of what we do make life better for billions of people, billions of times a day.

As a Principal Site Reliability Engineer in the Virtualization & Host Platforms (VHP) team, you will be at the forefront of Akamai Cloud and Delivery fleet platform hardware and software host technologies. Our team is responsible for all physical and virtual Linux platform software, working closely with hardware teams to define and enable new server builds, and investigating Linux performance issues company-wide.

,[Architecting, developing, testing, and distributing changes to software, services, and tools the VHP team is responsible for., Designing and implementing enhancements to VHP observability infrastructure in order to identify and correct problems before they impact our customers, Developing subject matter expertise in VHP components and mentoring the team, Identifying and implementing automation best practices for existing products and processes, Collaborating with our support, operations and engineering teams to investigate and troubleshoot complex problems, Participating in on-call rotations, guiding restoration and repair of service-impacting issues] Requirements: Python/Golang, Linux, Kernel-based Virtual Machine, KVM, QEMU, SaltStack, Ansible, Python Tools: Jira, GIT, Agile. Additionally: Private healthcare, Small teams, International projects, Home Office Budget, Free coffee, Gym, Canteen, Bike parking, Playroom, Shower, Free parking, In-house trainings, In-house hack days, Modern office, Startup atmosphere, No dress code.

  • Remote - Czech Republic Groupon Full time €90,000 - €120,000 per year

    Groupon is a marketplace where customers discover new experiences and services everyday and local businesses thrive. To date we have worked with over a million merchant partners worldwide, connecting over 16 million customers with deals across various categories. In a world often dominated by e-commerce giants, we stand out as one of the few platforms...


  • Remote, Czech Republic Akamai Full time

    12 years of relevant experience and a Bachelor's degree or its equivalent in work experiencePossess expertise in Linux internals, deep understanding of hardware and best practices enabling HW features in LinuxPossess advanced level experience with the Linux kernel, OS, and optimization of their configurations for KVM/QEMU virtualizationPossess expert level...


  • Remote, Czech Republic Akamai Full time

    12 years of relevant experience and a Bachelor's degree or its equivalent in work experiencePossess expertise in Linux internals, deep understanding of hardware and best practices enabling HW features in LinuxPossess advanced level experience with the Linux kernel, OS, and optimization of their configurations for KVM/QEMU virtualizationPossess expert level...


  • Remote, Czech Republic Akamai Full time

    Have in-depth understanding of computer networking concepts, Security concepts, Unix/Linux internals, distributed systems, and systems design.Have professional experience in a Site Reliability, Development, or Systems Engineering role, with large scale distributed systemsDemonstrate experience with programming or scripting languages such as Python or...


  • Remote, Czech Republic Akamai Full time

    Have in-depth understanding of computer networking concepts, Security concepts, Unix/Linux internals, distributed systems, and systems design. Have professional experience in a Site Reliability, Development, or Systems Engineering role, with large scale distributed systems Demonstrate experience with programming or scripting languages such as Python or...


  • Remote, Czech Republic Akamai Full time

    Have 5 years of relevant experience and a Bachelor's Degree in Computer Science or its equivalentPossess expert level experience in a DevOps, Development, or SysAdmin role working with large scale distributed systemsHave experience with building tools for automation and infrastructure at scale(python/go, terraform, saltstack, jenkins)Be able to work in...


  • Remote, Czech Republic Akamai Full time

    Have 5 years of relevant experience and a Bachelor's Degree in Computer Science or its equivalentPossess expert level experience in a DevOps, Development, or SysAdmin role working with large scale distributed systemsHave experience with building tools for automation and infrastructure at scale(python/go, terraform, saltstack, jenkins)Be able to work in...


  • Remote, Czech Republic Akamai Full time

    Have relevant experience and a Bachelor's diploma in Computer Science or its equivalentPossess expert level experience in a SysAdmin (Linux/Unix Administration), DevOps or Software engineering role, working with large scale distributed systemsPossess at least one programming language (Python/Golang) and configuration management with...


  • Remote, Czech Republic Akamai Full time

    Have 5 years of relevant experience and a Bachelor's Degree in Computer Science or its equivalent Possess expert level experience in a DevOps, Development, or SysAdmin role working with large scale distributed systems Have experience with building tools for automation and infrastructure at scale(python/go, terraform, saltstack, jenkins) Be able to work in...


  • Remote, Czech Republic Akamai Full time

    Have relevant experience and a Bachelor's diploma in Computer Science or its equivalent Possess expert level experience in a SysAdmin (Linux/Unix Administration), DevOps or Software engineering role, working with large scale distributed systems Possess at least one programming language (Python/Golang) and configuration management with...