Platform Engineer
16 hours ago
Job Description
Join our company as we transform and innovate. We are at the forefront of research to deliver innovative health solutions that advance the prevention and treatment of diseases in people and animals. We are currently seeking a Cloud & Infrastructure Technology Staff Engineer to help deliver our Container Platform product. This is also an exciting opportunity to contribute to the development of our broader company's container practice outside our team.
The Platform and Containers as a service product team provide development teams a pre-paved path to run and operate applications via a Platform-as-a-Service model. This enables application product teams by delivering infrastructure solutions and container platforms, primarily through the development of Infrastructure-as-Code. The team also help develops the strategy and works with teams worldwide to govern, grow, and operate container solutions across the company. Our team embraces the regulatory challenges of our industry and drives innovative, secure, and compliant use of containers and container platforms. We operate as a business partner proposing ideas and innovative solutions that enable new organizational capabilities. We collaborate closely with internal and external partners worldwide to mature our ability to utilize Cloud Service Providers. We also impact cluster availability and integrity worldwide via our cloud-based container cluster Platform as a Service (PaaS & CaaS).
As a Platform Engineer, you bring a strong background in cloud infrastructure, networking, PaaS, and automation and a passion for eliminating toil through great engineering. Join our global team to build and evolve automated, self-service platform experiences (think GitOps, policy-as-code, golden paths) so modern application teams can onboard quickly, deploy reliably, and focus on delivering value.
Primary job responsibilities:
- Architect, design, and engineer global platform services—including Tanzu Application Service (Cloud Foundry), Oracle APEX, container platforms, and a multi-region Web Redirect Service
- Lead architecture and engineering of container image management (registry design, signing/provenance, SBOM, retention, geo-replication)
- Define, codify, and socialize best practices for containers and Application-Hosting PaaS (security baselines, multi-tenancy, networking, cost controls) as policy-as-code
- Cultivate an engineering community of practice—share patterns, improve templates/playbooks, and drive inner-source contributions
- Identify capability gaps via telemetry and stakeholder feedback; translate into roadmaps/MVPs and deliver solutions that close the gaps
- Automate and simplify platform maintenance and lifecycle (provisioning, upgrades, patching, backup/DR) using GitOps and CI/CD; reduce manual toil and risk
- Maintain up-to-date cloud-native knowledge; evaluate and introduce fit-for-purpose technologies and practices
- Prioritize workloads and commitments; balance roadmap, BAU operations, and incident response against clear SLAs and timelines
- Partner with product managers and engineering teams to onboard workloads quickly and safely, providing reference architectures and guardrails
- Ensure SDLC and company policy compliance through documentation, reviews, audits, and secure-by-default controls
- Serve as Tier-3 escalation for customer support; lead incident response, post-mortems, and preventive action plans
- Design hybrid/multi-cloud networking: build secure connectivity with Direct Connect/ExpressRoute, PrivateLink/Service Endpoints, segmented VPC/VNet topologies, and enterprise DNS (Route 53/Azure) with strict egress controls
- Drive FinOps & cost efficiency: implement tagging/chargeback, rightsizing & autoscaling, storage lifecycle policies (S3/EFS/FSx), and Savings Plans/Reservations, with dashboards to track unit economics
- Harden data protection & key management: apply envelope encryption with AWS KMS/Azure Key Vault, automate secrets rotation (Secrets Manager/Key Vault), and enforce backup/restore and cross-region DR for RDS/EFS/FSx
- Standardize IaC & GitOps at scale: deliver reusable Terraform/Bicep modules, environment promotion via pipelines, policy-as-code (OPA/Conftest/Checkov), and automated drift detection/remediation across AWS/Azure and container platforms
- Deep AWS proficiency: VPC, Route 53, EC2, ALB/ELB/NLB, S3, RDS, IAM, Lambda, EventBridge, SNS/SES, CloudWatch/CloudTrail, KMS, Secrets Manager, MSK, Kendra, ACM, DRS, EFS, FSx for ONTAP
- Azure experience: Virtual Networks, Resource Groups, Application Gateway, App Service/ASE v3, Autoscaling, Azure Monitor, Key Vault, Secrets Management, Storage Accounts
- Ecosystem & platforms: Broadcom Tanzu Platform, Spectro Cloud (edge/K8s orchestration), Azure ASE v3, Oracle APEX
Requirements:
- BS Degree or equivalent in Computer Science, Computer Engineering, Information Systems, or equivalent experience
- Relevant certification or completion of equivalent program in areas such as Software Development, Computer Science, or Computer Engineering
- Practical experience with container platforms: Kubernetes, Amazon EKS/ECS, Azure AKS; cloud-native tools such as Helm, Cilium/Calico/Flannel, Karpenter, plus container image management (registry design, signing/provenance, SBOMs, retention, geo-replication)
- Direct involvement in architecture, engineering, and operations for Broadcom Tanzu Platform, Azure ASE v3, Oracle APEX, Spectro Cloud, and multi-region platform services (e.g., web redirect services)
- Extensive hands-on work with AWS and Azure across IaaS and networking: VPC/VNet, subnets, security groups/NSGs, NACLs, routing, enterprise DNS (Route 53/Azure DNS), ALB/ELB/NLB, Application Gateway
- Hybrid/multi-cloud networking: Direct Connect / ExpressRoute, PrivateLink / Service Endpoints, segmented VPC/VNet topologies, egress controls and inspection
- Strong understanding of HTTP/S, TCP/IP, DNS, load balancing, TLS/PKI, and foundational routing concepts
- Practical Linux (RHEL/Ubuntu) administration: provisioning/migrations, standardization, performance tuning, and HA clustering
- Proficiency in Python and Bash for automation; develop reusable Ansible playbooks/roles in Ansible Automation Platform
- Infrastructure as Code & GitOps: author reusable Terraform/Bicep modules; environment promotion via GitHub Actions/Azure DevOps/Jenkins/CloudBees; policy-as-code (OPA/Conftest/Checkov); automated drift detection/remediation
- Serverless & integrations: event-driven designs with AWS Lambda; messaging and orchestration via EventBridge, SNS/SES
- Security & compliance: identity and access, encryption at rest/in transit; KMS/Key Vault, Secrets Manager/Key Vault with automated rotation; baseline hardening, vulnerability remediation, and compliance controls
- FinOps: tagging/chargeback, rightsizing & autoscaling, storage lifecycle policies (S3/EFS/FSx), reservations/savings plans, and cost dashboards
- Reliability & DR: architect hybrid DR/BC with defined RTO/RPO, automated failover, backup/restore testing, documented recovery runbooks
- Observability & SRE: create SLIs/SLOs, alerting, and dashboards with Dynatrace, Prometheus, Grafana, Nobl9 (plus ELK/Datadog or equivalents); incident response, post-mortems, and preventive actions
- Ability to troubleshoot across infrastructure, networks, security, and databases; Tier-3 escalation ownership
- Experience designing software/platform solutions with high-quality documentation; ensure SDLC and company policy compliance via reviews and audits
- Proven delivery using Agile/Scrum; prioritize roadmap vs. BAU/operational work against SLAs and timelines
- Workload onboarding & enablement: partner with product managers/engineering to provide reference architectures, guardrails, and safe, rapid onboarding to platform services
- Experience deploying and managing Cloud Foundry or similar PaaS (e.g., Elastic Beanstalk)
- Familiarity with DevOps toolchain: Git, Terraform, Jira, GitHub Actions, Jenkins, CloudBees, Azure DevOps
Nice to haves:
- Possess 4 to 6 years of experience within the IT industry or related fields
- Capable of operating effectively in a matrixed and highly concurrent work environment
- Proven track record of planning and executing projects or experiments, including defining milestones and endpoints
- Experience collaborating with global and diverse teams
- Proficient in using, implementing, or operating Kubernetes or similar container orchestration platforms
- Cloud platforms (AWS): VPC, IAM, EC2, S3, ELB/ALB/NLB, RDS, Route 53, CloudWatch, CloudTrail, EventBridge, SES, SNS, KMS, Secrets Manager, Kendra, Lambda, MSK, ACM, DRS, EFS, FSx for ONTAP
- Cloud platforms (Azure): Virtual Networks, Resource Groups, Application Gateway, App Service & ASE v3, Autoscaling, Azure Monitor, Key Vault, Secrets Management, Storage Accounts
- Containers & platforms: Broadcom Tanzu Platform, Spectro Cloud (edge/K8s orchestration), Azure ASE v3, Oracle APEX
- Networking, security & compliance: Hybrid architecture and network design; secure connectivity; identity & access; encryption; policy-as-code; compliance controls
- Serverless & integration: Event-driven architectures with AWS Lambda; integrations via EventBridge, SNS, SES
- Observability & SRE: Dynatrace, Prometheus, Grafana, Nobl9; SLO/SLI design; alerting; dashboards
- Delivery & operations: End-to-end app design & deployment; troubleshooting; monitoring; continuous improvement; change control; strong analytical & problem-solving skills
- Built CI/CD workflows with Terraform on GitHub Actions (plan/apply, policy checks, environment promotion)
- Operate Azure DevOps pipelines to provision/manage Azure with ARM/Bicep/Terraform
- Develop reusable Ansible playbooks/roles in AAP for repeatable configuration and deployments
- Provision, migrate, and standardize RHEL and Ubuntu across physical/virtual infra, including HA clusters
- Patching & security compliance (baseline hardening, vulnerability remediation, drift control) while tuning performance
- Architect hybrid DR/BC with defined RTO/RPO, automated failover, backup/restore testing, and documented recovery runbooks
- Expertise with Amazon Web Services, including VPC, Route53, EC2, ALB, S3, RDS, IAM, and others
- Hold relevant professional certifications (e.g., AWS, Azure)
- Skilled in debugging software and scripting errors
- Experience in Go programming language
- Expect with troubleshooting infrastructure, network, database, or security-related issues
- Background in delivering products and features utilizing Agile/Scrum methodologies
- Competent in DevOps tools such as Git, Terraform, Jira, Jenkins, CloudBees, Github Actions, among others
- Adept at System Development Lifecycle (SDLC) documentation
- Hands-on experience with Oracle APEX platform
- Exposure to resiliency and observability platforms
What we offer:
- Exciting work in a great team, global projects, international environment
- Opportunity to learn and grow professionally within the company globally
- Hybrid working model, flexible role pattern
- Pension and health insurance contributions
- Internal reward system plus referral program
- 5 weeks annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution
- Cafeteria for tax free benefits according to your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card
- Vodafone, Raiffeisen Bank, Foodora, and other discount programs
- Up-to-date laptop and iPhone
- Parking in the garage for drivers or showers for bikers
- Competitive salary, incentive pay, and many more
Ready to take up the challenge? Apply now
Know anybody who might be interested? Refer this job
The date shown below is the earliest possible closing date for this posting. However, we sometimes extend the job posting period as needed, so please feel free to apply anytime you see the "Apply" button available. You may also reach out to the recruiter directly via
Required Skills:
Preferred Skills:
Current Employees apply HERE
Current Contingent Workers apply HERE
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status:
Regular
Relocation:
VISA Sponsorship:
Travel Requirements:
Flexible Work Arrangements:
Hybrid
Shift:
Valid Driving License:
Hazardous Material(s):
Job Posting End Date:
11/22/2025
- A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.
Requisition ID:R373016
-
Platform Lead
5 days ago
Praha 9, Hlavní město Praha, Czech Republic Knihobot Full time 120,000 - 240,000 per yearBookbot is a recommerce platform that operates in 9 countries, ships books to 40 countries and has already sold over 10 million books. Every day we sell around books and receive a little bit more. We are the 16th fastest growing tech startup in the CEE region.Who we're looking for:You are experienced platform engineer, with expertise in AWS & Kubernetes.You...
-
Microsoft Dynamics 365/Power Platform Engineer
16 hours ago
Praha 5, Hlavní město Praha, Czech Republic SPOLEČNOST SATPO Full time 104,000 - 130,878 per yearSATPO management, s.r.o. je manažerská společnost, která řídí developerské rezidenční projekty, investuje do pozemkového developmentu pod značkou SATPO a rozvíjí investiční platformu pod značkou City Home z oblasti nákupu existujících činžovních domů či jejich podílů. Na českém realitním trhu působí společnost v rámci...
-
Technical Lead Platform Reliability
6 days ago
okres Praha-východ, Czech Republic GoodData Full time 150,000 - 250,000 per yearWe're an innovative leader in Business Intelligence, turning complex customer data into actionable insights with a powerful analytics platform. Our technology runs across AWS and Microsoft Azure and supports decision-making for companies worldwide. At our core is a collaborative team obsessed with reliability, performance, and continuous improvement.Role...
-
Staff Site Reliability Engineer
5 days ago
Praha 8, Hlavní město Praha, Czech Republic ShipMonk Full time 120,000 - 240,000 per yearWe are ShipMonk and we provide logistics services in the field of order fulfillment. We are operating from our main business in the USA, but Prague is the center of the development of our modern platform.Apply for jobWe are seeking an influential Staff SRE to help architect and drive the strategic evolution of our core cloud and deployment infrastructure,...
-
Data Engineer
2 weeks ago
Praha 3, Hlavní město Praha, Czech Republic Клаудфреш Full time 60,000 - 120,000 per yearVacancy descriptionCloudfresh is a Global Google Cloud Premier Partner, Zendesk Premier Partner, Asana Solutions Partner, GitLab Select Partner, Hubspot Platinum Partner, Okta Activate Partner, and Microsoft Partner.Since 2017, we've been specializing in the implementation, migration, integration, audit, administration, support, and training for top-tier...
-
Senior Cloud Infrastructure Engineer
6 days ago
okres Praha-východ, Czech Republic GoodData Full time 900,000 - 1,200,000 per yearAbout GoodDataGoodData is a cloud-based analytics platform enabling employees, customers, and partners to interpret data consistently. With an end-to-end pipeline, self-service analytics, low-code/no-code interfaces, embeddable data visualization, and seamless application integration, we deliver robust analytics solutions.About the RoleWe're looking for a...
-
Senior Site Reliability Engineer
6 days ago
okres Praha-východ, Czech Republic GoodData Full time 600,000 - 1,200,000 per yearAbout GoodDataGoodData is a cloud-based analytics platform enabling employees, customers, and partners to interpret data consistently. With an end-to-end pipeline, self-service analytics, low-code/no-code interfaces, embeddable data visualization, and seamless application integration, we deliver robust analytics solutions.About the RoleWe're looking for a...
-
Senior full-stack engineer
2 weeks ago
Praha, Czech Republic MSD Full time 120,000 - 240,000 per yearJob DescriptionThe PositionOur Team is looking for a Senior Full Stack Engineer. Our team supports the Company's business areas – research and development, manufacturing, supply chain, commercial and animal monitoring – through data science and AI delivery. This role allows you to work on impactful AI products and collaborate with AI engineers and data...
-
Middle Fullstack Engineer
6 days ago
Praha, Czech Republic Avenga Full time 60,000 - 120,000 per yearThis is us At Avenga, we believe that human creativity empowers technology that matters. Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, enterprise solutions, CX, UX and Ul design, managed services, product development, and software development.This is the jobAt Prague within the banking...
-
Senior Software Engineer
2 weeks ago
Okres Praha-západ, Central Bohemia, Czech Republic AutoRABIT Holding Full time 60,000 - 120,000 per yearAbout AutoRABITAutoRABIT is the leading DevSecOps and CI/CD platform for low-code SaaS environments like Salesforce. Our solutions help organizations achieve scalable release management, robust version control, and bulletproof data protection. We're growing fast — and we're just getting started.Our CultureAt AutoRABIT, learning isn't just encouraged —...