Lead/Senior Data Engineer @

2 days ago


Remote Warsaw, Czech Republic KMD Poland Full time

Personal Requirements: 

  •    Have 4+ years of Apache Spark experience and have faced various data engineering challenges in batch or streaming
  •    Have an interest in stream processing with Apache Spark Structured Streaming on top of Apache Kafka
  •    Have experience leading technical solution designs
  •    Have experience with distributed systems on a cloud platform
  •    Have experience with large-scale systems in a microservice architecture
  •    Are familiar with Git and CI/CD practices and can design or implement the deployment process for your data pipelines
  •    Possess a proactive approach and can-do attitude
  •    Are excellent in English and Polish, both written and spoken
  •    Have a higher education in computer science or a related field
  •    Are a team player with strong communication skills

Nice to have requirements: 

  •    Apache Spark Structured Streaming
  •    Azure
  •    Domain Driven Development
  •    Docker containers and Kubernetes
  •    Message brokers (i.e. Kafka) and event-driven architecture
  •    Agile/Scrum

Are you ready to join our international team as a Lead / Senior Data Engineer? We shall tell you why you should...

What product do we develop?

We are building an innovative solution, KMD Elements, on Microsoft Azure cloud dedicated to the energy distribution market (electrical energy, gas, water, utility, and similar types of business). Our customers include institutions and companies operating in the energy market as transmission service operators, market regulators, distribution service operators, energy trading, and retail companies.

KMD Elements delivers components allowing implementation of the full lifecycle of a customer on the energy market: meter data processing, connection to the network, physical network management, change of operator, full billing process support, payment, and debt management, customer communication, and finishing on customer account termination and network disconnection.

The key market advantage of KMD Elements is its ability to support highly flexible, complex billing models as well as scalability to support large volumes of data. Our solution enables energy companies to promote efficient energy generation and usage patterns, supporting sustainable and green energy generation and consumption.

We work with always up-to-date versions of: 

  • Apache Spark on Azure Databricks
  • Apache Kafka
  • Delta Lake
  • Java
  • MS SQL Server and NoSQL storages like Elastic Search, Redis, Azure Data Explorer
  • Docker containers
  • Azure DevOps and fully automated CI/CD pipelines with Databricks Asset Bundles, ArgoCD, GitOps, Helm charts
  • Automated tests

How do we work?

#Agile #Scrum #Teamwork #CleanCode #CodeReview #Feedback #BestPracticies   

  • We follow Scrum principles in our work – we work in biweekly iterations and produce production-ready functionalities at the end of each iteration – every 3 iterations we plan the next product release
  • We have end-to-end responsibility for the features we develop – from business requirements, through design and implementation up to running features on production
  • More than 75% of our work is spent on new product features
  • Our teams are cross-functional (7-8 persons) – they develop, test and maintain features they have built
  • Teams' own domains in the solution and the corresponding system components
  • We value feedback and continuously seek improvements
  • We value software best practices and craftsmanship

Product principles:

  • Domain model created using domain-driven design principles
  • Distributed event-driven architecture / microservices
  • Large-scale system for large volumes of data (>100TB data), processed by Apache Spark streaming and batch jobs powered by Databricks platform

Our offer:

  • Contract type: B2B
  • Work Mode: Flexible — this role supports on-site, hybrid, and remote arrangements, depending on your individual preferences.
  • Occasional on-site presence may be required — for example, onboard new team members, explore new business domains, or refine requirements in close collaboration with stakeholders or team building activities.
,[Develop and maintain the leading IT solution for the energy market using Apache Spark, Databricks, Delta Lake, and Apache Kafka, Have end-to-end responsibility for the full lifecycle of features you develop, Design technical solutions for business requirements from the product roadmap, Maintain alignment with architectural principles defined on the project and organizational level, Ensure optimal performance through continuous monitoring and code optimization., Refactor existing code and enhance system architecture to improve maintainability and scalability., Design and evolve the test automation strategy, including technology stack and solution architecture., Prepare reviews, participate in retrospectives, estimate user stories, and refine features ensuring their readiness for development.] Requirements: Apache Spark, Apache Kafka, Microservice architecture, Data, Databricks, Batch, Java, SQL, CI/CD, Spark, DDD, Azure, Docker Additionally: Sport subscription, Training budget, Private healthcare, International projects, Flat structure, Free coffee, Bike parking, Playroom, Free snacks, Free beverages, In-house trainings, No dress code.

  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic beBeeDataScientist Full time €80,000 - €150,000

    Job Title: Senior Data Scientist LeadAs a senior data scientist, you will lead a team of experts in leveraging cutting-edge technologies to create innovative AI solutions that drive business value.We are seeking a highly skilled professional with expertise in machine learning, data science, and software engineering.The ideal candidate should have strong...


  • Remote, Czech Republic beBeeDataEngineer Full time 600,000 - 1,000,000

    Job Summary:We are seeking a highly skilled Data Engineer to join our team and lead the implementation of Trino/Apache Trino solutions. The successful candidate will have expertise in data virtualization, strong proficiency in Trino administration, configuration, and tuning.Key Responsibilities:Installation and performance optimization of Trino...


  • Remote, Warszawa, Czech Republic Antal Full time

    About You:Master's degree in Computer Science, Engineering, Pharmacy, or a related field.15+ years of progressive experience in IT quality engineering, testing, validation, and compliance—particularly in highly regulated industries.Proven track record leading enterprise-wide quality transformations and delivering strategic outcomes at scale.Deep knowledge...


  • Remote, Warsaw, Czech Republic beBeeDataEngineer Full time 900,000 - 1,200,000

    Job SummaryWe are seeking a seasoned Data Engineer to lead the design and implementation of technical solutions for business requirements. The ideal candidate will have a strong background in data engineering, particularly with Apache Spark.Main Responsibilities: Design and implement technical solutions, develop and maintain leading IT solutions, end-to-end...


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic beBeeEngineering Full time €90,000 - €110,000

    Job OverviewThe Senior AWS Data Engineer will be responsible for designing and building at-scale infrastructure with a focus on distributed systems.This role involves creating architecture patterns for data processing, workflow definitions, and system-to-system integrations using Big Data and Cloud technologies.A key aspect of this position is translating...


  • Remote, Wrocław, Warsaw, Czech Republic beBeeMachineLearning Full time 1,200,000 - 1,600,000

    About this roleWe are seeking a skilled Senior Machine Learning Engineer to lead cutting-edge projects in deep learning-focused applications, particularly in biomedical contexts with an emphasis on microscopic imaging data.Key responsibilitiesImplement state-of-the-art machine learning methods for analyzing microscopy data, enabling metadata extraction and...


  • Remote, Czech Republic beBeeDataEngineer Full time €90,000 - €120,000

    Job DescriptionWe are seeking a highly skilled Data Engineer to join our team. As a Senior Data Engineer, you will be responsible for designing, developing, and maintaining large-scale data processing systems.Your primary focus will be on leveraging Trino (Starburst or Apache) to deliver high-performance data engineering solutions. You will work closely with...


  • Remote, Czech Republic beBeeData Full time €80,000 - €103,000

    Senior SAP Data Architect PositionWe seek an accomplished Senior SAP BW/BI Solution Architect & Developer to lead data solution design, architecture, and business analysis in a dynamic SAP landscape.Key Responsibilities:Data Solution Design and Architecture:Oversee the design and implementation of scalable and innovative data solutions using SAP BW/BI...


  • Remote, Czech Republic beBeeCloudEngineer Full time 1,000,000 - 1,200,000

    About the RoleWe are seeking a senior platform and infrastructure engineer to join our team in building and evolving a robust data analytics platform using Google Cloud Platform (GCP).


  • Remote, Warszawa, Czech Republic Crestt Full time

    Technologies & ToolsSnowflake (enterprise data warehouse)dbt Cloud (ETL/ELT development – licenses provided)Confluence (documentation)Azure DevOps / Jira (task planning and tracking) We are seeking an experienced Senior Data Engineer with strong expertise in Snowflake to support a short-term data engineering initiative (September–December 2025,...