Senior Data Engineer

2 days ago


Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time
What you’ll need to succeed in this role:
  • At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems.
  • Strong programming skills in Python: writing clean code, OOP design.
  • Strong SQL skills, including performance tuning, query optimization, and experience with data warehousing solutions.
  • Experience in designing and implementing data governance and data management processes.
  • Deep expertise in Big Data technologies, including Apache Airflow, Dagster, Databricks, Spark, DBT, and other modern data orchestration and transformation tools.
  • Experience implementing and deploying solutions in cloud environments (with a preference for Azure).
  • Knowledge of how to build and deploy Power BI reports and dashboards for data visualization.
  • Excellent understanding of dimensional data and data modeling techniques.
  • Consulting experience and the ability to guide clients through architectural decisions, technology selection, and best practices.
  • Ability to work independently and take ownership of project deliverables.
  • Master’s or Ph.D. in Computer Science, Data Science, Mathematics, Physics, or a related field.

Addepto is a leading AI consulting and data engineering company that builds scalable, ROI-focused AI solutions for some of the world’s largest enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. With our exclusive focus on Artificial Intelligence and Big Data, we help organizations unlock the full potential of their data through systems designed for measurable business impact and long-term growth.
Beyond client projects, we have developed our own product offerings born from real-life client insights and challenges. We are also actively releasing open-source solutions to the community, transforming practical experience into tools that benefit the broader AI ecosystem. This commitment to scalable innovation, proven ROI delivery, and knowledge sharing has earned us recognition by Forbes as one of the top 10 AI consulting companies worldwide.

As a Senior Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. Here are some of the projects we are seeking talented individuals to join:

  • Design and development of a universal data platform for global aerospace companies. This Azure- and Databricks-powered initiative combines diverse enterprise and public data sources. The data platform is in the early stages of development, which covers the design of architecture and processes and leaves freedom for technology selection.
  • Data platform transformation for an energy management association body. This project addressed critical data management challenges, boosting user adoption, performance, and data integrity. The team is implementing a comprehensive data catalog, leveraging Databricks and Apache Spark/PySpark, for simplified data access and governance. Secure integration solutions and enhanced data quality monitoring, utilizing Delta Live Tables tests, established trust in the platform. The intermediate result is a user-friendly, secure, and data-driven platform that serves as a basis for further development of ML components.
  • Design of the data transformation and downstream DataOps pipelines for a global car manufacturer. This project aims to build a data processing system for both real-time streaming and batch data. We’ll handle data for business uses like process monitoring, analysis, and reporting, while also exploring LLMs for chatbots and data analysis. Key tasks include data cleaning, normalization, and optimizing the data model for performance and accuracy.
Discover our perks and benefits:
  • Work in a supportive team of passionate enthusiasts of AI & Big Data.
  • Engage with top-tier global enterprises and cutting-edge startups on international projects.
  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces.
  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications.
  • Choose from various employment options: B2B, employment contracts, or contracts of mandate.
  • Make use of 20 fully paid days off available for B2B contractors and individuals under contracts of mandate.
  • Participate in team-building events and utilize the integration budget.
  • Celebrate work anniversaries, birthdays, and milestones.
  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching.
  • Get full work equipment for optimal productivity, including a laptop and other necessary devices.
  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups.
  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture.
Your responsibilities:
  • Design and optimize scalable data processing pipelines for both streaming and batch workloads using Big Data technologies such as Databricks, Apache Airflow, and Dagster.
  • Architect and implement end-to-end data platforms, ensuring high availability, performance, and reliability.
  • Lead the development of CI/CD and MLOps processes to automate deployments, monitoring, and model lifecycle management.
  • Develop and maintain applications for aggregating, processing, and analyzing data from diverse sources, ensuring efficiency and scalability.
  • Collaborate with Data Science teams on Machine Learning projects, including text/image analysis, feature engineering, and predictive model deployment.
  • Design and manage complex data transformations using Databricks, DBT, and Apache Airflow, ensuring data integrity and consistency.
  • Translate business requirements into scalable and efficient technical solutions while ensuring optimal performance and data quality.
  • Ensure data security, compliance, and governance best practices are followed across all data pipelines.

Requirements: Python, SQL, ETL, Azure, Airflow, Databricks, Spark, Docker, CI/CD, Kubernetes, Kafka, Power BI, Dagster, dbt

Tools: Jira, Confluence, Wiki, GitHub, Agile, Scrum, Kanban.

Additionally: Private healthcare, Multisport card, Referral bonus, MyBenefit cafeteria, International projects, Flat structure, Paid leave, Training budget, Language classes, Team building events, Small teams, Flexible form of employment, Flexible working hours and remote work possibility, Free coffee, Startup atmosphere, No dress code, In-house trainings.

  • Remote, Wrocław, Czech Republic AVENGA (Agencja Pracy, nr KRAZ: 8448) Full time

    Key Requirements: 5 years of experience as a Data Engineer. Proven experience in Azure Databricks (data engineering, pipelines, performance tuning, Python). Azure DevOps (Repos, Pipelines, YAML). Azure Key Vault. Azure Data Factory (optional). Good to have: knowledge of Power BI. Strong analytical and problem-solving skills. Excellent communication and stakeholder...


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: 5+ years of commercial experience designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI, LLMs, Deep Learning). Proficiency in developing ML algorithms from scratch to production deployment. Strong programming skills in Python: writing...


  • Remote, Warszawa, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you'll need to succeed in this role: At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes. Strong programming skills in Python (or Java/Scala): writing clean code, OOP design. Hands-on with Big Data technologies like Spark, Cloudera Data Platform, Airflow,...


  • Remote, Kraków, Białystok, Wrocław, Czech Republic Grape Up Full time

    PhD or master’s degree in Computer Science, Data Science, AI, or related field. 5+ years of professional experience in Data Engineering and Big Data. Proven experience in implementing and deploying solutions in AWS using AWS stack (Redshift, Kinesis, Athena). Proven experience with AWS Data Processing (Glue, EMR). Experience with Data Pipelines Orchestration...


  • Remote, Gdańsk, Wrocław, Warsaw, Kraków, Poznań, Czech Republic RemoDevs Full time

    Proven experience with Azure Databricks and Azure Data Factory (ADF). Strong skills in SQL and Python for data engineering. Experience in building pipelines and data models. Good English (minimum B2) to communicate in an international team. Experience with Agile methods and Azure DevOps. We are looking for skilled Data Engineers to join a team working on...


  • Remote, Warszawa, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes. Strong programming skills in Python (or Java/Scala): writing clean code, OOP design. Hands-on with Big Data technologies like Spark, Cloudera Data Platform,...


  • Remote, Warszawa, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes. Strong programming skills in Python (or Java/Scala): writing clean code, OOP design. Hands-on with Big Data technologies like Spark, Cloudera Data Platform,...


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: 6+ years of commercial experience in designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI). Experience in Machine Learning projects leadership and team mentoring. Proficiency in developing ML algorithms from scratch to production...


  • Remote, Warsaw, Cracow, Czech Republic RTB House Full time

    5+ years of hands-on experience in data engineering roles, building and maintaining large-scale distributed data systems. Proven experience working with petabyte-scale datasets and high-throughput systems. Strong programming skills in Python, Java, or Scala. Solid understanding of database management systems (both relational and non-relational). Expertise...