Data Engineer

4 days ago


Remote Warsaw Wrocław Białystok Kraków Gdańsk, Czech Republic Addepto Full time
What you'll need to succeed in this role:
  • At least 4 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes.
  • Strong programming skills in Python (or Java/Scala): writing a clean code, OOP design.
  • Hands-on with Big Data technologies like Spark, Cloudera Data Platform, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino or Hudi.
  • Excellent understanding of dimensional data and data modeling techniques.
  • Experience implementing and deploying solutions in cloud environments.
  • Consulting experience with excellent communication and client management skills, including prior experience directly interacting with clients as a consultant.
  • Ability to work independently and take ownership of project deliverables.
  • Fluent English (at least C1 level).
  • Bachelor's degree in technical or mathematical studies.
Nice to have:
  • Experience with an MLOps framework such as Kubeflow or MLFlow.
  • Familiarity with Databricks, dbt or Kafka.

Addepto is a leading consulting and technology company specializing in AI and Big Data, helping clients deliver innovative data projects. We partner with top-tier global enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. Our exclusive focus on AI and Big Data has earned us recognition by Forbes as one of the top 10 AI companies.

As a Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. Here are some of the projects we are seeking talented individuals to join:

  • Development and maintenance of a large platform for processing automotive data. A significant amount of data is processed in both streaming and batch modes. The technology stack includes Spark, Cloudera, Airflow, Iceberg, Python, and AWS.
  • Design and development of a universal data platform for global aerospace companies. This Azure and Databricks powered initiative combines diverse enterprise and public data sources. The data platform is at the early stages of the development, covering design of architecture and processes as well as giving freedom for technology selection.
  • Centralized reporting platform for a growing US telecommunications company. This project involves implementing BigQuery and Looker as the central platform for data reporting. It focuses on centralizing data, integrating various CRMs, and building executive reporting solutions to support decision-making and business growth.

Discover our perks and benefits:
  • Work in a supportive team of passionate enthusiasts of AI & Big Data.
  • Engage with top-tier global enterprises and cutting-edge startups on international projects.
  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces.
  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications.
  • Choose from various employment options: B2B, employment contracts, or contracts of mandate.
  • Make use of 20 fully paid days off available for B2B contractors and individuals under contracts of mandate.
  • Participate in team-building events and utilize the integration budget.
  • Celebrate work anniversaries, birthdays, and milestones.
  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching.
  • Get full work equipment for optimal productivity, including a laptop and other necessary devices.
  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups.
  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture.
,[Develop and maintain a high-performance data processing platform for automotive data, ensuring scalability and reliability., Design and implement data pipelines that process large volumes of data in both streaming and batch modes., Optimize data workflows to ensure efficient data ingestion, processing, and storage using technologies such as Spark, Cloudera, and Airflow., Work with data lake technologies (e.g., Iceberg) to manage structured and unstructured data efficiently., Collaborate with cross-functional teams to understand data requirements and ensure seamless integration of data sources., Monitor and troubleshoot the platform, ensuring high availability, performance, and accuracy of data processing., Leverage cloud services (AWS) for infrastructure management and scaling of processing workloads., Write and maintain high-quality Python (or Java/Scala) code for data processing tasks and automation.] Requirements: Python, SQL, Spark, Airflow, AWS, Cloudera, CI/CD, Kubernetes, Kafka, NiFi, Trino, Hudi, Java, Scala, Docker, Databricks, MLOps, DevOps, Iceberg Tools: Jira, Confluence, Wiki, GitHub, Agile, Scrum, Kanban. Additionally: Private healthcare, Multisport card, Referral bonus, MyBenefit cafeteria, International projects, Flat structure, Paid leave, Training budget, Language classes, Team building events, Small teams, Flexible form of employment, Flexible working hours and remote work possibility, Free coffee, Startup atmosphere, No dress code, In-house trainings.

  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    As a Senior Data Engineer at Addepto, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. With a focus on data transformation and reporting, your role will involve designing and implementing scalable data pipelines on GCP using BigQuery for...


  • Remote, Warsaw, Kraków, Wrocław, Gdańsk, Czech Republic RemoDevs Full time

    4+ years of experience with Python Proven expertise in Databricks and data pipeline development Strong skills in SQL for data processing and transformation Proficiency with cloud platforms (preferably Azure) Fluent English communication skills Senior Data EngineerAre you passionate about turning raw data into meaningful insights? Do you thrive in a...


  • Remote, Lisbon, Frankfurtammain, Heidelberg, Warszawa, Czech Republic Data Science UA Full time

    - Bachelor degree in Computer Science, similar technical field of study or equivalent practical experience.- Commercial experience developing Spark Jobs using Scala and Java- Experience in data processing using traditional and distributed systems (Hadoop, Spark, AWS - S3) and designing data models and data warehouses.- Strong understanding and application of...

  • Data Engineer

    5 days ago


    Remote, Warsaw, Czech Republic GetInData | Part of Xebia Full time

    Proficiency in a programming language like Python and SQLDeep understanding of data warehousing concepts and experience with platforms such as Snowflake, BigQuery, or RedshiftExperience as a programmer and knowledge of software engineering, good principles, practices, and solutionsFamiliarity with cloud GCP / AWSFamiliarity with DevOps area and tools - GKE,...


  • Remote, Lisbon, Frankfurtammain, Heidelberg, Warszawa, Czech Republic Data Science UA Full time

    About the CompanyData Science UA is a service company with strong data science and AI expertise. We have established ourselves as a leading provider of data-driven solutions in Eastern Europe.Our ClientWe are proud to partner with a market-leading intelligence platform for paid search advertising. Their innovative approach combines artificial search...

  • Lead Data Scientist

    2 days ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you'll need to succeed in this role:6+ years of commercial experience in designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI).Experience in Machine Learning projects leadership and team mentoring.Proficiency in developing ML algorithms from scratch to production...


  • Remote, Warsaw, Czech Republic GetInData | Part of Xebia Full time

    Proficiency in a programming language like Python / Scala or JavaKnowledge of Lakehouse platforms - DatabricksExperience working with dbtFamiliarity with Version Control Systems, particularly GITExperience as a programmer and knowledge of software engineering, good principles, practices, and solutionsExtensive experience in Microsoft AzureKnowledge of at...


  • Gdańsk, Wrocław, Poznań, Kraków, Waszawa, Czech Republic Capgemini Polska Sp. z o.o. Full time

    About Insights & Data:Insights & Data is a dynamic team of over 400 professionals delivering cutting-edge data solutions. We specialize in Cloud & Big Data engineering, building scalable systems for complex datasets across AWS, Azure, and GCP. Our expertise spans the full Software Development Life Cycle (SDLC), utilizing modern data processing tools,...


  • Remote, Warsaw, Czech Republic GetInData | Part of Xebia Full time

    Company Overview:We are GetInData | Part of Xebia, a company that values innovation and collaboration.Our team is passionate about data engineering and strives to create cutting-edge solutions.Job Description:A Data Engineer's role involves designing, constructing, and maintaining data architecture, tools, and procedures facilitating an organization's...


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    Company OverviewAddepto is a pioneering consulting and technology company specializing in AI and Big Data, helping clients deliver cutting-edge data projects. We partner with top-tier global enterprises and innovative startups, leveraging our expertise to drive business growth.

  • Data Scientist

    6 days ago


    Remote, Warszawa, Gdańsk, Wrocław, Białystok, Kraków, Czech Republic Addepto Full time

    What you'll need to succeed in this role:Proven commercial experience designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI).Proficiency in developing ML algorithms from scratch to production deployment.Strong programming skills in Python: writing clean code, OOP design,...


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    Company OverviewAddepto is a leading consulting and technology company specializing in AI and Big Data, helping clients deliver innovative data projects. We partner with top-tier global enterprises and pioneering startups.Job DescriptionWe are seeking a highly skilled Data Engineer to join our team. As a key member of our data processing platform, you will...


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic Xebia sp. z o.o. Full time

    About the Role:As a Senior Data Engineer at Xebia, you will be working closely with engineering, product, and data teams to deliver scalable and robust data solutions to our clients. Your key responsibilities include designing, building, and maintaining data platforms and pipelines, as well as mentoring new engineers.Key Responsibilities:Designing and...


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic Xebia sp. z o.o. Full time

    About the Role:Xebia, a renowned digital solutions developer, is seeking a highly skilled Senior GCP Data Engineer to join its team. As a Senior Data Engineer at Xebia, you will play a crucial role in designing, building, and maintaining data platforms and pipelines that cater to our clients' diverse needs.The ideal candidate will have extensive experience...

  • Data Engineer

    4 days ago


    Gdańsk, Wrocław, Poznań, Kraków, Waszawa, Czech Republic Capgemini Polska Sp. z o.o. Full time

    About Insights & Data:Our team is a dynamic and innovative group of over 400 professionals delivering cutting-edge data solutions. We specialize in Cloud & Big Data engineering, building scalable systems for complex datasets across top cloud providers. Our expertise spans the full Software Development Life Cycle (SDLC), utilizing modern data processing...

  • Data Engineer @

    2 days ago


    Kraków, Wrocław, Warszawa, Czech Republic Unit8 SA Full time

    As a member of agile project teams, your mission will be to build solutions and infrastructure aiming at solving the business problems of our clients.You are a proficient software engineer who knows the fundamentals of computer science and you master at least one widely adopted programming language (Python, Java, C#, C++).You know how to write...

  • AI/ML Engineer @

    1 week ago


    Remote, Czech Republic PAR Data Central Full time

    Position Overview: As an AI/ML Engineer, you will play a crucial role in enhancing our platform's capabilities by developing and refining machine learning models that drive accurate forecasting, event analysis, data-driven decision-making, and more. You will collaborate closely with product owners and engineering teams to implement scalable AI solutions that...

  • Data Engineer @

    6 days ago


    Warszawa, Wrocław, Poznań, Gdańsk, Lublin, Kraków, Czech Republic Sollers Consulting Full time

    About the requirements. You need:At least 3 years of experience in data engineering.Experience with ETL/ELT processes and building data processing pipelines.Experience working with various data sources (RDBMS, CDC, APIs, data connectors, structured, semi-structured and unstructured files).Experience with various data transformation and processing tools (e.g....


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    Company OverviewAddepto is a leading consulting and technology company specializing in AI and Big Data, helping clients deliver innovative data projects.


  • Remote, Wrocław, Gdańsk, Rzeszów, Czech Republic Xebia sp. z o.o. Full time

    7+ years in a data engineering role, with hands-on experience in building data processing pipelines,experience in leading the design and implementing of data pipelines and data products,proficiency with GCP services, for large-scale data processing and optimization,extensive experience with Apache Airflow, including DAG creation, triggers, and workflow...