Data Engineer

2 weeks ago


Remote Warszawa Wrocław Białystok Kraków Gdańsk, Czech Republic Addepto Full time

What you’ll need to succeed in this role: At least 3 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes. Strong programming skills in Python (or Java/Scala): writing a clean code, OOP design. Hands-on with Big Data technologies like Spark, Cloudera Data Platform, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino or Hudi. Excellent understanding of dimensional data and data modeling techniques. Experience implementing and deploying solutions in cloud environments. Consulting experience with excellent communication and client management skills, including prior experience directly interacting with clients as a consultant. Ability to work independently and take ownership of project deliverables. Fluent English (at least C1 level). Bachelor’s degree in technical or mathematical studies. Nice to have: Experience with an MLOps framework such as Kubeflow or MLFlow. Familiarity with Databricks, dbt or Kafka. Addepto is a leading consulting and technology company specializing in AI and Big Data, helping clients deliver innovative data projects. We partner with top-tier global enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. Our exclusive focus on AI and Big Data has earned us recognition by Forbes as one of the top 10 AI consulting companies. As a Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. Here are some of the projects we are seeking talented individuals to join: Development and maintenance of a large platform for processing automotive data. A significant amount of data is processed in both streaming and batch modes. The technology stack includes Spark, Cloudera, Airflow, Iceberg, Python, and AWS. Design and development of a universal data platform for global aerospace companies. This Azure and Databricks powered initiative combines diverse enterprise and public data sources. The data platform is at the early stages of the development, covering design of architecture and processes as well as giving freedom for technology selection. Centralized reporting platform for a growing US telecommunications company. This project involves implementing BigQuery and Looker as the central platform for data reporting. It focuses on centralizing data, integrating various CRMs, and building executive reporting solutions to support decision-making and business growth. Discover our perks and benefits: Work in a supportive team of passionate enthusiasts of AI & Big Data. Engage with top-tier global enterprises and cutting-edge startups on international projects. Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces. Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications. Choose from various employment options: B2B, employment contracts, or contracts of mandate. Make use of 20 fully paid days off available for B2B contractors and individuals under contracts of mandate. Participate in team-building events and utilize the integration budget. Celebrate work anniversaries, birthdays, and milestones. Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching. Get full work equipment for optimal productivity, including a laptop and other necessary devices. With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups. Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture. ,[Develop and maintain a high-performance data processing platform for automotive data, ensuring scalability and reliability., Design and implement data pipelines that process large volumes of data in both streaming and batch modes., Optimize data workflows to ensure efficient data ingestion, processing, and storage using technologies such as Spark, Cloudera, and Airflow., Work with data lake technologies (e.g., Iceberg) to manage structured and unstructured data efficiently., Collaborate with cross-functional teams to understand data requirements and ensure seamless integration of data sources., Monitor and troubleshoot the platform, ensuring high availability, performance, and accuracy of data processing., Leverage cloud services (AWS) for infrastructure management and scaling of processing workloads., Write and maintain high-quality Python (or Java/Scala) code for data processing tasks and automation.] Requirements: Python, SQL, Spark, Airflow, AWS, Cloudera, CI/CD, Kubernetes, Kafka, NiFi, Trino, Hudi, Java, Scala, Docker, Databricks, MLOps, DevOps, Iceberg Tools: Jira, Confluence, Wiki, GitHub, Agile, Scrum, Kanban. Additionally: Private healthcare, Multisport card, Referral bonus, MyBenefit cafeteria, International projects, Flat structure, Paid leave, Training budget, Language classes, Team building events, Small teams, Flexible form of employment, Flexible working hours and remote work possibility, Free coffee, Startup atmosphere, No dress code, In-house trainings.


  • Data Scientist

    2 days ago


    Remote, Warszawa, Gdańsk, Wrocław, Białystok, Kraków, Czech Republic Addepto Full time

    🎯 What you’ll need to succeed in this role: At least 3+ years of proven commercial experience designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI). Proficiency in developing ML algorithms from scratch to production deployment. Strong programming skills in Python: writing...

  • Senior Data Engineer

    2 weeks ago


    Remote, Warszawa, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes. Strong programming skills in Python (or Java/Scala): writing a clean code, OOP design. Hands-on with Big Data technologies like Spark, Cloudera, Data Platform,...

  • Lead Data Engineer

    2 weeks ago


    Remote, Warszawa, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    🎯 What you'll need to succeed in this role: 5+ years of proven commercial experience in implementing, developing, or maintaining Big Data systems. Strong programming skills in Python or Java/Scala: writing a clean code, OOP design. Experience in designing and implementing data governance and data management processes. Familiarity with Big...


  • Białystok, Warszawa, Gdańsk, Łódź, Wrocław, Czech Republic Godel Technologies Europe Full time

    Ideally you have: 3+ years in Data Engineering role  Solid Python programming skills for building and maintaining data pipelines  Advanced SQL skills, including query optimization and performance tuning  Experience with ETL/ELT tools and data orchestration frameworks like Apache Airflow, dbt  Strong understanding of data modeling principles (dimensional...

  • Data Engineer

    2 weeks ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 3 years of commercial experience implementing, developing, or maintaining Big Data systems. Strong programming skills in Python: writing a clean code, OOP design. Strong SQL skills, including performance tuning, query optimization, and experience with data warehousing solutions. Experience in designing...


  • Kraków, Gdańsk, Wrocław, Warszawa, Lublin, Czech Republic 1dea Full time

    Minimum 5 lat doświadczenia w IT applications management Doświadczenie i wiedza z zakresu user access management Dobra znajomość SQL Umiejętność programowania w Python oraz znajomość Spark Podstawowa znajomość systemów Unix i środowisk Big Data Umiejętność koordynacji z zespołami globalnymi i współpracy w międzynarodowym...

  • Senior Data Engineer

    2 weeks ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 5 years of commercial experience implementing, developing, or maintaining Big Data systems. Strong programming skills in Python: writing a clean code, OOP design. Strong SQL skills, including performance tuning, query optimization, and experience with data warehousing solutions. Experience in...

  • Junior Data Engineer

    2 weeks ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: At least 1 year of proven commercial experience developing, or maintaining Big Data systems. Hands-on experience with Big Data technologies, including Databricks, Apache Spark, Airflow, and DBT. Strong programming skills in Python: writing a clean code, OOP design. Experience in designing and implementing data...


  • Remote, Wrocław, Białystok, Kraków, Gdańsk, Warszawa, Czech Republic Addepto Full time

    What you’ll need to succeed in this role: Bachelor’s or higher in Computer Science, Mathematics, Physics, or related field. Hands-on experience with Python and Data Science applications (Generative AI, LLMs, Machine Learning, Predictive Modeling, NLP, Computer Vision, Deep Learning). Proven track record in managing technical teams and leading end-to-end...

  • Data Engineer @ EPIKA

    2 weeks ago


    Warszawa, Wrocław, Łódz, Czech Republic EPIKA Full time

    Data Engineering Foundations: Azure Databricks - PySpark, Spark SQL, Unity Catalog, Workflows Azure Data Factory, Key Vault, and ADLS/Delta OAuth, OpenID, SAML, JWT Medallion Architecture Strong SQL and data modeling in a lakehouse/Delta architecture Python for data engineering (API integration, utilities, testing) Operations: Running production pipelines...