Data Engineer with Spark

2 days ago


Remote, Wroclaw, Poland Comscore (via CC) Full time

The candidate must have:

  • Solid understanding of Spark basics, building blocks, and mechanics (the deeper the knowledge, the higher the value)
  • Strong knowledge of Python, Java, or Scala (with the ability to expand to expert level)
  • 1+ years of experience with Spark (commercial not required, deep understanding matters more than years)
  • Good SQL skills - not necessarily writing complex queries by hand, but strong knowledge of available tools and approaches to solve data problems
  • Understanding of data quality issues in large datasets (inconsistencies, missing data, imbalanced sets, etc.)
  • 1+ years of experience with Linux (power-user skills are a big plus; deployment is not required, but Linux knowledge makes your life easier)
  • Professional working proficiency in English (oral and written)
  • Understanding of HTTP API communication patterns (HTTP/REST/RPC) and of the protocol itself
  • Good software debugging skills (beyond print - using debuggers effectively)
  • Deep understanding of at least one technical area (be ready to share your biggest "battle story" about it)
  • Solid Git understanding
  • Strong communication skills (ability to drive end-to-end projects and mentor team members)

If you don't have all the qualifications but are interested in what we do and have a solid understanding of Linux, let's talk.

Correct Context is looking for a Data Engineer with Spark for Comscore, in and around Poland.

Comscore is a global leader in media analytics, revolutionizing insights into consumer behavior, media consumption, and digital engagement.

Comscore leads in measuring and analyzing audiences across diverse digital platforms. Thrive on using cutting-edge technology, play a vital role as a trusted partner delivering accurate data to global businesses, and collaborate with industry leaders like Facebook, Disney, and Amazon. Contribute to empowering businesses in the digital era across media, advertising, e-commerce, and technology sectors.

We have multiple Java + Spark, Scala + Spark, and Python + Spark teams. Depending on your skills and experience, we may match you with several teams or find the single best fit.

We offer:

  • Real big data projects (PB scale)
  • An international team (US, PL, IE, CL)
  • A small, independent team working environment
  • High influence on the working environment
  • Hands-on environment
  • Flexible work time
  • Fully remote or in-office work in Wroclaw, Poland
  • 12,000 - 22,000 PLN net/month B2B
  • Private healthcare (PL)
  • Multikafeteria (PL)
  • Free parking (PL)

The recruitment process for the Data Engineer position has the following steps:

  1. Technical survey - 10 min
  2. Technical screening - 30 min video call
  3. Technical interview - 60-90 min video call - this step may be repeated if you speak with multiple teams (we have several teams you may want to join; the choice is yours)
  4. Final Interview - Technical/Managerial - 30 min video call

Responsibilities:

  • Design, implement, and maintain petabyte-scale big data pipelines using Spark (Java, Python, or Scala, depending on the team), Apache Airflow, Kubernetes, and other technologies
  • Optimize performance - working with big data is highly specific: sometimes IO-bound, sometimes CPU-bound. You'll help figure out the most efficient approaches
  • Collaborate closely with other big data teams
  • Work with technologies such as AWS, Kubernetes, Airflow, EMR, Hadoop, Linux/Ubuntu, Kafka, and Spark

Requirements: Spark, Java, Scala, Python, Big data, AWS, API

Tools: Jira, Confluence, Bitbucket, Git, Jenkins, Agile, Kanban

Additionally: Remote work, Flexible working hours, Sport subscription, Flat structure, Small teams, International projects, Free parking, Free coffee, Playroom, Modern office, Startup atmosphere, No dress code
