Data Engineer with Spark @ Comscore
16 hours ago
The candidate must have: Solid understanding of Spark basics, building blocks, and mechanics (the deeper the knowledge, the higher the value) Strong knowledge of Python, Java, or Scala (with the ability to expand to expert level) 1+ years of experience with Spark (commercial not required, deep understanding matters more than years) Good SQL skills - not necessarily writing complex queries by hand, but strong knowledge of available tools and approaches to solve data problems Understanding of data quality issues in large datasets (inconsistencies, missing data, imbalanced sets, etc.) 1+ years of experience with Linux (power-user skills are a big plus; deployment is not required, but Linux knowledge makes your life easier) Professional working proficiency in English (oral and written) Understanding of HTTP API communication patterns (HTTP/REST/RPC) and protocol itself Good software debugging skills (beyond print - using debuggers effectively) Deep understanging of at least one technical area (be ready to share your biggest "battle story" about it) Solid Git understanding Strong communication skills (ability to drive end-to-end projects and mentor team members) If you don't have all the qualifications, but you're interested in what we do and you have a solid Linux understanding -> let's talk Correct Context is looking for a Data Engineer with Spark for Comscore in Poland and around. Comscore is a global leader in media analytics, revolutionizing insights into consumer behavior, media consumption, and digital engagement. Comscore leads in measuring and analyzing audiences across diverse digital platforms. Thrive on using cutting-edge technology, play a vital role as a trusted partner delivering accurate data to global businesses, and collaborate with industry leaders like Facebook, Disney, and Amazon. Contribute to empowering businesses in the digital era across media, advertising, e-commerce, and technology sectors. We have multiple Java + Spark, Scala + Spark, Python + Spark teams and we may try to match you to multiple teams or just find you single best fit depends on your skills and experience. We offer: Real big data projects (PB scale) 🚀 An international team (US, PL, IE, CL) 🌎 A small, independent team working environment 🧑💻 High influence on working environment Hands on environment Flexible work time ⏰ Fully remote or in-office work in Wroclaw, Poland 🏢 12,000 - 22,000 PLN net/month B2B 💰 Private healthcare (PL) 🏥 Multikafeteria (PL) 🍽️ Free parking (PL)🚗 The recruitment process for the Data Engineer position has following steps: Technical survey - 10min Technical screening - 30 min video call Technical interview - 60-90min video call - this step can be multiplied if we speak to multiple teams (we have multiple teams that you may want to join, your choice) Final Interview - Technical/Managerial - 30 min video call ,[ Design, implement, and maintain petabyte-scale big data pipelines using Spark (Java, Python, or Scala - depending on the team), Apache Airflow, Kubernetes, and other technologies , Optimize performance - working with big data is highly specific: sometimes IO-bound, sometimes CPU-bound. You’ll help figure out the most efficient approaches , Collaborate closely with other big data teams , Work with technologies such as AWS, Kubernetes, Airflow, EMR, Hadoop, Linux/Ubuntu, Kafka, and Spark ] Requirements: Spark, Java, Scala, Python, Big data, AWS, API Tools: Jira, Confluence, Bitbucket, GIT, Jenkins, Agile, Kanban. Additionally: Remote work, Flexible working hours, Sport subscription, Flat structure, Small teams, International projects, Free parking, Free coffee, Playroom, Modern office, Startup atmosphere, No dress code.
-
Python Big Data Developer @ Comscore
16 hours ago
Remote, Wroclaw, Czech Republic Comscore (via CC) Full timeAs a Python Developer, you'll: Define data models which tie together large datasets from multiple data sources (terabytes of data, hundreds of terabytes) Design, implement and maintain Data pipelines and data driven solutions (using python) and Linux/AWS environment Building data pipelines using Apache Airflow, Apache Druid, Apache Spark, AWS, EMR,...
-
Remote, Wrocław, Czech Republic Comscore (via CC) Full timeAn ideal candidate would have: A solid foundation in Python programming, future proficiency in SQL, and a solid basic understanding of statistical concepts. Python programming, proficiency in SQL, and a basic understanding Knowledge of testing methodologies and practical experience in API testing and data testing Experience in testing large datasets is...
-
Java/Linux Engineer @ Comscore
16 hours ago
Remote, Wroclaw, Czech Republic Comscore (via CC) Full timeThe candidate must have: 2+ years of strong Linux experience 2+ years of strong Java experience Sharp axe Strong knowledge of Linux processes, thread management, shell scripting, service management and networking Strong knowledge and good understanding of Java/JVM and ecosystem (how Java resources map to system resources, how to optimize it and...
-
Python/Django FullStack Developer @ Comscore
16 hours ago
Remote, Wroclaw, Czech Republic Comscore (via CC) Full timeAs a Python/Django Big Data Developer, you'll: Build and maintain scalable APIs using Python, Django + DRF Build also React frontends as part of the role (hard to say what would be exact balance between frontend and backend, but we primarly looking for backend developers with being open to help with frontend) Work with AWS/Kubernetes/Apache Airflow and...
-
Big Data Engineer
16 hours ago
Remote, Czech Republic Link Group Full timeMust-Have Qualifications At least 3+ years of experience in big data engineering. Proficiency in Scala and experience with Apache Spark. Strong understanding of distributed data processing and frameworks like Hadoop. Experience with message brokers like Kafka. Hands-on experience with SQL/NoSQL databases. Familiarity with version control tools like Git....
-
Remote, Kraków, Warszawa, Czech Republic 1dea Full timeWymagania Min. 4 lata doświadczenia w zarządzaniu aplikacjami IT. Biegła znajomość SQL oraz dobra wiedza z zakresu Python/Spark. Zrozumienie zarządzania dostępem użytkowników. Podstawowa wiedza o Unix i środowisku Big Data (HDFS). Biegła znajomość języka angielskiego w mowie i piśmie (B2+) Mile widziane: Doświadczenie z...
-
Senior Data Engineer @ Link Group
1 week ago
Remote, Czech Republic Link Group Full timeRequired Skills & Experience 5–8 years of hands-on experience in data engineering or similar roles. Strong knowledge of AWS services such as S3, IAM, Redshift, SageMaker, Glue, Lambda, Step Functions, and CloudWatch. Practical experience with Databricks or similar platforms (e.g., Dataiku). Proficiency in Python or Java, SQL (preferably Redshift), Jenkins,...
-
Senior Data Engineer @ 1dea
1 week ago
Remote, Czech Republic 1dea Full timemin 5 yrs of relevant experience Solid experience with AWS services (S3, IAM, Redshift, Sagemaker, Glue, Lambda, Step Functions, CloudWatch) Experience with platforms like Databricks, Dataiku Proficient in Python / Java, SQL – Redshift preferred, Jenkins, CloudFormation, Terraform, Git, Docker, 2-3 years of Spark – PySpark Good communication and SDLC...
-
Data Engineer
16 hours ago
Remote, Czech Republic Link Group Full timeMinimum 6 years of experience as a Data Engineer or in a similar role Strong expertise in SQL Server, T-SQL, and database design principles Proficiency in Python for writing clean, maintainable code Hands-on experience with Apache Spark for large-scale data processing Knowledge of Apache Hive for data warehousing solutions Experience with Apache Airflow for...
-
Data Engineer @ Experis Polska
1 week ago
Wroclaw, Czech Republic Experis Polska Full timeExpectations Proven experience in Azure Databricks (data engineering, pipelines, performance tuning) Proficiency in Python, PySpark, and SQL Familiarity with Azure DevOps (Repos, Pipelines, YAML) Experience with Unity Catalog and data workflow orchestration Strong communication and stakeholder management skills Data Engineer Opis Pracy Start...