Big Data Lead Engineer @

1 day ago


Remote, Czech Republic ALM Services Technology Group Full time

We are currently seeking resources with experience in building and taking to production low latency, Massive Parallel Processing (MPP) data and analytic systems, ideally on Hadoop, Scala and Spark. The Candidate that meets the following criteria:

∙Computer Science Degree/Student 
∙3+ years strong native SQL skills 
∙3+ years strong experience in database and data warehousing/data lake concepts and techniques. Understand: relational and dimensional modeling, star/snowflake schema design, BI, Data Warehouse operating environments and related technologies, ETL, MDM, and data governance practices
∙2+ years' experience working in Linux 
∙2+ years' experience with Hadoop, Hive, Impala, HBase, and related technologies 
∙3+ years strong experience with low latency (near real time) systems and working with Tb data sets, loading and processing billions of records per day 
∙3+ years' experience with Spark, Scala 
∙3+ years' experience with MPP, shared nothing database systems, and NoSQL systems 
∙Ability to work in a fast-paced, team-oriented environment 
∙Ability to complete the full lifecycle of software development and deliver in an Agile/Scrum environment, leveraging Continuous Integration/Continuous Development

ALM Services Technology Group develops end–to–end Web and Mobile Solutions. We work closely with customers usually in long term relations.
Our mission is to create the best possible environment of work for our people, engage in innovative projects, and help to strengthen and develop new competences.

ALM was founded in 2009 in Poland. In 2022 we opened a branch in Budapest, and we are actively working on growing our team in Hungary.

ALM Services Technology Group comprises creative, open-minded individuals who develop innovative solutions daily to help our clients expand their businesses. 

Our mission is to create the best possible environment of work for our people, engage in innovative projects, and help to strengthen our competences.

Since 2020 we have been cooperating with our partner, a multinational company working in a med-tech and analytics space. A recognized global leader still willing to challenge the status quo to improve patient care.  

We have helped and supported them through various stages of growth. We have built multiple applications for them. We have the core team and know-how in place to help them grow further.  

Our Client is developing our next-generation Global Data Lake and Analytics platform to support analytics and insights against hundreds of Terabytes and Petabytes of health care data, and doing it in near real-time.

,[Data Engineer on the Data Lake/Hadoop application. Work on building data pipelines to load and manipulate data onto the Data Lake. Optimize data architecture for consumption, utilization and analytics with data on Hadoop, including for data science, machine learning and statistical use cases , Help lead the charge on a data lake store strategy, ensuring rapid delivery while taking responsibility for applying standards, principles, theories, and concepts, Responsible for design and delivery of data models, which power BI initiatives, dashboards, syndicated reporting, and ad-hoc data exploratory canvases , Work with data architects on the logical data models and physical database designs optimized for performance, availability and reliability , Tuning and optimization of backend and frontend data operations , Serve as a query tuning and optimization technical expert, providing feedback to team , Design and develop ETL and master data management processes , Scripting and automation to support development, QA and production database environments and deployments to production , Define and help enforce data governance and security policy , Mentors development team members , Proactively helps to resolve difficult technical issues , Provide technical knowledge to teams during project discovery and architecture phases , Keep management informed of work activities and schedules , Assess new initiatives to determine the work effort and estimate the necessary time-to-completion , Document new development, procedures or test plans as needed , Participate in data builds and deployment efforts. Help mature our Continuous Integration and Continuous Deployment methodologies , Participate in projects through various phases , Performs other related duties as assigned , Partner with the business units to develop effective solutions that solve business challenges, Should be able to lead small team of Big data engineers and manage them] Requirements: Hadoop, Hive, Scala, Spark, SQL, ETL, Linux, DevOps, RDBMS Tools: Agile, Scrum. Additionally: Private healthcare, Flat structure, Small teams, International projects, Salary in foreign currency, Workation, Flexible hours, Certificates.
  • Big Data Engineer

    23 seconds ago


    Remote, Czech Republic Link Group Full time

    Must-Have QualificationsAt least 3+ years of experience in big data engineering.Proficiency in Scala and experience with Apache Spark.Strong understanding of distributed data processing and frameworks like Hadoop.Experience with message brokers like Kafka.Hands-on experience with SQL/NoSQL databases.Familiarity with version control tools like Git.Solid...


  • Remote, Czech Republic SoftServe Full time

    IF YOU AREExperienced with Python and PySparkProficient in Databricks Lakehouse architecture and principlesSkilled in designing data models, building ETL pipelines, and wrangling data to solve business challengesKnowledgeable in Azure cloud technologies, including Azure Data Factory, Azure DevOps, Azure Synapse, and Azure Data Lake ServicesAdvanced in SQL...


  • Remote, Czech Republic ALM Services Technology Group Full time

    We are seeking a highly experienced Data Engineer to lead our team in building a next-generation Global Data Lake and Analytics platform.This role involves designing and delivering data models, which power business intelligence initiatives, dashboards, syndicated reporting, and ad-hoc data exploratory canvases. You will work closely with data architects on...


  • Remote, Czech Republic Matrix Global Services Full time

    Job DescriptionWe are seeking a highly skilled Lead Data Engineer to join our team at Matrix Global Services.About the RoleThe successful candidate will lead development projects of critical, high-availability cloud-scale services and APIs. They will support clients with large amounts of data and scalability in mind.Key Responsibilities:Leverage expertise in...

  • Lead Data Scientist

    4 hours ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    What you'll need to succeed in this role:6+ years of commercial experience in designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI).Experience in Machine Learning projects leadership and team mentoring.Proficiency in developing ML algorithms from scratch to production...


  • Remote, Czech Republic Link Group Full time

    About the RoleWe are seeking an experienced Data Engineering Lead to join our team at Link Group. This is a unique opportunity to lead and mentor a team of data engineers while actively contributing to pipeline development, architecture design, and system optimization.Key ResponsibilitiesDesign and implement scalable, high-performance data architectures that...


  • Remote, Kraków, Czech Republic N-iX Full time

    Company OverviewN-iX is a leading IT services company that provides innovative solutions to businesses across various industries.Job DescriptionWe are seeking a highly skilled Data Engineer to join our team. In this role, you will be responsible for designing, implementing, and maintaining sophisticated data pipelines in Palantir Foundry.Required Skills and...


  • Remote, Czech Republic Link Group Full time

    7+ years of experience in data engineering, including 2+ years in a leadership or architecture role.Expertise in Airflow, Snowflake, DBT, and advanced knowledge of Python, Spark, SQL, and AWS.Proven experience in designing and deploying large-scale data warehouses and building complex data models.Hands-on experience with third-party APIs and external data...


  • Remote, Wrocław, Czech Republic Comscore (via CC) Full time

    The candidate must have:2+ years of experience with LinuxSolid knowledge of Linux (bash, threads, IPC, filesystems; being power-user is strongly desired, understanding how OS works so you can benefit from performance optimizations in production but also in daily workflows)Huge need to drive projects of the future, improve stuff, risk taking mindset - covered...

  • Data Engineer

    22 seconds ago


    Remote, Czech Republic Link Group Full time

    Must-Have QualificationsAt least 3+ years of experience in data engineering.Strong expertise in one or more cloud platforms: AWS, GCP, or Azure.Proficiency in programming languages like Python, SQL, or Java/Scala.Hands-on experience with big data tools such as Hadoop, Spark, or Kafka.Experience with data warehouses like Snowflake, BigQuery, or...


  • Remote, Czech Republic Matrix Global Services Full time

    +3 years of experience in large scale, distributed server side, backend developmentExtensive experience in stream & batch big data pipeline processing using Apache SparkExperience with Linux, Docker, and KubernetesExperience in working with cloud providers (e.g., AWS, GCP)Strong experience with event streaming platforms like Kafka or its alternatives, such...


  • Remote, Warszawa, Czech Republic Syncron Full time

    About SyncronSyncron is a leading SaaS company with over 20 years of experience, specializing in aftermarket solutions. Our Service Lifecycle Management Platform offers domain-fit solutions for supply chain optimization, pricing strategy, service fulfillment, warranty management, field service management, service parts management, and knowledge management.We...


  • Remote, Warszawa, Czech Republic SquareOne Full time

    Job DescriptionSquareOne is seeking a highly skilled Data Engineer to join our team. As a key member of our data engineering team, you will be responsible for developing robust and maintainable data pipelines that meet the needs of our business stakeholders.About the RoleThis is a challenging opportunity for an experienced data engineer to take ownership of...


  • Remote, Kraków, Czech Republic N-iX Full time

    4+ years of experience in data engineering, preferably within the pharmaceutical or life sciences industryProven experience as a Data Engineer with a focus on cloud-based solutions.Strong proficiency in Python, and PySparkHands-on experience with Azure cloud servicesExpertise in data modeling, database design, and optimization.Familiarity with Big Data...

  • AI/ML Engineer @

    7 days ago


    Remote, Czech Republic PAR Data Central Full time

    Position Overview: As an AI/ML Engineer, you will play a crucial role in enhancing our platform's capabilities by developing and refining machine learning models that drive accurate forecasting, event analysis, data-driven decision-making, and more. You will collaborate closely with product owners and engineering teams to implement scalable AI solutions that...


  • Remote, Czech Republic ETFbook Full time

    As a key member of the ETFbook team, you will be responsible for designing and maintaining a scalable data platform using Python, .NET, and Azure Services.About UsETFbook is a fast-paced ETF data analytics startup. We offer a cutting-edge platform providing businesses with actionable insights.You'll join a highly skilled data engineering team, part of a...


  • Remote, Kraków, Warszawa, Kyiv, Czech Republic TechHunt Full time

    Qualifications:Strong proficiency in Scala and the ability to design and build robust backend services. Experience with Python for data pipeline development, ETL processes, or data integration Solid understanding of database systems (SQL/NoSQL) and experience working with large datasets. Experience with K8S, writing and maintaining Helm...


  • Remote, Kraków, Wrocław, Warszawa, Czech Republic N-iX Full time

    Must have:Python Development: Minimum 5 years of professional experience in production environments, emphasising performance optimisation and code quality.Ingestion and modelling:Experience with Python and orchestration tools like Airflow is beneficial.SQL Proficiency: Advanced knowledge of SQL:At least one of PostgreSQL, MySQL, MSSQLAbility to write complex...


  • Remote, Warszawa, Czech Republic Sunscrapers Full time

    At Sunscrapers, we are looking for a skilled Software Engineering Lead to join our team in developing a holistic data platform for a leading US-based healthcare company.The ideal candidate will have strong experience in designing and optimizing data models in Django, managing and maintaining Docker environments, and utilizing AWS services and infrastructure...


  • Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    About AddeptoAddepto is a cutting-edge consulting and technology company specializing in AI and Big Data. We partner with top-tier global enterprises and pioneering startups to deliver innovative data projects.Exciting Project OpportunitiesDesign and Development of Azure Data Platform: Migrate SSAS cube objects and SSIS pipelines from on-premises SQL Server...