Technical Data Engineer for AI Data Ingestion

10 hours ago


Remote, Czech Republic Speechify Full time
About Us

Speechify is revolutionizing the way people consume information, making it easier to listen to articles, documents, and books at your own pace. We're a fast-growing company that's #1 in our category and experiencing exponential growth.

We're looking for a skilled Senior Software Engineer to join our Data Acquisition team, responsible for collecting data to support our model training operations. Our tight integration of infrastructure, engineering, and research work enables us to build high-quality datasets at petabyte-scale and low cost.

Job Description

We're seeking a talented individual to help us operate and extend our cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform. You'll collaborate closely with our Scientists to deliver richer data at bigger scale and lower cost, powering our next-generation models.

  • Design and implement efficient data ingestion pipelines using various technologies
  • Develop and maintain scalable cloud infrastructure using Terraform and GCP services
  • Collaborate with cross-functional teams to identify and prioritize new features and requirements
Requirements
  • Bash/Python scripting experience in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts
  • Experience with cloud providers, preferably GCP
  • Strong communication skills and ability to work in a collaborative environment
What We Offer
  • A fast-paced environment where you can make a significant impact
  • An entrepreneurial-minded team that supports innovation and risk-taking
  • A flat structure with minimal management oversight, allowing you to focus on your work
  • A competitive compensation package and opportunities for professional growth

At Speechify, we're committed to building a diverse and inclusive workplace. If you're passionate about working with cutting-edge technology and making a difference in the world, please consider joining our team.



  • Remote, Czech Republic PAR Data Central Full time

    Position Overview: As an AI/ML Engineer, you will play a crucial role in enhancing our platform’s capabilities by developing and refining machine learning models that drive accurate forecasting, event analysis, data-driven decision-making, and more. You will collaborate closely with product owners and engineering teams to implement scalable AI solutions...


  • Remote, Lisbon, Frankfurtammain, Heidelberg, Warszawa, Czech Republic Data Science UA Full time

    - Bachelor degree in Computer Science, similar technical field of study or equivalent practical experience.- Commercial experience developing Spark Jobs using Scala and Java- Experience in data processing using traditional and distributed systems (Hadoop, Spark, AWS - S3) and designing data models and data warehouses.- Strong understanding and application of...

  • Data Engineer

    7 days ago


    Remote, Lisbon, Frankfurtammain, Heidelberg, Warszawa, Czech Republic Data Science UA Full time

    About Data Science UA:Data Science UA is a leading service company with a strong focus on data science and AI expertise.We are passionate about fostering the largest Data Science Community in Eastern Europe, and our journey began in 2016 with the organization of the first Data Science UA conference.Our Mission:We aim to deliver innovative solutions that...


  • Remote, Czech Republic Speechify Full time

    Speechify is a cutting-edge platform that transforms information into audio content, revolutionizing the way we consume knowledge.We are seeking a highly skilled Senior Data Engineer to join our AI team and play a crucial role in designing, building, and maintaining robust data pipelines and systems. This individual will collaborate with data scientists,...


  • Remote, Czech Republic AVENGA Full time

    At Avenga, we are seeking an experienced AI-driven data engineer to join our team.About the RoleWe are looking for a highly skilled data engineer with expertise in designing and implementing scalable data pipelines for AI and Generative AI applications. As an AI-driven data engineer, you will be responsible for collaborating with AI engineers, data...

  • Data Engineer @

    1 day ago


    Remote, Warszawa, Czech Republic SquareOne Full time

    Required Skills & Qualifications:Previous experience as a data engineerTechnical expertise with data modeling techniques (Data Vault)Advanced expertise with ETL tools (Talend, Alteryx etc.)Advanced SQL programming experience, Python is a plusPrevious experience with agile methodologies in Software DevelopmentPrevious experience working with Data...


  • Remote, Kraków, Czech Republic Kontakt Full time

    5+ years of experience as a Data Engineer, Data Platform Engineer, or related role.Proficiency in Python, Scala, or Java for data processing and automation.Hands-on experience with big data frameworks like Apache SparkExperience with streaming and batch processing using KafkaStrong knowledge of cloud platforms (AWS) and their data processing...


  • Remote, Kraków, Wrocław, Warszawa, Czech Republic N-iX Full time

    Must have: Python Development: Minimum 5 years of professional experience in production environments, emphasising performance optimisation and code quality. Ingestion and modelling: Experience with Python and orchestration tools like Airflow is beneficial. SQL Proficiency: Advanced knowledge of SQL: At least one of PostgreSQL, MySQL, MSSQL Ability to write...

  • Data Engineer @

    22 hours ago


    Remote, Czech Republic AVENGA Full time

    3+ years of experience in data engineering, preferably supporting AI/ML applications.Proficiency in Python, SQL, and vector database query languages.Experience with relational, NoSQL, and vector databases (Snowflake preferred).Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (Azure AI Search, Blob Storage, Automation).Experience building...


  • Remote, Czech Republic Speechify Full time

    An Ideal Candidate Should Have  Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field. Proven experience as a Data Engineer or in a similar role and experience with ETL. Proficiency in programming languages such as Python and experience in SQL Big data tools: Data- and Delta-lakes Cloud: Bare-Metal, Hybrid...


  • Remote, Warszawa, Czech Republic SquareOne Full time

    Required Skills & Qualifications: Previous experience as a data engineer Technical expertise with data modeling techniques (Data Vault) Advanced expertise with ETL tools (Talend, Alteryx etc.) Advanced SQL programming experience, Python is a plus Previous experience with agile methodologies in Software Development Previous experience working with Data...


  • Remote, Czech Republic AVENGA Full time

    Strong organizational, analytical, and consulting skills. Knowledge of Data Mesh concepts and Data One Platform (Snowflake). Familiarity with Data Vault Concept and Design. Experience converting legacy architectures (e.g., PULSE) to Data Vault Design using Snowflake, Immuta, Collibra, and Cloud Security. Understanding of Commercial Processes in DIA and...


  • Remote, Czech Republic SquareOne Full time

    Experience: 3+ years in AI/ML engineering with exposure to both classical machine learning methods and language model-based applications. Technical Skills: Advanced proficiency in Python and experience with deep learning frameworks like PyTorch or TensorFlow. Expertise with Transformer architectures, hands-on experience with LangChain or similar LLM...


  • Remote, Czech Republic PAR Data Central Full time

    Hands-on experience in writing automated test cases and developing test automation QA frameworks, with at least four years of experience. Proficiency in programming languages such as C# and JavaScript is essential. Hands-on experience with CSS, HTML, and MSSQL for data retrieval and verification. Solid experience with Object-Oriented Programming...

  • Senior Data Engineer

    15 hours ago


    Remote, Czech Republic Speechify Full time

    An Ideal Candidate Should Have Bachelor's or Master's degree in Computer Science, Engineering, or a related field.Proven experience as a Data Engineer or in a similar role and experience with ETL.Proficiency in programming languages such as Python and experience in SQLBig data tools: Data- and Delta-lakesCloud: Bare-Metal, Hybrid infrastructureGood to...

  • Project Manager

    1 day ago


    Remote, Warsaw, Wrocław, Białystok, Kraków, Gdańsk, Czech Republic Addepto Full time

    🎯 What you’ll need to succeed in this role: Bachelor’s or higher in Computer Science, Mathematics, Physics, or related field. Proven track record in managing technical teams and leading end-to-end projects. Experience working with corporate clients. Strong critical thinking and problem-solving abilities. Excellent communication and presentation...


  • Remote, Czech Republic Gen AI Works Full time

    We are an innovative startup leveraging AI technology to build cutting-edge products at a global scale.Software Architect and Tech LeadThis role is a key part of our technical initiatives, driving the architectural design and end-to-end implementation of our software products. As a senior developer and tech lead, you will be responsible for leading a skilled...


  • Remote, Czech Republic AVENGA Full time

    About AvengaAvenga is a cutting-edge organization driving innovation in data engineering.Job DescriptionWe are seeking a highly skilled Data Engineering Lead to spearhead our data architecture efforts and drive business growth through data-driven insights.Key Responsibilities:Technical Governance: Ensure adherence to best practices and regulatory compliance...


  • Remote, Czech Republic Provectus Full time

    Experience in data engineering; Experience working with Cloud Solutions (preferably AWS, also GCP or Azure); Experience with Cloud Data Platforms (e.g., Snowflake, Databricks); Proficiency with Infrastructure as Code (IaC) technologies like Terraform or AWS CloudFormation; Experience handling real-time and batch data flow and data warehousing with tools and...


  • Remote, Czech Republic Gen AI Works Full time

    Bachelor’s or Master’s degree in Computer Science, Software Engineering, or related field. 5+ years of experience in software development, with at least 1 year in a leadership role. Strong proficiency in modern web development stack both frontend and backend Experience with at least one of the cloud providers (AWS, Azure, GCP) Familiarity with API...