Remote Senior Data Scientist, Machine Learning Engineer @ Shelf

6 days ago


Remote Wrocław Warszawa Kraków, Czech Republic Shelf Full time

3+ years of professional experience researching and shipping ML-based solutions, with strong Python skills and a track record of delivering fast without sacrificing quality Proven experience in owning research problems end-to-end, starting from initial data analysis, through iterative research phases to delivering on production Practical NLP/LLM experience: transformers, embeddings, prompt design, and evaluation; ability to choose and justify metrics and methodologies Strong backend fundamentals: designing RESTful services, schema design, data modeling, and performance tuning for SQL and NoSQL stores Data processing skills: pandas/NumPy; experience with batch/stream processing and ETL orchestration (e.g., Airflow, Step Functions) Strong English verbal and written communication As a plus LLM ops and safety: eval frameworks (e.g., RAGAS), guardrails, red-teaming, prompt optimization at scale Model optimization: quantization, distillation, pruning; GPU/accelerator-aware serving Experience with AWS ML stack (SageMaker, Batch, Step Functions, Lambda, SQS/SNS, DynamoDB, ECS, EC2, S3) Vector databases and search: Pinecone, Elasticsearch, pgvector, FAISS, or DeepLake Background in reinforcement learning, agent frameworks, or autonomous agents Publications, open-source contributions, GitHub portfolio The R&D department plays a pivotal role in driving Shelf to disrupt the market. We are looking for Machine Learning experts that are able to deliver end to end with a blend of experience: Python engineering, ML engineering, and pragmatic Data science and Machine learning research. You will ship end-to-end features—from problem framing and experimentation to service deployment, and ongoing operations—quickly and with high quality. Your work will power ML- and LLM-driven services used by top enterprises like Amazon, Mayo Clinic, AmFam, and Nespresso. This role requires strong Python engineering capabilities coupled with a strong ability to deliver robust ML solutions, along with ML research literacy to choose sound methodologies, define metrics, and evaluate different approaches effectively. You’ll work in an agile environment, move fast, and own what you ship. What Shelf Offers B2B contract Company Stock Options Hardware: MacBook Pro Modern technical stack. Develop open-source software Premier AI development environment: GitHub Copilot, Claude Code, OpenAI, TypingMind, v0, MCP Servers, plus credits to experiment with emerging AI tools About Shelf There is no AI Strategy without a Data Strategy. Getting GenAI to work is mission-critical for most companies, but 90% of AI projects haven't deployed. Why? Poor data quality—it’s the #1 obstacle companies face getting GenAI into production. Shelf unlocks AI readiness. We provide the core infrastructure that enables GenAI to be deployed at scale. We help companies deliver more accurate GenAI answers by eliminating bad data in documents and files before they go into an LLM and create bad answers. We’re partnered with Microsoft, Salesforce, Snowflake, Databricks, OpenAI and other leaders bringing GenAI to the enterprise. Our mission is to empower humanity with better answers everywhere. ,[Own end-to-end delivery: ideate, research, prototype, productionize, and operate ML-powered services with an expectation to iterate and ship frequently, Stand up robust training/evaluation pipelines: dataset curation, labeling/feedback loops, experiment tracking, offline/online metrics, and A/B testing, Solve problems using sound methodology, evaluate approaches along with , Transform ML models and LLM workflows (including RAG) into reusable, versioned, observable production services with CI/CD, Collaborate with Product Owners to shape our product and requirements, Conduct and receive code reviews; champion engineering excellence, testing discipline, and documentation, Leverage AI coding assistants to accelerate development and create internal agents that automate parts of the engineering workflow, Share learnings through demos, docs, and knowledge sessions; contribute to a culture of continuous improvement] Requirements: Python, NLP, RESTful, LLM, SQL, NoSQL, pandas, NumPy, ETL, Data analysis, AWS ML stack, Pinecone, Elasticsearch, pgvector, FAISS, DeepLake, GitHub Additionally: Stock options, GitHub Copilot subscription, LLM credits.


  • Engineering Manager

    6 days ago


    Remote, Wrocław, Warszawa, Kraków, Czech Republic Shelf Full time

    Note: This position does not assume developing system architecture and daily coding. 4+ years of Engineering Management experience leading cross-functional, backend-centric teams (10+ engineers) in product companies operating multi-tenant SaaS systems. 5+ years as a Senior Software Engineer, Tech Lead, or Staff Engineer with deep Node.js or Python...


  • Remote, Wrocław, Warszawa, Kraków, Czech Republic Shelf Full time

    Over 5 years of professional software engineering experience, including more than 3 year specializing in Node.js Deep understanding of distributed systems, concurrency patterns, and event-driven architectures Hands-on experience with AWS or Azure cloud primitives - you've personally provisioned resources, configured services, and built scalable systems using...


  • Remote, Warsaw, Czech Republic hubQuest Full time

    What we expect 5+ years of professional experience in Data Science or ML Engineering, including production deployments MSc or PhD in Computer Science, Statistics, Mathematics, Physics, or related technical field Strong Python programming skills, including software engineering practices (OOP, modular code design, testing) Solid experience with ML frameworks...


  • Kraków, Czech Republic VirtusLab Full time

    What we expect in general The ability to work in hybrid model from Krakow office is a must (3 days per week). Hands-on experience in deploying Python projects. Strong experience in writing high-quality Python code. Experience with orchestration tools such as Airflow. Knowledge of Spark or other distributed data processing tools. Experience with Kubernetes...


  • Warszawa, Czech Republic Bayer Full time

    MSc in Computer Science, Machine Learning, Statistics, Mathematics, Quantitative Finance/Economics/Marketing, or a related discipline. 5+ years of experience in data-related roles. Practical experience in at least one, preferably two, of the following areas: machine learning, generative AI, forecasting, or mathematical optimization. Proficiency in Python and...

  • Data Scientist

    3 days ago


    Remote, Warszawa, Gdańsk, Wrocław, Białystok, Kraków, Czech Republic Addepto Full time

    🎯 What you’ll need to succeed in this role: At least 3+ years of proven commercial experience designing and implementing scalable AI solutions (Machine Learning, Predictive Modeling, Optimization, NLP, Computer Vision, GenAI). Proficiency in developing ML algorithms from scratch to production deployment. Strong programming skills in Python: writing...


  • Remote, Czech Republic Link Group Full time

    Required Skills & Experience 5–8 years of hands-on experience in data engineering or similar roles. Strong knowledge of AWS services such as S3, IAM, Redshift, SageMaker, Glue, Lambda, Step Functions, and CloudWatch. Practical experience with Databricks or similar platforms (e.g., Dataiku). Proficiency in Python or Java, SQL (preferably Redshift), Jenkins,...


  • Remote, Czech Republic 1dea Full time

    min 5 yrs of relevant experience Solid experience with AWS services (S3, IAM, Redshift, Sagemaker, Glue, Lambda, Step Functions, CloudWatch) Experience with platforms like Databricks, Dataiku Proficient in Python / Java, SQL – Redshift preferred, Jenkins, CloudFormation, Terraform, Git, Docker, 2-3 years of Spark – PySpark Good communication and SDLC...


  • Remote, Warszawa, Czech Republic Acaisoft Full time

    5+ years of experience in software engineering, simulation systems, data science, or ML infrastructure. Strong command of Python and systems-level programming. Experience designing scalable task pipelines, browser or API simulations (e.g. Playwright, Selenium), or distributed compute frameworks. Understanding of RL concepts - reward modeling, environment...


  • Remote, Warszawa, Czech Republic Acaisoft Full time

    3+ years of experience in software engineering, simulation systems, data science or ML infrastructure. Strong command of Python and systems-level programming. Understanding of RL concepts - reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops. Experience designing scalable task pipelines, browser or API simulations...