Data Engineer

Sanoma Learning
Department: Data Engineer
Type: Remote
Region: EU
Location: Warsaw, Mazowieckie, Poland
Experience: Mid-Senior level
Estimated Salary: PLN 120,000 - PLN 180,000
Skills: Python, AWS, SQL, NoSQL, ETL, ELT, Data Modeling, S3, Glue, Lambda, DynamoDB, Athena, CI/CD, Data Pipelines
Job Description

Posted on: April 23, 2026

This position is open only to candidates who are legally residing in Poland and can work with us through a registered business entity in Poland (e.g., sole proprietorship/JDG or limited liability company/sp. z o.o.). As part of onboarding, we kindly ask new joiners to visit our Warsaw office on the first day for a short introduction, identity verification, and equipment pick-up.

Location: Warsaw, Poland
Employment type: B2B contract
Work model: 100% remote
Business travel: Occasional, up to once per quarter (e.g., onboarding sessions or workshops)
Seniority level: Mid-level or Senior (4 open positions)

About Us

Sanoma Learning is the leading European learning company, serving over 20 million students in 11 countries. We offer printed and digital learning materials as well as digital learning and teaching platforms for primary, secondary, and vocational education. The development of our methodologies is based on deep teacher and student insight and a real understanding of their needs. By combining our educational technologies and pedagogical expertise, we create learning products and services with the highest learning impact. In our Technology organization, you will join the largest cross-cultural community of Sanoma Learning and contribute to the digital transformation and future of education in Europe.

Project Description

Content as a Service (CaaS) is a strategic central capability that enables Sanoma Learning to efficiently scale and innovate its digital offerings. It provides a single, enterprise-grade service to ingest, enrich, and deliver all learning content and educational metadata, from both print and digital sources, for use in digital customer-facing products and method creation.

Role Responsibilities

  • Design, develop, and maintain data pipelines to ensure reliable, scalable, and high-performance data flows.
  • Work on the central storage layer, ensuring data availability, consistency, and security.
  • Collaborate with data scientists, analysts, and software engineers to support data-driven initiatives.
  • Implement and enforce best development practices in code quality, testing, monitoring, and deployment.
  • Optimize data infrastructure for performance and cost-efficiency in AWS environments.
  • Leverage AWS services such as S3, Glue, Lambda, DynamoDB, and Athena to manage and query data.
  • Contribute to projects involving Generative AI (GenAI) by enabling data access, preparation, and integration with AI-driven solutions.
  • Troubleshoot and resolve issues across the data pipeline and storage systems.

Must-have Requirements

  • Proficiency in at least one programming language, preferably Python.
  • Strong knowledge of AWS cloud services (S3, Glue, Lambda, DynamoDB, Athena, etc.).
  • Solid understanding of databases (SQL and NoSQL), including schema design, optimization, and query performance.
  • Hands-on experience with data processing pipelines (batch and/or streaming).
  • Strong foundation in software engineering best practices, including version control, CI/CD, and automated testing.
  • Experience with data modeling, storage formats, and ETL/ELT workflows.
  • Familiarity with Generative AI technologies and how data engineering supports AI-driven applications.
  • Strong problem-solving skills and ability to work in a collaborative, agile environment.

Nice-to-have Requirements

  • Knowledge of data orchestration tools (Airflow, Step Functions, Prefect).
  • Exposure to big data frameworks (Spark, Hadoop).
  • Understanding of data governance and security best practices.
  • Experience working with Content Management Systems (CMS) and content-centric data (e.g. articles, learning materials, metadata, versions, publishing workflows).

What We Offer

  • B2B contract for an indefinite period
  • Work-life balance and a supportive, informal atmosphere
  • Opportunities for professional growth and skill development
  • Work on modern data platforms (cloud environments, AWS stack)
  • Build and maintain data pipelines supporting AI-driven solutions in education
  • Hands-on experience with a modern data stack (ETL/ELT, CI/CD, orchestration tools)
  • Collaborate with Data Engineers, Data Scientists, Product teams, and AI Engineers
  • Work in a flexible, results-oriented, and collaborative environment
  • Be part of an international team working across European markets
  • Contribute to projects with real impact on digital education
Originally posted on LinkedIn
