
Data Engineer
Job Description
Posted on: August 6, 2025
About Us
At Sapia.ai, we're pioneering the future of ethical, AI-powered hiring. With millions of candidates engaging with our Chat Interview every year, we are redefining how talent is discovered, assessed, and nurtured. Our products are trusted by some of the world's most innovative companies, and we're just getting started.
Why this role exists
Data is the backbone of every intelligent decision we make—from real-time model feedback to enterprise-grade analytics. As we scale our AI-native platform, the complexity, volume, and velocity of our data demand engineering muscle, not middleware hacks.
This role is for a systems-minded data engineer who builds for performance, not patchwork. You’ll design cloud-native data infrastructure that fuels everything from ML to product analytics—with reliability, observability, and speed built in.
We’re looking for someone who thinks in DAGs, automates what others tolerate, and treats data quality as a first-class citizen.
What You’ll Do
- Pipeline Engineering: Design and implement robust ETL pipelines that move and transform data from diverse sources into our data warehouses and lakes.
- Cloud-Native Deployments: Work across AWS, Azure, or GCP to build scalable and high-performance data solutions.
- Reporting Infrastructure: Build and maintain systems that fuel business intelligence and ML workflows—empowering smarter decisions across the org.
- Data Visualisation: Use tools like Tableau, QuickSight, or Power BI to help teams see and understand their data.
- Database Management: Manage NoSQL (MongoDB, DynamoDB) and SQL databases (PostgreSQL, Redshift) for fast, reliable access to structured and unstructured data.
- Quality First: Identify and resolve data quality issues using tools like dbt or soda to ensure data accuracy, consistency, and trust.
- Startup Mentality: Bring energy, flexibility, and problem-solving creativity to a fast-paced, high-impact environment.
What You Bring
- 1+ years of hands-on experience in data engineering, ideally in a fast-moving, agile team.
- Proficient in Python and SQL, writing clean, efficient code for data transformation and manipulation.
- Experience working with cloud platforms (AWS, Azure, GCP).
- Familiarity with both NoSQL (MongoDB, DynamoDB) and SQL databases (PostgreSQL, Redshift).
- Exposure to data visualisation tools like Tableau, QuickSight, or Power BI.
- Bonus points for experience with:
- Databricks and Apache Spark
- Data ingestion tools like Fivetran, Airbyte, or Matillion
- A solid grasp of statistics and machine learning fundamentals is a plus.
How You Work
You take ownership of your code, your outcomes, and your impact.
You believe in radical accountability and generous collaboration. You celebrate wins and own the misses.
You care deeply about the customer experience and build with empathy.
You thrive in an agile, remote-first environment—balancing autonomy with teamwork.
You do what’s right, not what’s easy—for your team, our users, and the mission.
How we hire
We believe there’s more to you than your CV. So, we start with a chat interview, our very own, designed to uncover your potential and give you the space to share who you are in your own words. You’ll answer 5-6 role-relevant questions over chat and submit a short video answer. It’s untimed and mobile-friendly. Afterwards, you’ll get personalised insights from the interview, based on your responses. Our hiring team will reach out to you directly if they’d like to move forward.
We’re building something big. Something that matters. If you’re the kind of person who makes things happen, we’d love to hear from you.
Apply now
Please let the company know that you found this position on our job board. This is a great way to support us, so we can keep posting cool jobs every day!

RemoteITJobs.app
Get RemoteITJobs.app on your phone!

Software Engineer

Software Developer Full-Stack - GenAI Team - Remote

Software Developer, Software Developer

Software Architect (Contractor) – Composable Commerce & Event-Driven Design
