Questions d'entretien Senior Data Engineer chez Multiplier

Aller au contenu Aller au pied de page

Multiplier

Entretiens

Entretien pour Senior Data Engineer

9 juin 2026

Candidat à l'entretien anonyme

Aucune offre

Expérience négative

Recherches associées: Avis sur Multiplier | Offres d’emploi chez Multiplier | Salaires chez Multiplier | Avantages sociaux chez Multiplier

Entretiens chez Multiplier Entretiens d’embauche pour Senior Data Engineer chez Multiplier Entretien chez Multiplier

Entretien

The interview experience was subpar. The HR didn't verbally inform that I'd need to have PySpark set up locally. It was mentioned in the interview email but hidden in the blocks of unnecessary text. So that led to a bit of back & forth. Later, when the interview happened, the interviewer expected me to remember the entire PySpark & pandas syntax without taking the help of AI to write the same piece of code that you will use AI to write for daily work. While the interview problem itself is not difficult but expecting to memorise the syntax for the whole thing seemed a bit unfair.

Questions d'entretien [1]

Question 1

Multiplier currently pays out salaries to various members under our payroll. Payouts to members are recorded in a raw ingestion file. This is loaded into a Google Sheet for your reference. The Product department needs you to build a pipeline to transform this raw data into a clean, query-able format for their analytics. Notes on Data: This is raw ingestion data. You may encounter inconsistent date formats, nested JSON strings, or mixed currencies. amounts: This contains a JSON-like string representing various components of the payout (Salary, Tax, Bonus). Part 1: Architecture & Modeling Before writing code, verbalise a strategy for this pipeline: Target Schema: Design a Star Schema (or appropriate data model) that this data should be transformed into to best answer the business questions below. Ingestion Strategy: How would you handle this file if it arrived daily? (Consider duplicates, partitioning, etc.) Part 2: Implementation Choose ONE of the following options based on your preferred stack: Option A: Python/DataFrame (Pandas, Spark, Polars) Implement a transformation script that reads the raw CSV and outputs the answers. Option B: SQL / ELT (Postgres, Snowflake, BigQuery) Assume the raw CSV data has already been loaded into a staging table (raw_payments) where all columns are currently TEXT/VARCHAR type. Write the SQL query to transform this raw table into your target schema and answer the business questions. Business Questions to Answer: Total Payouts: What is the total amount disbursed per currency in May 2023? Currency Analysis: What is the average salary per currency by Department? Note: You do not need an FX table; treat each currency as a separate group. Data Cleaning: Flatten the amounts column so that salary, tax, and bonus are distinct columns (or rows, depending on your modeling choice in Part 1). Constraints: You do not need to connect to a real database. Output the final results to the console or a clean CSV.

Répondre à cette question

Multiplier

Entretien pour Senior Data Engineer

Entreprises suivies

Recherche d’emplois

Entretien pour Senior Data Engineer