Data Engineer - StartUp Experience

United States, New York, New York City

Full-time

icon
$ 160 - 250 K/year

experience
5 - 7 yrs

experience
5 - 7 yrs

Posted on: a month ago

Skills

python
pandas
etl
machine learning
startup
data engineering
gcp

About this role

You'll play a crucial role in building and maintaining our ETL infrastructure and pipelines, as well as optimizing ML systems for forecasting grocery demand. You’ll also work directly with our customers to ensure successful technical integrations.

Around 80% of your time will be spent on engineering:

  • Designing and implementing scalable data pipelines for processing large-scale data across multiple customers

  • Optimizing machine learning pipelines for demand forecasting

  • Contributing to our backend services to serve our mobile and web applications

Around 20% of your time will be spent on customer facing work:

  • Collaborating directly with customers' technical teams to ensure smooth data integrations and system implementations

  • Working with customers’ internal supply chain and store operations teams to understand and then implement their unique business logic

Experience :
Seniority

  • 5+ years of experience as a software engineer at a startup or technical founder

  • Work experience

  • Experience writing data pipelines in Python and Pandas

  • Experience working with enterprise customers in traditional industries (e.​g.​,​ grocery,​ healthcare,​ airlines)

  • Experience working at Palantir as a forward deployed engineer or some customer facing role

Skills:

Python, Pandas, Dagster, Airflow, PySpark, Dask, FastAPI, React, Next.js, Typescript, GCP, Postgres, Terraform, Docker, BigQuery

Education

  • Undergraduate degree in CS from a top school

Hard skills

  • Familiarity with data ingestion,​ transformation,​ and export workflows

  • Experience with data orchestration tools (Dagster,​ Airflow)

Soft skills

  • Ability to understand and translate customer business operations into technical solutions

  • Proactive and comfortable working without highly structured specs