Lead Data Engineer at Heap
San Francisco, CA, US
Data is king. It doesn’t matter how fancy your visualizations or predictive models are: if you don’t have the right data, you can’t do anything.

Your mission: automatically collect all the customer data in the world.

Heap has revolutionized user behavior analysis with a simple philosophy: capture all the data. Traditionally, companies needed to write lots of messy custom tracking and ETL code to get all their customer data in one place. This keeps them from higher-level insights. We aim to completely automate away this effort, so that people can focus on science, not janitorial work.

Having a complete unified dataset is core to Heap’s value and long-term vision. This is a unique opportunity to found and build the team that will sit upstream of all of our other work. You’ll build the software to ingest billions of events per month from disparate sources, provide technical leadership for your colleagues, and work closely with our infrastructure and product teams to ensure that this data is organized usefully and presented beautifully.

Some projects we’ve built:

Jonathan built an integration to automatically bring Salesforce data into Heap, which required turning offline sales touch points into a stream of behavioral data, ultimately allowing businesses to better understand how sales affects user behavior (and vice versa).
Dorian built the event visualizer for iOS, allowing people who have never coded before to set up tracking on their mobile app by just tapping around.

Some projects we would like to build:

React Native auto-capture: more and more high-quality apps are built in React Native, and capturing data there isn’t quite like any other platform.
Auto-collecting cost data from Facebook and Google ads: let our marketing customers directly measure the efficacy of their work without having to fight endless spreadsheets and data dumps.
Open source: eventually, we will open-source all the integrations we create and make it as easy as possible to build new ones.

You’ll like this role if:

You're excited to build a new team from the ground up.
Scaling data infrastructure sounds like a fun challenge. We're ingesting a large volume of high-value data, so designing a reliable system is extremely important.
You’re comfortable working in a broad range of technologies. On a normal day you might find yourself working on a Node.js developer-facing API, a React Native data capture library, or a Scala Kafka consumer.
You enjoy thinking about how to model data. There are lots of possible data sources, from mobile frameworks to payment processors to third-party APIs, each with its own complexities. We want to bring it all into a simple, unified analysis experience.

Heap has raised $40M in funding from NEA, Y Combinator, Menlo Ventures, SV Angel, Sam Altman, Garry Tan, Alexis Ohanian, Harj Taggar, Ram Shriram, and others.

We work in San Francisco and can cover relocation costs. We'd love to hear from you!
