To land an analytics data science job at DoorDash, you have to pass one to two case study interviews. These interviews typically take place in the second round, following your initial screening, and they are designed to assess both your analytical skills AND your coding prowess.
To be more specific, analytics case studies test your ability to
DoorDash analytics case studies fall into three categories:
SQL Analytics- DoorDash SQL case studies ask you to investigate a business problem AND write SQL code to produce relevant metrics.
Product Metrics- This type of DoorDash case study focuses entirely on your product sense. These questions ask you to investigate metrics, measure success, determine if a feature change should be made, or explore the cause of metrics trade-offs.
Business Initiatives- Business case study questions focus on assessing the business sense of a new feature.
Need some help? The following DoorDash case study guide provides everything you need to prepare! We have tips, a video overview, and sample DoorDash analytics case study questions.
No matter the company, there is a specific framework to follow for data analytics case study interviews. The steps are as follows: Ask clarifying questions. Use the components of the question to make assumptions. Form a hypothesis. Provide metrics and perform data analysis. Propose a solution.
Always remember: Make your recommendations ACTIONABLE. Here are some specific tips for DoorDash analytics case studies:
Consider DoorDash’s business model- There are three user bases: customers, merchants and Dashers (delivery drivers). Learn the pain points for each of these user bases. Your answer will likely address these pain points.
Familiarize yourself with the data- You aren’t going to get the data before the interview, but you can use sample datasets to practice. Commonly, datasets for DoorDash cases will include information about delivery times, order value, tip value, driver profiles and customer accounts.
Ask for clarification- DoorDash case studies tend to be vague to start. For example: Why is there so much variation between order times and pickup? Practice asking clarifying questions for business problems related to DoorDash.
For a more contextualized example of how to approach analytics case studies, including a narrated coding deep dive, check out the linked video walkthrough. It is very similar to what you might face at DoorDash. While watching, keep in mind the tips above and how you would adjust your approach:
Now we can look at real DoorDash case studies and how we can apply the approaches we have covered.
Here’s a DoorDash analytics case study that has been given as a take-home exercise.
Analyze the provided data and generate some specific recommendations on how our business can improve. Provide any supporting analysis and state your assumptions in your work.
The dataset includes information like:
With Doordash take-home assignments, you should start by asking questions. Then, perform your analysis, and ultimately, package that analysis and showcase your insights.
1. Start by Asking Questions
You can start with some broad questions like: What can I learn from this dataset, and what potential insights can I generate that would be of value? As you develop your questions, also be thinking about specific metrics that you can pull to help answer them.
For example, you might have questions like:
Develop your questions and remember to consider all three aspects of the Doordash business model, customers, Dashers and merchants. Answering these questions, for example, would provide business insights affecting all three of these user groups.
Additionally, you should be thinking about metrics. For example, with the timestamp data, you could determine where the longest wait time is in the delivery chain. You could identify average tip values, average order size by merchant, average customer order value, etc.
2. Document Your Work
As you perform your analysis, provide examples of your work. Here’s a look at a Doordash analytics case study project on Github. The user provides visualizations, as well as detailed SQL queries used in the analysis.
Here’s a look at a visualization from the project, showing the correlation between discount and tip amount:
3. Provide Clear Recommendations
Finally, you want package your analysis and provide recommendations for the business. This is your chance to show off your business and product sense. Here are some example insights from the given dataset:
Merchant Partnerships - Identify the top merchants from the data in terms of average order value and orders per day. This data could used to reward these merchants through promotions or advertising and strength partnerships with these high-growth stores.
Operational Efficiency - Identify the longest wait times in the delivery chain. This insight helps identify potential breakdowns in the delivery chain. How can you use this information to reduce delivery times?
Average Wages for Dashers - With the provided data, you could find the average tip amount, daily tip amount earned per driver, etc. This can be used as a marketing tool to encourage Dasher sign-ups.
These are just a few of the insights you could generate. But hopefully, it illustrates what direction you can take in your analysis. Ultimately, your goal should be to provide compelling recommendations that showcase your ability to work with data and your business sense.
Here are two more DoorDash interview questions that can help you practice and refine your approach for your upcoming Doordash interview:
Important starting question: Are the operating conditions the same in both cities, and how does that impact deliveries? How would we go about deciding which Dashers are assigned deliveries?
How to Solve This Question:
Different cities mean different conditions to operate within. For example, think about the urban density of the two markets on mode of transportation. In New York City where the buildings and restaurants are packed together tightly, Dashers may rely on bicycles or electric scooters. In a sprawling city like Charlotte where restaurants and homes are further apart, a Dasher would likely use a car.
As a result of how Dashers navigate their city, you would want to adjust your thinking for the unique traffic patterns and realistic delivery ranges that impact delivery times.
Next, when it comes to how to choose Dashers to be assigned orders, think algorithmically. What inputs and outputs might be used in the model? For example, in using a Dasher for a particular order, we have a feature input such as:
You can then measure these against output variables like customer satisfaction or actual completion time versus estimated completion time (on-time vs late deliveries), though it would likely result in some sort of bias that has to be accounted for.
Again, this isn’t a traditional analytics case study, because you don’t have a dataset to analyze. But if you had a dataset for this type of question, you might think about metrics like:
Here’s a featured answer from one of our community members, pdmhkr20:
When “online” is mentioned, does it refer to a notification sent to all drivers on their app, regardless of whether they are online or offline? Or is it specifically to alert drivers who are offline so that, based on their availability, they can decide whether to take up deliveries?
Assuming that this feature allows more drivers to deliver food to customers, it would lead to reaching more customers, which in turn would generate more orders and thus more revenue. Therefore, it is assumed that increasing the user base is the key driver for the success of this feature, and the analysis will be based on this assumption.
DoorDash has three types of users: customers, dashers, and merchants. The impact of the feature change on all these segments will be analyzed to understand its effect on the entire ecosystem.
Firstly, based on the number of customer orders, which represents the demand, push notifications or emails will be sent to dashers, asking them to go online to meet the demand and thereby increase the supply. The dashers see the notifications, accept the deliveries, go to the restaurant, pick up the order, and deliver it to the customers.
From the restaurant’s perspective, with more dashers available, they can accept more orders.
From the customers’ viewpoint, there will be fewer order cancellations.
Examine exactly how the product works. How does a user access a certain feature? How does a user use a certain feature? What kinds of different users are there?
Based on the user flows mentioned, the following metrics can be defined for the three groups:
Dasher:
Merchants:
Customers:
Since the main interest is in increasing customer orders on the platform and retaining customers, the key metrics to determine if the feature is working properly are average delivery times per hour and the number of customer orders.
To test the hypothesis, group the dashers who went online during peak hours after receiving the notification and compare them with a group of dashers who did not go online. Analyze if the delivery times decreased as the number of drivers increased.
Conduct A/B testing to validate the results.
Here’s a featured answer from one of our community members, poised-gold-tahr:
minimizing wrong orders before user placing or when users are placing the orders?
order cancellation rate via self-service cancellation or contacting support to cancel order. This is a binary variable, so a binary classification model is applicable.
a banner/in app notification when predicted to be a wrong order, when user land on the restaurant page & when user are at the check-out.
past app interactions:
we should balance between precision and recall. In this case, wrong orders will cause customer/merchant/dasher dissatisfaction if we fail to detect (FN) & if we notify users too often when it’s not wrong order (FP), they might abort the checkout process. this is also dependent on the intervention we build. [For example if instead of a notification at the checkout page, we might want to ask users if they need to update their delivery address when they open the app and is hundreds of miles away from their usual delivery address. In this case we may care more about recall]
suppose we build the prediction model to predict wrong orders when users are on the checkout page optimizing for precision, and we pop up a window/in app notification to ask user to double confirm the order. we should perform offline testing and AB testing to determine effectiveness before launching.
after offline testing, we should design an AB test & roll out plan.
If you have a DoorDash data science interview coming up, practice with these resources from Interview Query: