Interview Query

Takehomes

Determine the percentage of customers that won't wait for an item to get restocked.

Python
Time Series
Probability
6 Hrs

Answer multiple questions to show your data science skills

Analytics
Growth
Machine Learning
SQL
AB Testing
6 Hrs

Create a model to predict how long it will take a driver to deliver an order

Machine Learning
Python
Regression
5 Hrs 30 Mins

Create a model to recommend bookings to Brazilian Airbnb users

Machine Learning
Recommendation Engine
72 Hrs

Help a cross-functional Airbnb team grow bookings in Rio de Janeiro

Analytics
Presentation
Data Visualization
EDA
Growth
6 Hrs

Formulate business goals related to driver churn

Pandas
Business Case
R
1 Hr

Determine how well a product at Stripe is performing.

Analytics
EDA
Marketing Analytics
6 Hrs

Perform open-ended analysis of a data set and make a notebook showing any interesting threads you find

Analytics
EDA
Business Case
3 Hrs

Bike-Sharing Program Q&A

McKinsey & Company

Answer questions regarding data from a bike-sharing program.

EDA
6 Hrs

Estimate the financial impact of launching a new product line on monthly sales.

Analytics
Presentation
Growth
Business Case
Marketing Analytics
4 Hrs

Create a model to forecast revenue from sales in the future

Machine Learning
Python
Pandas
Regression
Time Series
6 Hrs

Create a model to forecast the revenue of new movies

Machine Learning
Python
Pandas
Regression
6 Hrs

Create a model to predict if an article is spam or not

Machine Learning
Classification
Python
Pandas
6 Hrs

Conduct analysis to discover how to promote Affirm to merchants

Presentation
EDA
SQL
6 Hrs

Answer multiple questions to show your data science skills

3 Hrs

Design and write queries for a KPI dashboard

SQL
Data Engineering
Database Design
6 Hrs

Determine if a newly launched product effectively reduces overspending (and answer some probability questions).

EDA
Growth
AB Testing
Probability
6 Hrs

Create a model to classify if a piece of text was made by a human or a bot.

Machine Learning
Classification
Python
NLP
6 Hrs

Create a model to determine if a promotion should be given to a user

6 Hrs 30 Mins

Find what state Grubhub should focus on for new product development

Analytics
EDA
Business Case
Marketing Analytics
3 Hrs

Create a model to predict the frequency of high winds that can help estimate how long it takes trucks to flip over in high winds

Python
6 Hrs

Complete four short data science questions covering the full breadth of the job

Analytics
Machine Learning
Python
Business Case
Marketing Analytics
R
ML System Design
AB Testing
1 Hr

Determine the most efficient use of space inside a supermarket.

2 Hrs

Create a model to predict whether or not a debtor will default on a loan

6 Hrs

Create a model to estimate the daily shelf value of different food items

Machine Learning
6 Hrs

Design a system to show the correlation between the price of a cryptocurrency and Twitter sentiment about the token.

Data Engineering
Feature Design
Finance
6 Hrs

Complete two long-form challenges to test your SQL and storytelling skills

Analytics
Business Case
SQL
6 Hrs

Explore, analyze, visualize, and model Supercell's revenue data

Analytics
Data Visualization
EDA
Machine Learning
6 Hrs

Write a script to transform the provided CSV to a desired output CSV.

Python
Data Cleaning
2 Hrs

Analyze the business prospects for a new e-commerce startup and see if it is worth investing in

Analytics
EDA
Python
Pandas
Regression
Time Series
Business Case
Marketing Analytics
6 Hrs

Build a model that predicts the type of crime as soon as an emergency call comes in.

Machine Learning
Classification
ML System Design
5 Hrs

How would you build a product recommendation system?

Recommendation Engine
Data Cleaning
Deployment
5 Hrs

Create a model to cluster inquiries by their text content

4 Hrs

Create a presentation with suggestions on how to reduce traffic congestion

Presentation
EDA
Machine Learning
Deployment
72 Hrs

Pitch Forecast

Swish Analytics

Build a model that will predict the probability of a fastball, slider, etc., in a real-time environment.

Machine Learning
Classification
ML System Design
Deployment
4 Hrs

Do exploratory data analysis on user behavior on the Gordon Ramsay Masterclass page

3 Hrs 30 Mins

Given some legacy Python 2 code, identify bugs and errors in the code, reformat the code to Python 3, and suggest other ways to improve the code.

Machine Learning
Python
Model Evaluation
Code Review
2 Hrs

Create a model that predicts the overall title risk of a property.

Machine Learning
Python
Regression
6 Hrs

Create a model to detect if a transaction on a credit card was fraudulent or not

Machine Learning
Classification
Python
6 Hrs

Give a presentation to help the product and engineering team understand user behavior.

Analytics
Presentation
Marketing Analytics
6 Hrs

Build a transparent Redis proxy service

Python
Web Development
6 Hrs

Mortality Rate during Sha'ban

Saudi Commission for Health Specialties

Conduct analysis to see whether or not more people die during the month of Sha'ban.

EDA
Python
R
72 Hrs

Loan Default Model

Business optics

Create a model to predict the probability of a debtor defaulting on a loan

Machine Learning
Classification
Python
Pandas
3 Hrs

Create a model to predict if a permit is about electrical permissions

Machine Learning
Classification
Python
6 Hrs

Create a function to parse messy JSON files

Algorithms
Data Modeling
Data Cleaning
45 Mins

Create a model to predict if a user will click on a link

5 Hrs 30 Mins

Write two queries using different approaches to aggregate NPS by client by month.

SQL
6 Hrs

Identify which features are most important for getting a user to adopt our product

Analytics
12 Hrs

Perform analysis and modeling to assess the risk of cyber attacks on healthcare facilities

EDA
Machine Learning
Statistics
6 Hrs 30 Mins

Perform analysis on a data set of product details that is formatted in an inconvenient manner. Provide suggestions to improve the data model.

Analytics
Business Case
Business Model
Data Cleaning
2 Hrs

Create a model to price daily electricity costs for households

Machine Learning
Python
Pandas
Regression
Time Series
6 Hrs

Discharge Rate vs. Workload

Mid-Atlantic Permanente Medical Group

Determine if the number of "busy days" at a hospital affects the discharge rate of a patient.

Presentation
EDA
Python
6 Hrs

Analyze which pricing method is best for an e-learning course

Analytics
Marketing Analytics
5 Hrs 30 Mins

Create a model to estimate the acceleration of a car

Machine Learning
Python
Pandas
Regression
Time Series
Deep Learning
6 Hrs

Evaluate conversion rate predictions by device for a set of entities.

Machine Learning
Model Evaluation
6 Hrs

Create tables that are grouped by a complicated key

Pandas
2 Hrs

Conduct analysis of the prices of short-term rentals in Phoenix, Arizona

Analytics
Pandas
Business Case
3 Hrs 20 Mins

Answer multiple questions to show your data science skills

SQL
Probability
Multiple Questions
6 Hrs

Complete two short data science questions covering EDA and error analysis

Analytics
EDA
Pandas
Statistics
1 Hr 30 Mins

Give a recommendation on how City Year should focus on different types of clients based on survey data.

Business Case
Probability
6 Hrs

Answer questions about two data sets that have an unusual way of labeling data

EDA
Python
Pandas
6 Hrs

How would you allocate the budget you were given to acquire new users?

EDA
Growth
Marketing Analytics
6 Hrs

Build a model to predict the number of order requests per hour for five regions.

Machine Learning
Python
6 Hrs

Create a model to predict if a book-delivery startup will be able to pay back a loan

Machine Learning
Classification
Python
Pandas
6 Hrs

Create a model that predicts if a child will likely play Square Panda games in the next seven days.

Machine Learning
Marketing Analytics
6 Hrs