Expedia Group is a leading global travel platform committed to making travel accessible for everyone. The data scientist position at Expedia involves a blend of coding, data analysis, and collaboration with various departments to drive business insights and improvements.
Are you ready to tackle complex analytical problems and influence positive changes in a fast-paced environment? In this guide, we’ll tackle how they conduct their data science interviews, along with commonly asked Expedia data scientist interview questions to help you prepare better.
If your CV happens to be among the shortlisted few, a recruiter from the Expedia Talent Acquisition Team will make contact and verify key details like your experiences and skill level. Behavioral interview questions may also be a part of the screening process.
In some cases, the Expedia data scientist hiring manager stays present during the screening round to answer your queries about the role and the company itself. They may also indulge in surface-level technical and behavioral discussions.
The whole recruiter call should take about 30 minutes.
Once you pass the initial screening, Expedia will send you an online assessment focusing on technical skills. This might include coding challenges pertaining to classification problems using datasets like Airbnb.
Successfully navigating the recruiter and online test rounds will present you with an invitation for the technical screening round. Technical screening for the Expedia Data Scientist role usually is conducted through virtual means, including video conference and screen sharing. Questions in this 1-hour long interview stage may revolve around:
If you pass the technical interviews, you may be assigned a take-home task. This assignment generally involves analyzing data and presenting your findings. You’ll have a week to complete the task, which tends to include:
Followed by a second recruiter call outlining the next stage, you’ll be invited to attend the onsite interview loop. Multiple interview rounds will be conducted during your day at the Expedia office, including:
Your technical prowess, including programming and ML modeling capabilities, will be evaluated throughout these interviews.
Typically, interviews at Expedia vary by role and team, but commonly Data Scientist interviews follow a fairly standardized process across these question topics.
A team wants to A/B test changes in a sign-up funnel, such as changing a button from red to blue and/or moving it from the top to the bottom of the page. How would you set up this test?
Your manager ran an A/B test with 20 different variants and found one significant result. Would you find anything suspicious about these results?
A social media company launched in a new city and saw a slow decrease in the average number of comments per user from January to March, despite consistent user growth. What could be the reasons for this decrease, and what metrics would you investigate?
Given all the different marketing channels and their respective costs at a company selling B2B analytics dashboards, what metrics would you use to assess the value of each marketing channel?
You have a 4x4 grid with a mouse trapped in one of the cells. You can “scan” subsets of cells to know if the mouse is within that subset. How would you determine the mouse’s location using the fewest scans?
find_bigrams
to return a list of all bigrams in a sentence.Write a function called find_bigrams
that takes a sentence or paragraph of strings and returns a list of all its bigrams in order. A bigram is a pair of consecutive words.
Given a table of bank transactions with columns id
, transaction_value
, and created_at
, write a query to get the last transaction for each day. The output should include the id, datetime, and transaction amount, ordered by datetime.
find_change
to find the minimum number of coins for a given amount.Write a function find_change
to find the minimum number of coins that make up the given amount of change cents
. Assume we only have coins of value 1, 5, 10, and 25 cents.
Write a function to simulate drawing balls from a jar. The colors of the balls are stored in a list named jar
, with corresponding counts of the balls stored in the same index in a list called n_balls
.
calculate_rmse
to compute the root mean squared error.Write a function calculate_rmse
to calculate the root mean squared error of a regression model. The function should take in two lists, one representing the predictions y_pred
and another with the target values y_true
.
Explain the key differences between Lasso and Ridge Regression, focusing on their regularization techniques and how they handle coefficients.
Identify the type of model used for determining loan approval based on customer inputs.
Given that personal loans are paid in monthly installments, describe how you would measure the difference between two credit risk models over a specific timeframe.
List and explain the metrics you would use to evaluate the performance of a new credit risk model.
Describe the criteria and methods you would use to determine if a decision tree algorithm is appropriate for predicting loan repayment.
Explain the steps and metrics you would use to assess the performance of a decision tree model both before deployment and after it is in use.
Describe the process by which a random forest algorithm generates its forest and explain the advantages of using random forest over logistic regression.
Explain the interpretation of logistic regression coefficients when dealing with categorical and boolean variables.
Write a function that takes the number of tosses and the probability of heads as input. The function should return a list of randomly generated results (‘H’ for heads and ’T’ for tails) equal in length to the number of tosses.
Write a function that takes a list of integers as input and outputs the sample variance, rounded to 2 decimal places.
Given the probabilities that a specific item X is available at warehouse A (0.6) and warehouse B (0.8), calculate the probability that the item X would be found on Amazon’s website.
You should plan to brush up on any technical skills and try as many practice interview questions and mock interviews as possible. A few tips for acing your Expedia data scientist interview include:
Average Base Salary
Average Total Compensation
Expedia Group fosters an inclusive, diverse, and innovative work environment. The company values creativity and collaboration and provides flexibility in working arrangements. They strive to make a positive impact and are committed to employee growth and a strong support system.
To prepare for the take-home assignment, focus on refining your data analysis and presentation skills. Make sure to understand the business context and be ready to discuss your approach and findings during the presentation. Practicing similar tasks on Interview Query can help build the necessary skills.
Responsibilities include applying knowledge in SQL, Python, or R to solve business problems, optimizing processes, and communicating complex analytical concepts clearly. Data Scientists collaborate with cross-functional teams to derive actionable insights and support marketing and capital allocation decisions.
The role of a Data Scientist at Expedia, Inc. offers a unique blend of technical challenges and business impact. You will get the opportunity to delve into A/B testing, statistical modeling, and data-driven decision-making while collaborating with a variety of teams across the organization. The interview process, although rigorous, is designed to be efficient and supportive, reflecting the company’s commitment to a positive candidate experience.
If you want more insights about the company, check out our main Expedia Interview Guide, where we have covered many interview questions that could be asked. Additionally, explore our interview guides for other roles such as software engineer and data analyst to learn more about Expedia’s interview process for different positions.
Good luck with your interview!