Here’s a secret about quant and finance interview questions: They don’t often relate to finance. Instead, the probability questions in trading interviews are framed as quantitative problems and ask interviewees to make calculations based on the provided information.
To do well in these statistics and probability interview questions for quants and traders, you need intermediate-to-advanced knowledge of the said fields, econometrics, and, increasingly, computer science. Although finance-specific knowledge is helpful, you don’t need it to pass the interview.
Finance statistics and probability interview questions fall into three categories:
Quantitative finance interviews are notorious for their difficulty. In general, you can expect a number of brainteasers and probability puzzles throughout the interview process, and these typically progress in difficulty into the later interview rounds.
According to Dirk Bester, a PhD finance quant, who has been interviewed dozens of times and wrote a book on quant interview questions, here are some things to remember about the interview process:
How difficult are questions? Generally, you can expect “easy” to “medium” probability and statistics questions, although the difficulty is somewhat dependent on the role and company. Jane Street, Citadel, and SIG are notorious for highly difficult quant interviews, and with these companies, you shouldn’t expect many “easy” questions.
Probability questions in trading interviews cover many different subjects and concepts in statistics, probability, and econometrics, including:
Randomness: Randomness questions are calculations of random events. It’s using math to predict events like dice rolls, one-off events, or just specific unknowns.
Counting: Counting is using probability to figure out the number of outcomes possible in a finite situation.
Probability outcomes: This concept involves both distinct outcomes that are mutually exclusive and not exclusive—often involving a sum of probabilities.
Independence: Independence is the study of two outcomes that are unrelated. Calculating the probability of both of them occurring, or the product of their probabilities individually occurring.
Expected value: Expected values are key to probability distributions and are the average random variable.
Confidence intervals: The calculation of not just the specific outcome but the range of outcomes and the expectedness of those outcomes.
Quant interview questions typically take the form of brainteasers or probability case studies. You’re provided with a scenario and make a calculation based on the provided information. The most common probability interview question types in quant finance interviews are:
In quant interviews, easy questions typically involve less complicated mathematical concepts, but will require some statistical analysis.
To solve this conceptual probability problem, it’s easiest to find out the probability of never rolling a three.
The probability of not rolling a three is ⅙. To not roll a three, we can do 1 - ⅙ = ⅚. How does that change with 2 dice? To not roll a three would be ⅚ * ⅚ = 25⁄36. So the probability of not rolling a three with two dice is 1 - 25⁄36 = 11⁄36.
What’s different when we’re using N dice?
More context. You notice that of all customers who bought subscriptions in January 2020,, about 10% canceled their subscriptions before the next cycle on February 1. Assume that your new customer acquisition is uniform and that customer churn goes down by 20% month-over-month.
This is a statistical analysis case question, and because the 10% is compiling, you need to calculate a new churn rate for February and so on. For example, in February churn rate will be 20% less than the initial churn rate of 10%, so the new churn rate will be .1(1-.2) = .08, and by March, you need to take the total customer base divided by 2 to get total churn rate.
A definition-based question like this is generally asked early in the interview process. Your goal should be to convey these concepts quickly and in layman’s terms.
Key thing to remember: Covariance can take on any numeric value, while correlation can only be between -1 and 1.
Let’s say you have to draw two cards from a shuffled deck, one at a time. What’s the probability that the second card is not an Ace?
If we sum up all of the probabilities in each scenario, we should be able to get the probability of drawing a second ace.
One question to add: does order matter here? Is drawing an ace on the second card the same thing as drawing an ace on the first card and still drawing a second card?
Write a function that takes an input as the number of tosses and a probability of heads and return a list of randomly generated results equal in length to the number of tosses. Each result represents the outcome of a coin toss, where ‘H’ represents heads and ’T’ represents tails.
Consider a stick of length 1. It is broken into three pieces at two randomly selected points. What is the probability that the three pieces can form a triangle?
Note that to form a triangle, the length of any side must be less than the sum of the lengths of the other two sides. This condition must hold for all three sides.
Martingale strategy is a gambling concept that involves doubling one’s bet after each loss to recover previous losses and make a profit.
One potential application is in the field of online advertising, where advertisers may use a Martingale-like approach to adjust bidding strategies for ad placements based on past performance.
Most of the probability interview questions for finance roles are medium difficulty. Nearly any statistical or probability concept can come up, so to prepare, start by reviewing basic-to-advanced probability.
With this probability events question, keep in mind that the biased coin is heads 30% of the time. You will need to use binomial distribution in which there exists n independent experiments.
Then using n choose k principles, you can calculate:
You can create a chart to map out the likelihood of each scenario (there are 6 scenarios) - for example X > Y: both positive - you then can simulate a random sampling and equate that all six are likely to occur. Therefore, 2x > Y is 50% likely.
If you are confused, think about how both X and Y are random variables across the same distribution, and that 2X as being on average double positive or double negative the value that Y is.
Remember that normal distributions can have any mean or standard deviation (so long as the standard deviation is positive, obviously). Having a certain mean or standard deviation is not enough to make a distribution non-normal.
Although this answer has multiple possible solutions, one solution could be that the data is non-continuous. For example, a plane wreck is a non-continuous data point, there are either 0 or 1 crashes. You can’t have .5 crashes.
What does a uniform distribution look like? Just a straight line over the range of values from 0 to d, where any value between 0 to d is equally likely to be randomly sampled. So, let’s make this easy to understand practically. If we’re given N samples and we have to estimate what d is with zero context of statistics and based on intuition, what value would we choose?
For example, if our N sample is 5 and our values are: (1,4,6,2,3), what value would we guess as d? Probably the max value of 6 right?
But, let’s look at another example. Let’s say our N sample is 5 again, and our values are instead: (20,30,28,26,16). Would our estimate still be the maximum value of 30?
More context. Based on the historical data, 98% of reviews are legitimate and 2% are fake. If a review is fake, there is a 95% chance that the machine learning algorithm identifies it as fake. For legitimate reviews, the algorithm correctly identifies it 90% of the time as legitimate.
This question requires knowledge of Bayes’ Rule! You’ll use L as a legitimate event and I as identified as legitimate.
Capital approval rates have gone down for our overall approval rate. Let’s say last week it was 85% and the approval rate went down to 82% this week which is a statistically significant reduction.
What could be the cause of the decrease?
Netflix has hired people to rate movies. Out of all raters, 80% carefully rate movies and 60% of the movies as good and 40% as bad. The other 20% are lazy raters and rate 100% of the movies as good.
Assuming all raters rate the same amount of movies, what is the probability that a film is rated good?
A climate research organization has a time-series dataframe with daily temperature readings for different cities. This dataframe has three columns: date, city, and temperature.
Due to data recording issues, the temperature reading for some days might be missing. The organization requires the temperature readings for each day, so they ask you to interpolate the missing data.
Write a Python function using Pandas that uses a linear interpolation to estimate the missing data and fill out the dataframe.
In quant interviews, hard probability interview questions are highly technical and complex and most likely require advanced math to solve. Depending on interview expectations, it might not be necessary to get these right. Instead, your problem-solving process is what is being assessed.
Follow up question. Design a test and describe which outcome would tell you that the coin is biased?
This question tests a variety of concepts including statistical testing, bias and variance, and binomial distributions. One can design the following test. Flip the coin for N=100 times (making sure that every flip is independent of the others) and count the number of heads/tails.
The Null hypothesis is that the coin is fair (Not biased), and the alternative hypothesis is that the coin is biased. Given the number of trials (N), a given level of confidence (typically 95%), we might be able to decide whether the coin is biased or not with 95% confidence.
Here is how we can obtain the confidence interval; We can consider each coin toss as a Bernoulli distribution with probability of head as p. For a fair coin, one may assume P(T) = H = 0.5. Also, N trials of a Bernoulli distribution give us a Binomial distribution. The Binomial distribution has a mean of np (=50 for 100 tosses of a fair coin) and standard deviation of the square root of np(1-p) which equals 5 for n = 100 and p = 0.5.
What else do we need to do?
For this question, assume that if the water level exceeds his height, the swimmer will die.
Hint: We aren’t given the distribution of the lake depth. What would be a good guess as to what the distribution is? Your answer doesn’t need to be formally calculated if doing so wouldn’t be feasible in a real-life interview.
More context. You select a coin from a jar of 1000 coins. Of all the coins, 999 are fair and 1 is double-sided with two heads. You pick a coin at random.
Here’s a hint: We’re given 999 fair coins and one biased coin and the question is asking about an event probability of whether a toss will be heads based on prior information of what exists in the jar. What kind of probability theorem does that insinuate?
Let’s say that you’re drawing N cards (without replacement) from a standard 52-card poker deck. Each card is unique and part of 4 different suits and 13 different ranks.
Let’s calculate it out.
If we draw N cards from a deck of 52, the probability that the first card is not a pair is 100%, given that you need at least two cards to make a pair.
The probability that the second card is a new card ranking is 48⁄51. How did we compute that?
Interview Query offers a variety of resources to help you prepare for your finance or trading interview. Start with our list of 31 commonly asked probability interview questions, or our list of 20+ statistics interview questions.
You can also check out our probability interview course module, which includes a look at how probability interviews are conducted, common types of questions, and the most frequently asked probability concepts.