The title of ‘data analyst’ can be considered a very broad occupation that includes roles that are involved with data, business process, project analysis, and management. One of the essential skills for a data analyst is the ability to be technically literate. That means that as a data analyst, you should be aware of the various functions of different applications and useful software, even ones in which you do not operate regularly. One of these programs (and perhaps the most useful and popular of them all) is Microsoft Excel, which you will almost assuredly spend time mastering over your career.
Excel is one of the most common types of data analyst interview questions, and Excel interviews typically range from basic knowledge about the software to advanced functions about how to visualize and format the data, which means preparation can be slightly confusing and overwhelming.
To prepare for an Excel interview, you should review and practice:
In order to establish why Excel is so useful, it is helpful to think of the teams you may work or be embedded with. The finance department of a company is an excellent example. Establishing plans and testing models for this team puts you in a position of deciding on crucial financial decisions for the company itself. That is why a successful data analyst should focus on learning and improving their financial reporting, forecasting and strategizing, which is where knowing important Excel functions and tools comes in handy. You may very well work with other functions of a business like sales or customer success (retention), and the following tools will also be a huge asset for their planning as well:
Pivot tables are perhaps one of the tools that will save you the most time in Excel. It is capable of summarizing and analyzing data in a matter of seconds. And regardless of the size of the data itself, it will provide you with a summary of it, with the ability to quickly update when new data is inputted. With these advantages, the pivot table can be one of the best ‘friends’ of data analysts who use Microsoft Excel regularly.
Charts are the essential way for you to present your data. Everyone can turn in pages upon pages of numbers. And sometimes that kind of presentation is required or requested as well. More often however, the detailed and customizable Charts section in Excel is going to help you create graphics that both convey data and tell a better story. Common charts include:
Calculated items work as custom formulas that use the sum of different items. The sum of different items is calculated and put into a pivot field. It can be looked at almost as a supportive feature of Pivot Tables.
This feature also goes well when you are presenting your final results to a group. It helps you differentiate categories of data cells from each other. You can highlight, emphasize and change the format of the cells, often by adding cell color based on the cell contents. As a business analyst, you are going to deal with numbers constantly. Therefore, presenting them in a more fashionable way will help everyone to realize and understand what they are looking at easily.
It is crucial that you know Excel in today’s times, especially when you have to work with data. A data analyst has to know a number of tools, and excel is one of the basic ones in that list. There are plenty of other tools that can work with large datasets, but it is Excel where most analysis starts, before increasing in complexity. Some questions you need know how to solve in Excel at a basic level are:
Excel can accommodate a number of basic and advanced data formats. Some of the most common ones are numbers, percentages, dates, and text.
There are many ways in which one can use data formats in Excel. For example, numbers can be used as decimals or rounded figures, percentages can be used to show part of a whole (the whole being 100%), dates can be changed depending on the region and location that Excel is connected to, and text can be used to analyze data and reports imported from other spreadsheets. These formats are often used in conjunction with one another when doing more advanced coding and manipulation, which we will cover further in the article (for example, multiplying a number by a percentage).
A spreadsheet in Microsoft Excel is made up of cells organized into rows and columns. Information is inputted into the cells in order to organize, display and manipulate data. For example, a spreadsheet can contain the names of a list of products, the price per unit, and the number of units sold. This enables the viewer to read and search through this list and make calculations for what data needs to be analyzed.
There are a total of six formats in which data can be exported. They are:
Choosing which format to export it may be dependent on the team that needs the file or constraints on how large a file can be for its end destination.
Wrapping text is useful for data presentation and keeping your sheet clean. You must select the text you want to wrap, click wrap text from the home tab, and you will see the text wrap within a cell.
A Ribbon refers to the topmost area of the application that contains menu items and toolbars available in MS Excel. Ribbons can be shown/hidden using CTRL+F1. The ribbon runs on the top of the application and acts as a replacement for the toolbars and menu dropdowns that can be hard to navigate without lots of muscle memory. The ribbons have various tabs on the top, and upon clicking on it you can see a group of commands which you can use without having to re-navigate to the tab header. Additionally, hovering over ribbon commands will show you the hotkey combination to perform that function, so that you can memorize some of the functions you use most often.
The same as you use in mathematics which is defined by the term “PEMDAS” or “BEDMAS”. This stands for:
Each worksheet on Microsoft Excel is comprised of multiple rows and columns. The points where the rows and columns intersect form a rectangle, which is also known as a cell.
The cell address is a unique identification value given to every cell on an Excel sheet to make it easier to find data present on the document. The cell address is denoted by the respective column letter and corresponding row number for the cell. The notation is simple, A1 is the left uppermost cell in a spreadsheet, and other cells count outwards from there.
Right click on the bottom sheet tab of the sheet that you wish to delete. The right click will bring up a menu of options, one of which is delete. Selecting delete will activate a prompt that asks if you are ready to permanently delete the sheet; make sure you have a saved version or really are positive you will no longer need the information contained within.
You can annotate a cell in excel using multiple techniques like utilizing colors, callouts, and cell comments.
Cell references are used to refer to data located in the same Excel spreadsheet but to data in a different cell. This becomes useful when building custom formulas that rely on data from different locations within the same sheet or from a different sheet.
To add a comment to a sheet, right click on the sheet you want to add the comment to and choose insert comment from the menu. Type your comment in the comment area provided. Cell comments are directly linked to specific cells and are indicated by a red triangle in the corner of a cell. To see the comment, simply hover over the cell with your mouse.
There are three ways in which you can do this, and they are: Setting passwords to open the workbook. Adding, removing or hiding sheets (make sure you have reviewed how to unhide the sheet for future reference). Protecting the sheet from the alteration of window sizes or positions.
Microsoft Excel 2003 can handle 4,000 different cell format combinations, and Excel 2007 and later can handle 64,000. Any unique combination of formats counts towards this number: for example, one cell formatted with Arial font and pink fill color, and another cell with Arial font and blue fill color, would count as two different combinations.
The default font of all cell content in a workbook is Calibri (in black). However, this can be changed by selecting any cell and clicking the Font drop-down menu on the Home tab. You can change the font, size and color and make your text bold, italic or underlined. Other options available to you include: - Cell Fill colors. - Cell Borders. - Cell styles. - Auto number formatting (adding currency symbols, percentage symbols, etc). - Alignment. - Auto text formatting (changing the appearance of dates and times etc).
A red triangle in the upper right corner of a cell indicates that a cell comment has been attached to this cell. If you hover over the cell with your cursor, the comment will be displayed. Check back to “How do you insert a comment” above to review that process.
Absolute cell referencing is the exact opposite of relative cell referencing. By marking the row number and column letter with a $ symbol (e.g. A1), you can make a cell reference fixed (or “absolute”). This means that when you copy and paste it to another cell or use AutoFill, the cell references will not change.
When you insert the dollar sign, it tells a cell in Excel whether to change the location of the reference or not, if the formula for it is copied to other cells. You can also make only the row or column absolute ($A1, or A$1), in order to allow a portion of a reference to be updated when pasting or AutoFilling.
The shortcut to turn filters on or off is Ctrl+Shift+L. To display the filter drop-down menu, the shortcut is Alt+Down Arrow
Formulas are written instructions for a calculation in Excel. All calculations within a spreadsheet will be written as formulas (e.g. =C3+C4+C5+C7+C8). On the other hand, functions are prewritten formulas and a feature of Excel. Excel has over 500 built-in functions that allow users to achieve complicated calculations without having to create the formula themselves, or type it out in full.
You need to go to the Home tab, click the Clear button and then select the option for Clear Formats. Any formatting done on the cell will be removed without affecting content.
=NOW() function will return the current date and time. If you just want to enter the current time, but not date, use the keyboard shortcut Ctrl+Shift+semicolon key, and for the current date without time, press Ctrl+Semicolon.
A dashboard is used to present important information through graphical representation. It is helpful in presenting huge data on a single computer screen so it can be reviewed with a glance, and useful insights can be inferred from it. Often a dashboard will utilize color coded gauges to show if metrics are within desired parameters or if a deeper dive is needed on a specific topic.
Text to column option in the Data tab could be used to split the original column into two or more columns. You can split by number of characters, or by delimiting (looking for a certain indicator). A common example is separating cell FirstLastName (Susan Crabapple) into a First Name (Susan) and Last Name (Crabapple). By delimiting based on the space, you could separate a whole range of names into their component parts.
The TEXT function in Excel allows you to convert a value to text in a specific format. This function is particularly useful when you want to format numbers, dates, or times in a specific way within a text string.
For example, you can use =TEXT(A1, “mm/dd/yyyy”) to convert a date in cell A1 into a text string with the format “mm/dd/yyyy”. Common scenarios include formatting dates for reports, adding currency symbols to numbers, or combining text with formatted numbers or dates.
Explaining data succinctly is part of the job, but explaining the software and tools that data analysts use definitely is not, and not everyone is naturally gifted in this area. This means that you need to be prepared for any kind of question that the interviewer may ask, and can show your additional capabilities by articulating what other interviewees may not be able to.
As the interview progresses, the questions can get tougher depending on how well you know the software. If you have experience in data analytics, you can expect questions on functions and other tasks that you can carry out in Microsoft Excel, such as writing SQL Queries within excel.
A chart in Excel is a feature that allows you to display data through a range of visually intuitive graphs. These charts and graphs can make it easier and quicker to comprehend data, in comparison to just looking at the numbers on the worksheet. Available charts on Excel include:
Each of these charts have their advantages and disadvantages, based on the story you wish to tell your audience. Make sure you have a reason for choosing a specific chart, and that it is not emotion or random.
A macro is a sequence of performable actions in Excel which have been recorded, saved, and named for easy use in the future.
A macro can then be called on whenever necessary to complete the sequence of actions without the user having to replicate each step manually. This saves valuable time and effort when performing repetitive tasks with larger sets of data. You might, for example, need to manipulate a data set in the same way every week, but it involves 15 steps to complete. By recording a macro, you can manipulate the data in a consistent way extremely quickly, with just a click of the button.
Formatting will automatically be copied to all selected sheets, so to apply a single format to all the sheets in a workbook, simply hold down Ctrl (or Cmd on Mac) to select every sheet. Only formatting edits will be copied over with this method, not data.
A relative cell address (or relative cell reference) is a cell address within a formula that does not include a $ symbol before the row number or column letter. This means that when the formula is copied one column over, the cell address will change accordingly. The formula will calculate relative to its position, rather than displaying the calculation from the original position. This is why making formulas with absolute cell references is so important, as you often do not want your equations to roll forward automatically.
A drop-down list allows the creator of a spreadsheet to control the data inputted into certain cells. In other words, when the creator only wants one of four predetermined options to be used, instead of relying on potentially unreliable user inputs (misspellings, non-standard categories), they can ensure it is adhered to by having the users click an option from a drop-down list. Good inputs equal good outputs, and reducing the opportunities for manual entry will allow for better analysis down the line.
The Name Box is an input box above the Excel sheet, to the left of the formula bar. Its default mode displays the address of the currently selected cell, but it has other uses too. Firstly, the Name Box can be used to quickly select a specific cell or range of cells: typing in a cell reference like G8 will automatically navigate to the cell G8, and typing in a range of cells like G8:G30 will select all cells within that range. Secondly, the Name Box can be used to create a named range, and then can be used as a drop-down menu to navigate between your assorted named ranges.
LOOKUP
function in Microsoft Excel?The LOOKUP function allows the user to find exact or partial matches in the spreadsheet. The VLOOKUP option lets the user search for data located in the vertical position. The HLOOKUP option functions in the same way but in the horizontal plane.
Data can be imported from a variety of sources by selecting the Data tab and clicking Get External Data > From Other Sources. Excel worksheet data, data feeds, text files and other such data formats can be imported, but you will need to create relationships between the imported tables and those in your worksheet before using them to create a pivot table.
To create a link in Excel, select the element you wish to use as the anchor (this can be a cell or an object like a picture). You can then use a variety of pathways: select Link from the Insert tab, right-click and select Link on the menu, or press Ctrl+K. This will bring up a series of options that will allow you to indicate what kind of content you would like to link to, such as a file, a web page, a specific location or an email address.
You can do this by using the WEEKDAY function. First, enter the formula in the required cell, WEEKDAY(A1,1). The second argument (1) determines to what day excel will assign the value. Press Enter, and the Formula will return 3, meaning that your date is the 3rd day of the week.
When working in a massive workbook with many sheet tabs, it can be helpful to know that you are looking at the correct sheet and not missing any hidden tabs. The SHEET formula returns the number of the sheet you are in, such that the second sheet tab would return a value of 2. This is beneficial as an organizational tool.
If the last parameter is not specified via TRUE or FALSE, the return value will default to TRUE (approximate), and show an approximate match for your request. This is usually not what you are looking to accomplish, as the typical use case for VLOOKUP is to find an exact match.
There are several methods to refresh a pivot table without making a new one from scratch:
You can also refresh all pivot tables within a workbook at once by selecting the arrow underneath the Refresh button in the pivot table tools menu, and select Refresh All.
The most common error message in Excel is the #### error message, which is displayed when a cell is not large enough to display all the data that has been inputted into it. To solve this error message, simply drag the cell to increase its width or depth.
Other common errors include #VALUE! , which results when a portion of the data you are trying to analyze has a different format than the rest, or #NAME! , when the formula was written incorrectly and is not processing as it should.
To find but not delete duplicates in a column, select the target range of data and navigate to the Style group on the Home tab. Here, click the arrow next to Conditional Formatting. You will then be able to choose Highlight Cell Rules, Duplicate Values, and enter the values you wish to find duplicates of. This will highlight duplicates of the values you entered, but will not manipulate the data within.
Macros for Excel are written in VBA (Visual Basic for Applications). You can use VBA to manually type out actions for a macro yourself, or you can “record” actions using the Macro Recorder in Excel, and the software will convert these to VBA code for you. Once you become stronger in VBA, you can review recorded macros and perform the updates yourself to tweak your use cases.
You can find unique values by selecting the desired range of data and navigating to Data > Sort & Filter > Advanced. To permanently delete duplicate values and create a list of unique values only, click Data > Data Tools > Remove Duplicates.
The product of the pivot table feature in Excel is often referred to in other software as a contingency table or cross-tabulation. To create a “crosstab” use the PivotTable option on the Insert tab.
The Freeze Pane function in Excel serves to lock rows or columns of a sheet so that they are always visible. This can be found on the View Tab of the ribbon. This is often helpful when you want to keep headers of columns in view, or perhaps the identifier of a row while scrolling right.
In Excel, when copying formulas from one cell to another, the way cell references are treated can differ depending on their type:
By understanding and using these references effectively, data analysts can make their spreadsheets more dynamic and reduce errors during data manipulation. It also helps if they need to quickly scale formulas across dozens or hundreds of columns and rows.
To apply conditional formatting in Excel to highlight cells with values above a certain threshold, first select the range of cells you want to format. Then, go to the “Home” tab, click on “Conditional Formatting,” and choose “New Rule.” In the New Formatting Rule dialog box, select “Format cells that contain,” then set the condition to “greater than” and input the threshold value. Choose the formatting style you want to apply (e.g., fill color), and click “OK.”
No matter what position you work in, no matter which company you work for, if you are unable to solve the problems you face, you will not be successful. That means you have to be good at identifying the problems themselves. Then you have to consider which ones can be or should be solved. After your evaluations, you can create solutions to challenges to both the organization and to you as a data analyst. By being able to answer difficult questions on Excel, and knowing how to navigate these issues, you will find your job as a data analyst much easier.
These harder questions about Excel include data analysis, sorting and other hard functions.
Conditional formatting is a method that allows us to identify the characteristics (conditions) of a cell’s contents that are of interest to us, and superimpose highlighting of cells or ranges that meet that criteria. When a cell is updated with new information, the conditional formatting will recognize and update itself. For instance, if you wanted to apply a green highlight to any cell that contains the letter Z, and you update Randolph to Ziggy, the conditional format would automatically apply.
The VLOOKUP function can only move in one direction, from left to right. Therefore, the information you wish to seek out must be located in a column to the right of the lookup value’s location. In newer versions of Excel, a successor to VLOOKUP has been added, called XLOOKUP. This new function works in any direction and defaults to exact matches rather than approximate. At some point in the future, XLOOKUP will completely replace VLOOKUP, but this will not happen until the majority of users have moved away from using older versions of Excel.
VLOOKUP is not case-sensitive, and will always return the first value of the match irrespective of the case. In other words, the name Apgar and the acronym APGAR would be viewed as the same by VLOOKUP. It is, however, possible to manipulate VLOOKUP into returning case-sensitive values by using a helper column, or by sorting your data in an ascending or descending order so that the value you want is always the first to be encountered by VLOOKUP.
You can use two MATCH functions within an INDEX formula to specify a cell reference and return the value of that cell. The dynamic formula will return the corresponding data of any two MATCH values you input. For example, if you have a table detailing the price per unit and the number of units sold for a variety of products, you can use the match index function to return a specific piece of information about a specific product.
COUNT: This function counts how many cells within a specified range contain numerical data. It will ignore (not count) any cells that are blank or contain text or symbols only.
COUNTA: This function counts how many cells within a specified range contain data of any type (the cell is filled). It will count all cells that are not blank.
COUNTBLANK: This function will count the number of blank cells within the designated range.
COUNTIF: This function will count only the cells whose value meets a certain condition specified by the user.
Data Validation limits the type of values that can be inserted into a single cell or a set of cells by a user. It again works to limit user input errors and keep the data inputs clean.
Yes. It is possible to configure the ‘Easy Access Toolbar’ above the home button to display your most commonly used shortcuts.
You can extract a unique list of items using Advanced Filters, or you can also extract a particular item from various worksheets. It may be assumed that the Advanced Filter is an advanced Auto Filter variant.
With an array, a user can perform multiple calculations on multiple items using just one formula. For instance, if you want to multiply A1 by B1, A2 by B2, A3 by B3, and so on, instead of writing a formula for each calculation, you can write =A1:A10*B1:B10 in cell C1, and the results for each calculation will spill down the C column.
This is called a multi-cell array, as the results will be stored in multiple cells. You can also create a single-cell array if, for example, you wanted to sum the entirety of the values calculated by the above array. This would look like: =SUM(A1:A10*B1:B10) and return a single value calculated by adding the results of all ten multiplications together.
The FIND function will return the numerical location of this target (with the first character of the text being 1). The LEFT function can then extract the number of characters specified by the FIND function from the beginning of the text (i.e the left).
However, the value returned by FIND will include the space itself, so we need to subtract 1 from the value in order to find the actual ending point of the first name. The formula would look like this:=LEFT(A1,FIND(“ ”,A1)-1).
A second method would separate the first names and last names and deposit them into separate new columns, using the Text to Columns feature found in the Data tab. We covered this earlier under how to split information in a column. The Text to Columns dialogue box will allow you to select the delimiter separating each field (e.g space) and show you a preview of the result. The last step will allow you to choose where you want the result to be displayed.
It is the duty of a function to return the value of the mission it performs. Subroutines, meanwhile, do not return the importance of the assignment they perform. Functions are used as formulas, as they are in spreadsheets. Subroutines are not used as formulas directly inside spreadsheets.
The name of the workbook from which the code is running is provided by ThisWorkbook. ActiveWorkbook is as the name implies, the workbook that is actually active in the numerous open workbooks.
Using the name box is a fast way to return to a particular section of the worksheet. To return to a particular area of a worksheet, you can type the cell address or name of the range in the name box.
Circular references occur when a formula in a cell refers back to itself either directly or indirectly. Excel will display a warning message if it detects a circular reference, because it can cause the program to calculate endlessly while never generating a result. It is therefore critical to address the circular references immediately to prevent them from potentially interfering with other formulas. To identify and correct circular references:
Excel is one of the most frequently asked data analyst interview questions, and they are not designed to catch candidates out or determine whether they “know the correct answer”. They are instead used in tandem with portfolios and past experiences to gauge a candidate’s experience and proficiency with the software. These questions are not “hard” per se, but they are often quite technical. Therefore, if you are familiar with the software and take your time to prepare for your interview, you will have a high chance of acing it!
In addition to Excel, you’ll also want to practice a variety of SQL questions for data analysts and data analyst behavioral questions prior to the interview.
For in-depth interview preparation, we recommend exploring the data analytics learning path and the product metrics learning path. In case you are seeking a guide that is tailored to a specific company, the Google data analyst interview guide would be worth your consideration.