In this section of the course, you will deepen your understanding of spreadsheets by exploring advanced features like pivot tables and data validation. You will also be introduced to the Google Data Analytics Certification on Coursera, a credential for those looking to expand their knowledge of data analytics.
By the end of this module, you will be more adept at using spreadsheets for data analysis and well-prepared for the Google Data Analytics Certification exam. With your grasp of spreadsheet fundamentals and structured thinking techniques, you will be ready to address real-world challenges with reliable and well-thought-out solutions.
Learning Objectives:
Discuss the role and responsibilities of a data analyst in relation to the use of spreadsheets.
Demonstrate proficiency in using spreadsheets to perform essential data analyst tasks, such as data entry and organization.
Understand and apply formulas in spreadsheets, including defining them and providing specific examples.
Compare formulas and functions, highlighting their similarities and differences.
Explore structured thinking principles, such as understanding the problem domain, scope of work, and context.
Test your knowledge on working with spreadsheets
1. When giving a spreadsheet a title, what are some best practices to follow? Select all that apply.
Titles should state what the data in the spreadsheet is about (Correct)
Titles should be short (Correct)
Titles should be in ALL CAPS
Titles should be clear (Correct)
Correct: Spreadsheet titles should be short, clear, and state exactly what the data in the spreadsheet is about.
2. Fill in the blank: Data analysts can use _____ to highlight the area around cells in order to see spreadsheet data more clearly.
filters
italics
pivot tables
borders (Correct)
Correct: Data analysts use borders to highlight the area around cells in order to see spreadsheet data more clearly.
3. Within a spreadsheet, data analysts use which tools to save time and effort by automating commands? Select all that apply.
Functions (Correct)
Tables
Formulas (Correct)
Filters
Correct: Data analysts use formulas and functions to save time and effort by automating commands.
Test your knowledge on using formulas in spreadsheets
1. Which of the following are examples of operators used in formulas? Select all that apply.
Forward slash (/) (Correct)
Asterisk (*) (Correct)
Hyphen (-) (Correct)
Hash (#)
Correct: The asterisk, hyphen, and forward slash are examples of operators used in formulas.
2. In a spreadsheet, a formula should always start with which of the following operators?
Dash (—)
Colon (:)
Equal sign (=) (Correct)
Plus-minus (±)
Correct: In a spreadsheet, a formula should always start with an equal sign.
3. What is the term for the set of cells that a data analyst selects to include in a formula?
Data boundary
Data range (Correct)
Cell set
Cell domain
Correct: The set of cells a data analyst selects to include in a formula is called the data range.
4. In a formula, the plus sign (+) is the operator for addition, and the hyphen (-) is the operator for subtraction.
True (Correct)
False
Correct: In a formula, the plus sign (+) is the operator for addition, and the hyphen (-) is the operator for subtraction.
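As a rough illustration outside the spreadsheet, the arithmetic operators from these questions behave the same way in Python. The two values below are made up for the example and simply stand in for a pair of cells such as A1 and B1.

```python
# Hypothetical values standing in for spreadsheet cells A1 and B1.
a1 = 12
b1 = 4

total = a1 + b1       # like =A1+B1 (plus sign: addition)
difference = a1 - b1  # like =A1-B1 (hyphen: subtraction)
product = a1 * b1     # like =A1*B1 (asterisk: multiplication)
quotient = a1 / b1    # like =A1/B1 (forward slash: division)

print(total, difference, product, quotient)
```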
5. Which of the following functions quickly counts how many items in a range of cells meet a given criterion?
The COUNTIF function (Correct)
The TEXT function
The MAX function
The SUM function
Correct: The COUNTIF function quickly counts how many items in a range of cells meet a given criterion. Using functions to make calculations and analyze data is an important skill for a data analyst. Going forward, you will continue to develop this skill as you work with more complex datasets.
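To make the idea of counting by a criterion concrete, here is a minimal Python sketch of what COUNTIF does. The `countif` helper and the order totals are hypothetical and only mimic the spreadsheet behavior; they are not part of any spreadsheet product.

```python
def countif(values, criterion):
    """Count the items in values for which criterion(item) is true."""
    return sum(1 for value in values if criterion(value))

# Hypothetical order totals, standing in for a range such as B2:B7.
order_totals = [120, 45, 300, 45, 80, 45]

# Roughly what =COUNTIF(B2:B7, 45) and =COUNTIF(B2:B7, ">100") would count.
exactly_45 = countif(order_totals, lambda v: v == 45)
over_100 = countif(order_totals, lambda v: v > 100)

print(exactly_45, over_100)
```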
Test your knowledge on using functions in spreadsheets
1. Data analysts use which of the following functions to quickly perform calculations in a spreadsheet? Select all that apply.
MIN (Correct)
SUM (Correct)
MASS
AVERAGE (Correct)
Correct: AVERAGE, MIN, and SUM are functions used to quickly perform calculations in a spreadsheet.
2. What is the term for a preset command in a spreadsheet?
Function (Correct)
Quotient
Cell
Range
Correct: A preset command in a spreadsheet is called a function.
3. You are working with spreadsheet data about a cross-country relay race. Each runner’s times are located in cells H2 through H28. To find the runner with the slowest time, what is the correct function?
=MIN(H2-H28)
=HIGH(H2-H28)
=MAX(H2:H28) (Correct)
=LOW(H2:H28)
Correct: The function is =MAX(H2:H28). The largest numeric value corresponds to the slowest time in the race. MAX returns the largest numeric value from a range of cells. And H2:H28 is the specified range.
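The same reasoning can be sketched in Python: for race times, the largest number is the slowest runner. The times below are invented for the example and are not the values in H2:H28.

```python
# Hypothetical finishing times in minutes, standing in for cells H2:H28.
times = [21.4, 19.8, 25.1, 22.0]

slowest = max(times)  # like =MAX(H2:H28): largest value, so the slowest time
fastest = min(times)  # like =MIN(H2:H28): smallest value, so the fastest time

print(slowest, fastest)
```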
4. When you created a complete and thorough mock SOW, which foundational pieces of content did you include? Select all that apply.
Deliverables (Correct)
Milestones (Correct)
Timeline (Correct)
Reports (Correct)
Budget
Correct: In your mock scope-of-work, you should have included four foundational pieces of content: the deliverables, milestones, timeline, and reports. Once these items are in place, the SOW will help keep you, your team, and your project stakeholders organized and on the same page. Going forward, you can use this industry-standard tool to clarify a project’s business goals and how to accomplish them—just like a professional data analyst!
Test your knowledge on Structured thinking
1. What are the key elements of structured thinking? Select all that apply.
Implementing a solution
Recognizing the current problem or situation (Correct)
Organizing available information (Correct)
Revealing gaps and opportunities in order to identify the options (Correct)
Correct: Structured thinking is the process of recognizing the current problem or situation, organizing available information, revealing gaps and opportunities, and identifying the options.
Correct: Considering inclusive sample populations, social context, and self-reported data enables fairness in data collection.
2. Fill in the blank: A scope of work is an agreed-upon _____ of the work you’re going to perform on a project.
diagram
to-do list
outline (Correct)
report
Correct: A scope of work is an agreed-upon outline of the work you’re going to perform on a project.
3. What are some strategies to ensure your data is accurate and fair? Select all that apply.
Make sure you start with an accurate representation of the population in the sample (Correct)
Use data that is very personal to you.
Think through the “who, what, where, when, how, and why” of your data (Correct)
Collect the data in an objective way (Correct)
Correct: To ensure your data is accurate and fair, make sure you start with an accurate representation of the population in the sample, collect the data in an objective way, and ask questions about the data.
Ask Questions to Make Data-Driven Decisions Weekly Challenge 3
1. Fill in the blank: In spreadsheets, data analysts begin _____ with an equal sign (=).
cells
charts
formulas (Correct)
numbers
Correct: In spreadsheets, data analysts begin formulas with an equal sign (=).
2. What do data analysts use to label the type of data contained in each column in a spreadsheet?
Menus
Attributes (Correct)
Tables
Headings
Correct: To label the type of data contained in each column in a spreadsheet, data analysts use attributes.
3. To determine an organization’s annual budget, a data analyst might use a slideshow.
True
False (Correct)
Correct: To determine an organization’s annual budget, a data analyst might use a spreadsheet.
4. Which of the following statements describes a key difference between formulas and functions?
Formulas span two or more cells, and functions exist in only one cell.
Formulas are written by the user, and functions are already defined. (Correct)
Formulas are used in graphs, and functions are not.
Formulas contain words and numbers, and functions contain numbers only.
Correct: Formulas are written by the user, and functions are already defined.
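A loose Python analogy may help here: an expression you write out yourself plays the role of a formula, while a built-in such as `sum` plays the role of a preset function. The three values are hypothetical stand-ins for cells A1 to A3.

```python
# Hypothetical values standing in for cells A1, A2, and A3.
a1, a2, a3 = 4, 9, 2

formula_style = a1 + a2 + a3        # written by the user, like =A1+A2+A3
function_style = sum([a1, a2, a3])  # already defined, like =SUM(A1:A3)

print(formula_style, function_style)
```

Both lines produce the same total; the difference is only in who defined the calculation.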
5. In the function =MAX(A1:A12), what does A1:A12 represent?
The maximum
The formula
The range (Correct)
The operator
Correct: In the function, A1:A12 represents the range. A range is a collection of two or more cells.
6. What is the correct spreadsheet formula for multiplying cell D5 times cell D7?
=D5^D7
=D5/D7
=D5*D7 (Correct)
=D5xD7
Correct: The correct spreadsheet formula for multiplying cell D5 times cell D7 is =D5*D7. The asterisk (*) is the operator for multiplication.
3. Fill in the blank: The job of _____ is to investigate alerts and determine whether an incident has occurred.
incident coordinators
security analysts (CORRECT)
public relations representative
technical leads
Correct: Security analysts investigate alerts and determine whether an incident has occurred.
7. Fill in the blank: By negatively influencing data collection, ____ can have a detrimental effect on analysis.
bias (Correct)
objectivity
partiality
filtering
Correct: By negatively influencing data collection, bias can have a detrimental effect on analysis.
8. Which of the following are ways that data analysts can add context to their data? Select all that apply.
Create reports for stakeholders
Consider where the data came from (Correct)
Ask questions about the data (Correct)
Use descriptive column headers (Correct)
Correct: To add context to their data, data analysts ask questions about the data, consider where it came from, and use descriptive column headers.
9. Both formulas and functions in spreadsheets begin with what symbol?
Colon (:)
Bracket ([)
Hyphen (-)
Equal sign (=) (Correct)
Correct: Both formulas and functions in spreadsheets begin with an equal sign.
10. A data analyst could use spreadsheets to achieve which of the following tasks?
Predict next quarter’s sales (Correct)
Motivate employees
Build code for a new app
Write reports
Correct: A spreadsheet could be used to predict next quarter’s sales.
11. Formulas are created by the user, whereas functions are preset commands in spreadsheets.
True (Correct)
False
Correct: Formulas are created by the user, whereas functions are preset commands in spreadsheets.
12. In the function =MAX(G3:G13), what does G3:G13 represent?
The range (Correct)
a table
an attribute
an observation
Correct: In the function =MAX(G3:G13), G3:G13 represents the range. A range is a collection of two or more cells.
13. What is the correct spreadsheet formula for multiplying cell K3 times cell K8?
=K3*K8 (Correct)
=K3^K8
=K3xK8
=K3/K8
Correct: The correct spreadsheet formula for multiplying cell K3 times cell K8 is =K3*K8. The asterisk (*) is the operator for multiplication.
14. To avoid bias when collecting data, a data analyst should keep what in mind?
Opinion
Context (Correct)
Graphs
Stakeholders
Correct: To avoid bias when collecting data, a data analyst should keep context in mind.
15. Attributes are used in spreadsheets for what purpose?
Label the data in each column (Correct)
Insert data into each column
Analyze the data in a row
Add a new column
Correct: Attributes are used to label the type of data in each column in a spreadsheet.
16. A data analyst might use descriptive column headers in order to achieve what goal?
Alphabetize the spreadsheet data
Filter the data
Add context to their data (Correct)
Protect the spreadsheet
Correct: A data analyst might use descriptive column headers in order to add context to the data.
17. Which of the following statements accurately describe formulas and functions? Select all that apply.
Formulas and functions assist data analysts in calculations, both simple and complex. (Correct)
Functions are preset commands that perform calculations. (Correct)
Formulas are instructions that perform specific calculations. (Correct)
Formulas may only be used once per spreadsheet column.
Correct: Formulas are instructions that perform specific calculations, while functions are preset commands that perform calculations. Both formulas and functions assist data analysts in performing calculations, whether simple or complex.
18. In the function =MAX(B5:B15), what does B5:B15 represent?
Column
Attribute
Observation
Range (Correct)
Correct: In the function =MAX(B5:B15), B5:B15 represents the range. A range is a collection of two or more cells.
19. What is the correct spreadsheet formula for multiplying cell H2 times cell H5?
=H2xH5
=H2/H5
=H2*H5 (Correct)
=H2^H5
Correct: The correct spreadsheet formula for multiplying cells H2 and H5 is =H2*H5. The asterisk (*) is the operator for multiplication.
21. Data analysts use metadata for what tasks? Select all that apply.
To perform data analyses
To evaluate the quality of data (CORRECT)
To interpret the contents of a database (CORRECT)
To combine data from more than one source (CORRECT)
Correct: Data analysts use metadata to combine data from more than one source, evaluate the quality of data, and interpret the contents of a database.
22. Structural metadata indicates how a piece of data is organized and whether it’s part of one or more than one data collection.
True (CORRECT)
False
Correct: Structural metadata indicates how a piece of data is organized and whether it’s part of one, or more than one, data collection.
23. What is the process that data analysts use to ensure the formal management of their company’s data assets?
Data mapping
Data governance (CORRECT)
Data aggregation
Data integrity
Correct: Data governance is the process of ensuring the formal management of a company’s data assets.
24. A data analyst chooses not to use external data because it represents diverse perspectives. This is an appropriate decision when working with external data.
True
False (CORRECT)
Correct: The fact that external data represents diverse perspectives is not an appropriate reason to avoid a dataset. A data analyst might choose not to use external data if it cannot be confirmed to be reliable.
25. A data analyst reviews a database of Wisconsin car sales to find the last car models sold in Milwaukee in 2019. How can they sort and filter the data to return the last five cars sold at the top of their list? Select all that apply.
Sort by sale date in ascending order
Sort by sale date in descending order (CORRECT)
Filter out sales outside of Milwaukee (CORRECT)
Filter out sales not in 2019 (CORRECT)
Correct: The analyst can filter out sales outside of Milwaukee and sales not in 2019, then sort by sale date in descending order.
26. When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.
True (CORRECT)
False
Correct: When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.
27. You are working with a database table that contains customer data. The first_name column lists the first name of each customer. You are only interested in customers with the first name Mark.
You write the SQL query below. Add a WHERE clause that will return only customers named Mark.
SELECT
*
FROM
customer
How many customers are named Mark?
5
2 (CORRECT)
3
1
The clause WHERE first_name = 'Mark' filters results to return only customers named Mark. The full query would be SELECT * FROM customer WHERE first_name = 'Mark'. The WHERE clause is used to specify conditions that must be met for the results to be included. It consists of the column name, an operator (such as equals), and the value to filter by. Text values should be enclosed in quotes. There are two customers named Mark.
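If you want to try the WHERE clause yourself, the sketch below runs the same query against a small in-memory SQLite table from Python. The four sample rows and the customer_id column are assumptions made up for the example; they are not the course's actual customer table, so only the query pattern carries over.

```python
import sqlite3

# Build a throwaway in-memory table with hypothetical customer rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (customer_id INTEGER, first_name TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?, ?)",
    [(1, "Mark"), (2, "Ana"), (3, "Mark"), (4, "Luis")],
)

# The WHERE clause keeps only rows whose first_name equals 'Mark'.
rows = conn.execute(
    "SELECT * FROM customer WHERE first_name = 'Mark'"
).fetchall()
mark_count = len(rows)
conn.close()

print(mark_count)
```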
28. When working with data from an external source, what can metadata help data analysts do? Select all that apply.
Choose which analyses to run
Combine data from more than one source (CORRECT)
Understand the contents of a database (CORRECT)
Ensure data is clean and reliable (CORRECT)
Correct: Metadata helps data analysts understand the contents of a database, ensure data is clean and reliable, and understand how to combine data from more than one source.
29. Think about data as driving a taxi cab. In this metaphor, which of the following are examples of metadata? Select all that apply.
Passengers the taxi picks up
Make and model of the taxi cab (CORRECT)
License plate number (CORRECT)
Company that owns the taxi (CORRECT)
Correct: The license plate number, make and model of the cab, and the company that owns the taxi represent structural metadata.
30. What are some key benefits of using external data? Select all that apply.
External data is free to use.
External data is always reliable.
External data can provide industry-level perspectives. (CORRECT)
External data has broad reach. (CORRECT)
Using external data offers significant advantages, such as a wide reach and the ability to offer insights from an industry-wide perspective.
31. A data analyst reviews a national database of movie theater showings. They want to find the first movies shown in San Francisco in 2001. How can they organize the data to return the first 10 movies shown at the top of their list? Select all that apply.
Sort by date in descending order
Sort by date in ascending order (CORRECT)
Filter out showings outside of San Francisco (CORRECT)
Filter out showings not in 2001 (CORRECT)
Correct: The analyst can filter out showings outside of San Francisco in 2001 and sort by date in ascending order.
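The filter-then-sort pattern from this question can be sketched in a few lines of Python. The showings list, its field names, and the film titles are all hypothetical; the point is only the order of operations: filter by city and year first, then sort by date in ascending order and keep the first ten.

```python
from datetime import date

# Hypothetical showings; field names are assumptions for this sketch.
showings = [
    {"title": "Film A", "city": "San Francisco", "date": date(2001, 3, 14)},
    {"title": "Film B", "city": "Oakland", "date": date(2001, 1, 2)},
    {"title": "Film C", "city": "San Francisco", "date": date(2000, 12, 30)},
    {"title": "Film D", "city": "San Francisco", "date": date(2001, 1, 5)},
]

# Filter out showings outside San Francisco and showings not in 2001.
in_scope = [
    s for s in showings
    if s["city"] == "San Francisco" and s["date"].year == 2001
]

# Ascending sort puts the earliest showings at the top of the list.
earliest_first = sorted(in_scope, key=lambda s: s["date"])
titles = [s["title"] for s in earliest_first[:10]]

print(titles)
```

To get the most recent rows at the top instead, as in the car sales question, the same sort would take `reverse=True`.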
32. You are working with a database table that contains customer data. The city column lists the city where each customer is located. You want to find out which customers are located in Berlin.
You write the SQL query below. Add a WHERE clause that will return only customers located in Berlin.
SELECT
*
FROM
customer
How many customers are located in Berlin?
9
12
2 (CORRECT)
7
33. Primary and foreign keys are two connected identifiers within separate tables. These tables exist in what kind of database?
Relational database (CORRECT)
Correct: Primary and foreign keys connect tables in a relational database.
34. Fill in the blank: Data governance is the process of ensuring that a company’s _____ are managed in a formal manner.
data assets (CORRECT)
business tasks
business strategies
data engineers
Correct: Data governance is the process of ensuring that a company’s data assets are managed in a formal manner.
35. A nonprofit maintains a list of how many laptops they provide to each school in the county. In the table, there is a column called number_of_laptops. A data analyst wants to determine which schools were given the fewest laptops. How should they sort the data to return these schools first?
Sort numerically in descending order
Sort alphabetically in ascending order
Sort numerically in ascending order (CORRECT)
Sort alphabetically in descending order
Correct: The data analyst should sort the number_of_laptops column numerically in ascending order.
36. When writing a query, you must remove the two backticks around the name of the dataset in order for the query to run properly.
True
False (CORRECT)
Correct: When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.
37. Think about data as a student at a high school. In this metaphor, which of the following are examples of metadata? Select all that apply.
Grades the student earns
Classes the student is enrolled in (CORRECT)
Student’s ID number (CORRECT)
Student’s enrollment date (CORRECT)
Correct: The student's ID number, enrollment date, and enrolled classes are examples of structural metadata.
38. Fill in the blank: Data _____ is the process of ensuring the formal management of a company’s data assets.
aggregation
governance (CORRECT)
mapping
integrity
Correct: Data governance is the process of ensuring the formal management of a company’s data assets.
39. In what circumstance might a data analyst choose not to use external data in their analysis?
The data cannot be confirmed to be reliable (CORRECT)
The data is free for anyone to access
The data represents diverse perspectives
The data is too thorough
Correct: A data analyst might choose not to use external data in their analysis if the data cannot be confirmed to be reliable.
23. In this spreadsheet, what will be returned from the formula =C4*E2?
Row  A    B    C    D    E
1    95   37   1    6    27
2    5    0    49   31   5
3    78   2    6    2    3
4    6    30   10   62   40
5    42   7    21   9    10
5
15
50 (CORRECT)
2
Correct!
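The lookup behind this answer can be traced with a short Python sketch that stores the question's grid and resolves cell references by column letter and row number. The `cell` helper is a hypothetical convenience written for this example and only handles single-letter columns.

```python
# The grid from the question, one inner list per spreadsheet row 1 to 5.
grid = [
    [95, 37, 1, 6, 27],
    [5, 0, 49, 31, 5],
    [78, 2, 6, 2, 3],
    [6, 30, 10, 62, 40],
    [42, 7, 21, 9, 10],
]

def cell(ref):
    """Return the value at a single-letter reference such as 'C4'."""
    column = ord(ref[0].upper()) - ord("A")  # 'A' -> 0, 'B' -> 1, ...
    row = int(ref[1:]) - 1                   # row 1 -> index 0
    return grid[row][column]

# C4 holds 10 and E2 holds 5, so =C4*E2 multiplies those two values.
result = cell("C4") * cell("E2")
print(result)
```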
24. Fill in the blank: A range is a _____ of two or more cells.
grid
worksheet
collection (CORRECT)
table
Correct!
25. A junior data analyst encounters a spreadsheet error. They realize that their formula is referencing a row that has been deleted, so they correct the formula. What type of error did they resolve?
ERROR or parsing error
VALUE
REF (CORRECT)
NUM
Correct!
27. In this spreadsheet, which SUM function will add all of the values between cells B1 and D6?
Row  A      B      C      D      E      F
1    5435   8568   23525  9679   2534   1745
2    1234   5421   3452   6589   4567   856
3    567    96789  2214   7887   2340   890
4    2231   43453  12375  23129  78906  9430
5    46342  86967  56796  52136  673    5856
6    35235  4740   83254  644    5794   352
=SUM(B1-D6)
=SUM(B1:D6) (CORRECT)
=SUM(B1,D6)
=SUM,B1+D6
Correct!
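To see why the colon matters, the sketch below sums the rectangle B1:D6 from the question's grid in Python and contrasts it with adding only the two corner cells, which is what a comma would do. The `sum_range` helper is hypothetical and written just for this example.

```python
# The grid from the question, one inner list per spreadsheet row 1 to 6.
grid = [
    [5435, 8568, 23525, 9679, 2534, 1745],
    [1234, 5421, 3452, 6589, 4567, 856],
    [567, 96789, 2214, 7887, 2340, 890],
    [2231, 43453, 12375, 23129, 78906, 9430],
    [46342, 86967, 56796, 52136, 673, 5856],
    [35235, 4740, 83254, 644, 5794, 352],
]

def sum_range(first_col, first_row, last_col, last_row):
    """Sum every cell in the rectangle between two corners, like =SUM(B1:D6)."""
    c0 = ord(first_col) - ord("A")
    c1 = ord(last_col) - ord("A")
    return sum(
        grid[r][c]
        for r in range(first_row - 1, last_row)
        for c in range(c0, c1 + 1)
    )

whole_range = sum_range("B", 1, "D", 6)  # =SUM(B1:D6): all 18 cells
two_cells = grid[0][1] + grid[5][3]      # =SUM(B1,D6): only B1 and D6

print(whole_range, two_cells)
```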
28. Which of the following statements accurately describe structured thinking and the problem domain? Select all that apply.
The problem domain involves establishing checkpoints so stakeholders are aware of project status and progress.
The starting place for structured thinking is the problem domain. (CORRECT)
Structured thinking involves recognizing the current problem or situation, organizing available information, revealing gaps and opportunities, and identifying options. (CORRECT)
Understanding the problem domain makes it possible to know the specific area of analysis required. (CORRECT)
Correct!
29. In this spreadsheet, what should be done to correct the DIV error in cell D2?
Row  A    B    C    D
1    68   7    404  0
2    892  0    73   #DIV/0!
3    0    528  599  0
Insert a comma between the cell ranges in the formula.
Copy the formula to the rest of the cells in column D.
Search for the correct value in column A.
Ensure the formula is not trying to divide by zero or an empty cell. (CORRECT)
Correct!
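The question does not show the formula in column D, so the following is only a sketch of the general fix: check the divisor before dividing. The `safe_divide` helper is hypothetical; in a spreadsheet, a similar guard is often written with IFERROR, for example =IFERROR(A2/B2, "").

```python
def safe_divide(numerator, denominator):
    """Divide two cell values, returning None when the divisor is zero or blank."""
    if denominator in (0, None, ""):
        return None  # a spreadsheet would show #DIV/0! at this point
    return numerator / denominator

# Dividing by a zero cell is caught; a normal division goes through.
blocked = safe_divide(892, 0)
allowed = safe_divide(68, 4)

print(blocked, allowed)
```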
30. What is the formula for dividing the value in spreadsheet cell G10 by the value in cell G9?
=G10/G9 (CORRECT)
G10,G9=
G10:G9
=G10/9
Correct!
31. In this spreadsheet, what will be returned from the formula =A4*D3?
Row  A    B    C    D    E
1    95   37   1    6    27
2    5    0    49   31   5
3    78   2    6    2    3
4    6    30   10   62   40
5    42   7    21   9    10
8
4
3
12 (CORRECT)
Correct!
32. What are some common focus areas when creating a scope of work as a data analyst? Select all that apply.
Create a line chart from the data
Note any key milestones for the project (CORRECT)
Set a schedule for the project (CORRECT)
Include information about data preparation (CORRECT)
Correct!
33. What is the formula for dividing the value in spreadsheet cell H7 by the value in cell H4?
H7,H4=
=H7/H4 (CORRECT)
H7:H4
=H7/4
Correct!
34. A data professional encounters a REF error. How should they resolve the problem?
Determine whether a referenced cell, row, or column was deleted. (CORRECT)
Confirm that referenced start and end dates are in the correct order.
Ensure the referenced formula name is recognized.
Insert a delimiter between the cell values referenced in the formula.
Correct!
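As a toy model of how a REF error arises, the Python sketch below keeps cells in a dictionary and reports "#REF!" when a formula points at a cell that no longer exists. The cell values and the `evaluate_sum` helper are hypothetical, and real spreadsheets handle deleted references in their own way; this only mirrors the troubleshooting step of checking for a deleted cell, row, or column.

```python
# Hypothetical cells; deleting one simulates deleting a referenced row.
cells = {"A1": 10, "A2": 20, "A3": 30}

def evaluate_sum(refs):
    """Add the referenced cells, or return '#REF!' if any reference is gone."""
    if any(ref not in cells for ref in refs):
        return "#REF!"
    return sum(cells[ref] for ref in refs)

before = evaluate_sum(["A1", "A2", "A3"])  # all references exist
del cells["A2"]                            # the referenced cell is deleted
after = evaluate_sum(["A1", "A2", "A3"])   # the stale reference now fails
fixed = evaluate_sum(["A1", "A3"])         # formula updated to valid cells

print(before, after, fixed)
```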
35. In this spreadsheet, which SUM function will add all of the values between cells E1 and F6?
Row  A      B      C      D      E      F
1    636    290    9034   759    2365   634
2    210    534    931    442    854    56219
3    454    3641   7607   2452   3535   56099
4    895    97096  7636   1386   36347  4095
5    1211   3456   23142  283    131    2452
6    95685  14124  8797   3463   28568  356
=SUM(E1,F6)
=SUM(E1-F6)
=SUM,E1+F6
=SUM(E1:F6) (CORRECT)
Correct!
36. In this spreadsheet, what should be done to correct the DIV error in cell D1?
Row  A    B    C    D
1    507  36   4    #DIV/0!
2    0    96   0    0
3    87   253  6    0
Insert a comma between the cell ranges in the formula.
Ensure the formula is not trying to divide by zero or an empty cell. (CORRECT)
Copy the formula to the rest of the cells in column D.
Search for the correct value in column A.
Correct!
37. Fill in the blank: A range is a collection of _____ cells.
two or more (CORRECT)
averaged
duplicate
numerically ordered
Correct!
38. What are some common focus areas when creating a scope of work as a data analyst? Select all that apply.
Develop a software solution for the analysis
Include information about analysis of quantitative and qualitative datasets (CORRECT)
Note when the client can expect reports (CORRECT)
Establish an agreed-upon outline of the work to be performed (CORRECT)
Correct!
39. A data professional troubleshoots a spreadsheet error. After researching the error, they discover that their formula is referencing a column that has been deleted. They update the formula with the correct column name. What type of error did they resolve?
NUM
ERROR or parsing error
REF (CORRECT)
VALUE
Correct!
40. In this spreadsheet, which SUM function will add all of the values between cells C1 and E6?
Row  A    B    C     D    E    F
1    43   68   225   79   34   15
2    23   54   342   89   467  856
3    67   89   24    787  20   90
4    23   43   375   129  706  43
5    34   967  596   136  67   56
6    23   440  3254  644  94   35
=SUM(C1-E6)
=SUM(C1,E6)
=SUM(C1:E6) (CORRECT)
=SUM,C1+E6
Correct!
More Spreadsheet Basics: Conclusion
Spreadsheets are incredibly versatile tools in data analytics, enabling structured thinking and helping analysts better understand and solve complex problems. If you’re interested in learning more about how data analysts leverage spreadsheets in their work, I invite you to explore the learning experience on Coursera.
This part of the course dives into the importance of structured thinking, showing how it empowers analysts to develop effective solutions and gain deeper insights into challenges. Additionally, you’ll get hands-on practice solving real-world problems using spreadsheets.
Take advantage of this opportunity to enhance your data analytics skills and gain valuable experience in a practical, engaging way!