Refer to the exhibit.
A data analyst needs to calculate the mean for Q1 sales using the data set below: Which of the following is the mean?
Correct Answer: C
Explanation:
The mean is the average of all the values in a data set. To calculate the mean, we add up all the
values and divide by the number of values. In this case, the mean for Q1 sales is ($2,000 + $3,000 +
$4,000 + $2,500 + $3,500) / 5 = $3,082.72 Reference: CompTIA Data+ Certification Exam Objectives,
page 9
Question 2
A data analyst is creating a report that will provide information about various regions, products, and
time periods. Which of the following formats would be the MOST efficient way to deliver this report?
Correct Answer: D
Explanation:
A dashboard with filters at the top that the user can toggle would be the most efficient way to deliver
this report, because it allows the user to customize the view and explore different combinations of
regions, products, and time periods. A workbook with multiple tabs for each region would be
cumbersome and repetitive. A daily email with snapshots of regional summaries would not provide
enough detail or interactivity. A static report with a different page for every filtered view would be
too long and hard to navigate. Reference: CompTIA Data+ Certification Exam Objectives, page 14
Question 3
Refer to the exhibit.
A customer list from a financial services company is shown below: A data analyst wants to create a likely-to-buy score on a scale from 0 to 100, based on an average of
the three numerical variables: number of credit cards, age, and income. Which of the following
should the analyst do to the variables to ensure they all have the same weight in the score
calculation?
Correct Answer: D
Explanation:
vNormalizing the variables means scaling them to a common range, such as 0 to 1 or -1 to 1, so that they have the same weight in the score calculation. Recoding the variables means changing their values or categories, which would alter their meaning and distribution. Calculating the percentiles of the variables means ranking them relative to each other, which would not account for their actual magnitudes. Calculating the standard deviations of the variables means measuring their variability, which would not make them comparable. Reference: CompTIA Data+ Certification Exam Objectives, page 10
Question 4
Which of the following actions should be taken when transmitting data to mitigate the chance of a
data leak occurring? (Choose two.)
Correct Answer: D, E
Explanation:
Data encryption and data masking are two actions that can be taken when transmitting data to
mitigate the chance of a data leak occurring. Data encryption means transforming data into an
unreadable format that can only be decrypted with a key. Data masking means hiding or replacing
sensitive data with fictitious or anonymized data. Both methods protect the confidentiality and
integrity of the data in transit. Reference: CompTIA Data+ Certification Exam Objectives, page 13
Question 5
Which of the following BEST describes the issue in which character values are mixed
with integer values in a data set column?
Correct Answer: D
Explanation:
The invalid data type is the best description for the issue in which character values are mixed with
integer values in a data set column. Invalid data type means that the data does not match the
expected or required format or structure for a given variable or attribute. For example, if a column is
supposed to store numerical values, but some rows contain text values, then those rows have an
invalid data type. Reference: CompTIA Data+ Certification Exam Objectives, page 10
Demo Practice Mode
You are viewing only the questions marked as Demo.