[Data Analysis]
A data analyst receives a request for the current employee head count and runs the following SQL
statement:
SELECT COUNT(EMPLOYEE_ID) FROM JOBS
The returned head count is higher than expected because employees can have multiple jobs. Which
of the following should return an accurate employee head count?
D
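The double counting can be reproduced with a small sketch. It assumes JOBS holds one row per employee-job pair; the JOB_TITLE column and the sample rows are hypothetical and are not part of the question. Counting distinct EMPLOYEE_ID values is one standard way to collapse the repeated rows.

```python
import sqlite3

# Hypothetical JOBS table: one row per (employee, job) pair.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE JOBS (EMPLOYEE_ID INTEGER, JOB_TITLE TEXT)")
conn.executemany(
    "INSERT INTO JOBS VALUES (?, ?)",
    [(1, "Analyst"), (1, "Trainer"), (2, "Engineer"), (3, "Clerk"), (3, "Cashier")],
)

# Counts every non-NULL row, so an employee with two jobs is counted twice.
row_count = conn.execute("SELECT COUNT(EMPLOYEE_ID) FROM JOBS").fetchone()[0]

# Counts each EMPLOYEE_ID once, however many jobs it appears with.
head_count = conn.execute(
    "SELECT COUNT(DISTINCT EMPLOYEE_ID) FROM JOBS"
).fetchone()[0]

print(row_count, head_count)
```

With these sample rows the first query counts five job rows while the second counts three employees.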
[Data Governance]
A data analyst created a dashboard to illustrate the traffic volume and mean response time for a call
center. The traffic data is current, but the mean response time has not updated for more than an
hour. Which of the following is the best way to verify the data's freshness?
C
[Data Governance]
Which of the following pieces of information, if made public, results in a data privacy violation?
B
[Data Acquisition and Preparation]
A data analyst receives four files that need to be unified into a single spreadsheet for further
analysis. All of the files have the same structure, number of columns, and field names, but each file
contains different values. Which of the following methods will help the analyst convert the files into
a single spreadsheet?
B
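Because the files share the same columns and differ only in their rows, stacking the rows under a single header (often called appending or concatenating) is one way to unify them. The sketch below is illustrative only: the CSV contents, the column names, and the `append_files` helper are hypothetical, and in-memory strings stand in for the four files.

```python
import csv
import io

# Hypothetical stand-ins for the four files; real code would open files on disk.
files = [
    "id,region,amount\n1,East,100\n",
    "id,region,amount\n2,West,200\n",
    "id,region,amount\n3,South,300\n",
    "id,region,amount\n4,North,400\n",
]


def append_files(texts):
    """Stack rows from same-structured CSV texts under one shared header."""
    header = None
    rows = []
    for text in texts:
        reader = csv.reader(io.StringIO(text))
        file_header = next(reader)
        if header is None:
            header = file_header
        elif file_header != header:
            # Appending only makes sense when every file has the same columns.
            raise ValueError("files do not share the same columns")
        rows.extend(reader)
    return header, rows


header, rows = append_files(files)
print(header, len(rows))
```

The header check guards the assumption stated in the question: if any file had different field names, a row-wise append would silently misalign values.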
[Data Analysis]
A data analyst team needs to segment customers based on customer spending behavior. Given one
million rows of data like the information in the following sales order table:
Customer_ID | Region | Amount_spent | Product_category | Quantity_of_items
00123       | East   | 20000        | Baby             |
00124       | West   | 30000        | Home             |
00125       | South  | 40000        | Garden           |
00126       | North  | 50000        | Furniture        |
00127       | East   | 60000        | Baby             |
Which of the following techniques should the team use for this task?
C
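Grouping customers by spending without predefined labels is commonly done with clustering. The following is a minimal sketch of one-dimensional k-means on the Amount_spent values from the sample table. The `kmeans_1d` helper, the choice of two clusters, and the use of a single feature are assumptions made for illustration; real work on a million rows would more likely use several features and a library such as scikit-learn.

```python
def kmeans_1d(values, k, iterations=20):
    """Tiny 1-D k-means (requires k >= 2): returns (centroids, labels)."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centroids across the sorted values.
    centroids = [ordered[i * (len(ordered) - 1) // (k - 1)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iterations):
        # Assignment step: each value joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: abs(v - centroids[c])) for v in values
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centroids[c] = sum(members) / len(members)
    return centroids, labels


# Amount_spent values taken from the sample rows in the question.
amounts = [20000, 30000, 40000, 50000, 60000]
centroids, labels = kmeans_1d(amounts, k=2)
print(centroids, labels)
```

Here the three lower amounts end up in one segment and the two higher amounts in the other.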
[Data Governance]
A data analyst receives a notification that a customized report is taking too long to load. After
reviewing the system, the analyst does not find technical or operational issues. Which of the
following should the analyst try next?
A
[Data Concepts and Environments]
Which of the following best describes the reason an analyst would reference a data dictionary versus
a source's metadata?
B
[Data Acquisition and Preparation]
A data analyst is joining two tables with different content and one common field. Which of the
following should the analyst do to most efficiently meet this requirement?
A
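Joining on the shared field can be sketched as follows. The CUSTOMERS and ORDERS tables, their columns, and the sample rows are hypothetical; the question only states that the two tables have different content and one common field.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical tables with different content that share only CUSTOMER_ID.
conn.execute("CREATE TABLE CUSTOMERS (CUSTOMER_ID INTEGER PRIMARY KEY, REGION TEXT)")
conn.execute("CREATE TABLE ORDERS (ORDER_ID INTEGER, CUSTOMER_ID INTEGER, AMOUNT INTEGER)")
conn.executemany("INSERT INTO CUSTOMERS VALUES (?, ?)", [(1, "East"), (2, "West")])
conn.executemany(
    "INSERT INTO ORDERS VALUES (?, ?, ?)",
    [(10, 1, 250), (11, 2, 400), (12, 1, 75)],
)

# Match rows from both tables wherever the common field has the same value.
joined = conn.execute(
    """
    SELECT o.ORDER_ID, c.REGION, o.AMOUNT
    FROM ORDERS AS o
    JOIN CUSTOMERS AS c ON c.CUSTOMER_ID = o.CUSTOMER_ID
    ORDER BY o.ORDER_ID
    """
).fetchall()

print(joined)
```

Each order row picks up the region of its customer through the common key, so columns from both tables appear side by side in the result.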
[Data Concepts and Environments]
A data analyst pulls a table similar to the following one:
ID | Type      | TypeID      | Phone
   | Full Time | Full Time 1 | Mobile
   | Part Time | Part Time 2 | Work
   | Full Time | Full Time 3 | Mobile
Which of the following best explains the data issue with TypeID?
A
[Data Analysis]
Which of the following AI types is the best option for time-series forecasting?
B