New Year Sale Special - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: sntaclus

A data scientist at a consumer goods company, has been asked to do a detailed analysis on customer profiles. The Data Scientist has identified an external data source that carries valuable additional information on their customers. The data scientist also identifies the address column as the most reliable column to join the internal data source with the external data source. Addresses may appear in different formats for example:

File A = "13 Smith St"

File B = "Unit 7, 13 Smith Street"

Which of the following techniques would be useful in this situation?

A.

Deterministic linkage

B.

Probabilistic linkage

C.

Genetic linkage

D.

Cuff linkage

A data dictionary is being developed for a dataset describing a company's customer base. Within the data dictionary, which of the following represents a composite data element?

A.

Street address

B.

First name

C.

Total sale

D.

Birthdate

A 3rd party is marketing an application for financial institutions to use for credit scoring. This application is an example of what type of analytics?

A.

Descriptive analytics

B.

Prescriptive analytics

C.

Exploratory

D.

Inferential

After analyzing sales data, the analytics team finds that the older the customer, the more expensive the neckties purchased. The team felt this was a breakthrough insight but on closer analysis realized that other factors could account for this relationship. This is a clear indication that:

A.

Correlation between variables implies causation

B.

Causation has no relationship with correlation

C.

Causation between variables does not imply correlation

D.

Correlation between variables does not imply causation

The analytics team is identifying research questions to address a business problem. The business analysis professional reminds the

team that the most important dimension to consider is the:

A.

Sources of data

B.

Quality of the data

C.

Timeframe of analysis

D.

Measurement scale

The research question prompting the use of analytics is well-defined. The team obtains the results and determines that the source data did not provide reliable results. As a result of this finding, the team modifies the original question to one that can be answered by the data. What is a risk that could impact the value of this analysis?

A.

The objective of the original research may not be met

B.

Timelines will be pushed out making stakeholders unhappy

C.

Increased costs associated with the source data

D.

The quality of the analysis may be negatively impacted

While sourcing data, an analyst runs into a situation where different business units are usingdifferent names to refer to the same data element. This lack of standardization is resulting in confusion and additional time required to properly prepare data for analysis. Which practice, if implemented would address this situation and mature the organization's business analytics practice?

A.

Data quality management

B.

Database operations management

C.

Data warehousing

D.

Meta data management

A dataset contains 10 measures of workplace sustainability. The analytics team is in need of producing a single score of sustainability. Which of the following techniques if used would achieve this objective?

A.

Logistic regression

B.

Linkage algorithms

C.

Factor analysis

D.

K means clustering

A marketing director has asked the question 'How many product purchases are expected this coming year given the current marketing campaign?". What type of analytics would be performed to answer this question?

A.

Descriptive

B.

Predictive

C.

Diagnostic

D.

Prescriptive

Operation managers are concerned about the increasing attrition rates in the call center. A series of interviews is being conducted with call center agents to collect information to better understand the problem. Interviewees will ask open and closed ended questions that are both quantitative and qualitative. Which of the following is considered a qualitative open-ended question?

A.

How does call volume contribute to job burnout?

B.

Would morale improve if you could work 2 days per week from home?

C.

How many calls on average do you service in an hour?

D.

Do you receive more calls on Mondays or Fridays?