A data scientist at a consumer goods company, has been asked to do a detailed analysis on customer profiles. The Data Scientist has identified an external data source that carries valuable additional information on their customers. The data scientist also identifies the address column as the most reliable column to join the internal data source with the external data source. Addresses may appear in different formats for example:
File A = "13 Smith St"
File B = "Unit 7, 13 Smith Street"
Which of the following techniques would be useful in this situation?
A data dictionary is being developed for a dataset describing a company's customer base. Within the data dictionary, which of the following represents a composite data element?
A 3rd party is marketing an application for financial institutions to use for credit scoring. This application is an example of what type of analytics?
After analyzing sales data, the analytics team finds that the older the customer, the more expensive the neckties purchased. The team felt this was a breakthrough insight but on closer analysis realized that other factors could account for this relationship. This is a clear indication that:
The analytics team is identifying research questions to address a business problem. The business analysis professional reminds the
team that the most important dimension to consider is the:
The research question prompting the use of analytics is well-defined. The team obtains the results and determines that the source data did not provide reliable results. As a result of this finding, the team modifies the original question to one that can be answered by the data. What is a risk that could impact the value of this analysis?
While sourcing data, an analyst runs into a situation where different business units are usingdifferent names to refer to the same data element. This lack of standardization is resulting in confusion and additional time required to properly prepare data for analysis. Which practice, if implemented would address this situation and mature the organization's business analytics practice?
A dataset contains 10 measures of workplace sustainability. The analytics team is in need of producing a single score of sustainability. Which of the following techniques if used would achieve this objective?
A marketing director has asked the question 'How many product purchases are expected this coming year given the current marketing campaign?". What type of analytics would be performed to answer this question?
Operation managers are concerned about the increasing attrition rates in the call center. A series of interviews is being conducted with call center agents to collect information to better understand the problem. Interviewees will ask open and closed ended questions that are both quantitative and qualitative. Which of the following is considered a qualitative open-ended question?