Q1. Here are some of the advantages of Apache Pig: ease of programming, behind-the-scenes code optimization, and extensibility of capabilities. Please discuss each of the advantages in more detail and use an example for at least one of them.
Deadline: 1 Day.
Q2. Your will select a big data analytics project that is introduced to an organization of your choice … please address the following items:
Provide a background of the company chosen.
Determine the problems or opportunities that that this project will solve. What is the value of the project?
Describe the impact of the problem. In other words, is the organization suffering financial losses? Are there opportunities that are not exploited?
Provide a clear description regarding the metrics your team will use to measure performance. Please include a discussion pertaining to the key performance indicators (KPIs).
Recommend a big data tool that will help you solve your problem or exploit the opportunity, such as Hadoop, Cloudera, MongoDB, or Hive.
Evaluate the data requirements. Here are questions to consider: What type of data is needed? Where can you find the data? How can the data be collected? How can you verify the integrity of the data?
Discuss the gaps that you will need to bridge. Will you need help from vendors to do this work? Is it necessary to secure the services of other subject matter experts (SMEs)?
What type of project management approach will you use this initiative? Agile? Waterfall? Hybrid? Please provide a justification for the selected approach.
Provide a summary and conclusion.