Question 7: You have a huge volume of logs collected from your website's web servers and want to analyze them to generate recommendations. You chose HDFS to store the regularly collected logs, and you asked your HDFS administrator to set a replication factor of 3 for this data. Why did you ask for three copies of your data?

A. Data protection and scheduling flexibility

B. Elimination of the requirement for a combiner process

C. Elimination of the requirement for the shuffle and sort process

D. Memory optimization and minimizing tasks to run
