300 Questions for OREILLY DataBricks Apache Spark Developer Certification + 5 Page Revision notes

Question 6 : You have executed the Python Spark code below

1. >>> lines = sc.textFile("hadoopexam.txt") 

2. >>> lines.count() 

3. 127

4. >>> lines.first() 

5. u'# Apache Spark'

In which line is the first RDD created?

1.     1 

2.     2 

3.     3

4.    4

5.    5 

Correct Answer : 1

Exp :

>>> lines = sc.textFile("README.md") # Create an RDD called lines

>>> lines.count() # Count the number of items in this RDD

127


>>> lines.first() # First item in this RDD, i.e. first line of README.md

u'# Apache Spark'


In the example above, the variable called lines is an RDD, created here from a text file on our local machine. We can run various parallel operations on the RDD, such as counting the number of elements in the dataset (here lines of text in the file) or printing the first one. We will discuss RDDs in great depth in later chapters, but before we go any further, let's take a moment now to introduce basic Spark concepts.
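The distinction the answer relies on can be sketched without a Spark installation. The block below is a toy, single-process model and not Spark code: ToyRDD, ToyContext, run_session and demo are hypothetical names invented for this illustration, and the sample file contents are made up. It only mirrors the shape of the session above: the textFile call on line 1 produces the dataset object, while count and first on lines 2 and 4 return plain Python values.

```python
import os
import tempfile


class ToyRDD:
    """Hypothetical stand-in for an RDD of text lines (NOT Spark's implementation)."""

    def __init__(self, path):
        # Only the source is recorded here; the file is not read yet.
        self.path = path

    def _lines(self):
        # Read the whole file and drop the trailing newline of each line.
        with open(self.path, encoding="utf-8") as handle:
            return [line.rstrip("\n") for line in handle]

    def count(self):
        # Action: reads the data and returns a plain int.
        return len(self._lines())

    def first(self):
        # Action: returns the first element as a plain str.
        lines = self._lines()
        if not lines:
            raise ValueError("RDD is empty")
        return lines[0]


class ToyContext:
    """Hypothetical stand-in for the `sc` object used in the session above."""

    def textFile(self, path):
        return ToyRDD(path)


def run_session(path):
    sc = ToyContext()
    lines = sc.textFile(path)  # line 1: the dataset object is created here
    n = lines.count()          # line 2: action, returns an int
    head = lines.first()       # line 4: action, returns a str
    return lines, n, head


def demo():
    # Write a small, made-up sample file so the sketch is self-contained.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".txt", delete=False, encoding="utf-8"
    ) as tmp:
        tmp.write("# Apache Spark\n\nSpark is a cluster computing system.\n")
        path = tmp.name
    try:
        return run_session(path)
    finally:
        os.remove(path)
```

Calling demo() returns a ToyRDD instance, the integer 3, and the string '# Apache Spark'. Only the first of these is a dataset object, which is why option 1 is the line that creates the RDD. With PySpark installed, the same three calls on a real SparkContext follow the session shown in the question.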

