using spark programing language Q1. Which of the below option is…

Question Answered step-by-step using spark programing language Q1. Which of the below option is… using spark programing language    Q1. Which of the below option is not the join type in spark?Select one:a. right_antib. innerc. leftd. left_outer  Q2. Filter function in where clause is return?Select one:a. Stringb. Booleanc. Intd. Value Q3. Which property is required to set for limit the log level to error?Select one:a. sc.setLogTo(“ERROR”)b. sc.setInfoLevel(“ERROR”)c. sc.setLogLevel(“ERROR”)d. none of above Q4 . We have employee.json file stored at hdfs location “/user/spark/json/employee”. Write spark dataframe api to read the json file. API should infer schema as well.  Q5.Data frame APIs are low level API of spark? yes or No  Q6. which of the below property is required to convert dataframe to dataset?Select one:a. import spark.implicit._b. import spark.sqlc. import org.apache.spark.sql._d. import spark.encoding  Q7. Which programming language is preferred when you are writing low level api like rdd in spark?Select one:a. Scalab. Javac. Pythond. R Q8: Convert below sql query into spark dataframe api. Select customer_id, concat(city,” “, state) as address, substring(customer_email, -1, 3) as email_type from customer where customer_zipcode in (select zipcode from address); Q9. which of the below dataframe api involve in data shuffling ?Select one:a. df.groupByb. df.na.dropc. df.withColumnRenamed. df.selectExpr  Q10. You have been given below patient data in csv format, patientID,name,dateOfBirth,lastVisitDate1001,Ah Teck,1991-12-31,2012-01-201002,Kumar,2011-10-29,2012-09-201003,Ali,2011-01-30,2012-10-21 Accomplish the following activities using dataframe api or dataset api. 1. Find all the patients whose lastVisitDate between current time and ‘2012-09-15’2. Find all the patients who born in 20113. Find all the patients age4. List patients whose last visited more than 60 days ago5. Select patients 18 years old or younger  Q11. You have been given below two files course.txtid,course1,Hadoop 2,Spark3,HBasefee.txtid,fee2,39003,42004,2900Accomplish the following activities using dataframe api or dataset api.1. Select all the courses and their fees , whether fee is listed or not.2. Select all the available fees and respective course. If course does not exists still list the fee3. Select all the courses and their fees , whether fee is listed or not. However, ignore records having fee as null.   Computer Science Engineering & Technology Object-Oriented Programming PROGRAMMER PRFE Share QuestionEmailCopy link Comments (0)