bidezones.com | Hadoop Multiple Choice Question | bidezones.com | Apache Hadoop - Online Exam Test Papers | Apache Hadoop - MCQs[multiple choice questions and answers ] | Apache Hadoop - Mock Test Papers | Apache Hadoop - Practice Papers

Question:

 Distributed Cache can be used in

1.Mapper phase only

2. Reducer phase only

3.In either phase, but not on both sides simultaneously

4.In either phase

Posted Date:-2022-05-12 08:58:03

Question:

 Managed tables in Hive:

1. Can load the data only from HDFS

2.Can load the data only from local file system

3.Are useful for enterprise wide data

4. Are Managed by Hive for their data and metadata

Posted Date:-2022-05-12 10:15:33

Question:

 Maximum size allowed for small dataset in replicated join is:

1.10KB

2.10 MB

3.100 MB

4.500 MB

Posted Date:-2022-05-12 09:52:39

Question:

 Read the statement and select the correct options: 

distcp command ALWAYS needs fully qualified hdfs paths.

1.True

2.False

3.True, if source and destination are in the same cluster

4.False, if source and destination are in the same cluster

Posted Date:-2022-05-12 08:42:13

Question:

 Reduce side join is useful for

1.Very large datasets

2.Very small data sets

3.One small and other big data sets

4.One big and other small datasets

Posted Date:-2022-05-12 08:57:06

Question:

 The schema of a relation can be examined through:

1.ILLUSTRATE

2.DESCRIBE

3.DUMP

4.EXPLAIN

Posted Date:-2022-05-12 09:54:59

Question:

 What does â€œVelocityâ€ in Big Data mean?

1.Speed of input data generation

2.Speed of individual machine processors

3.Speed of ONLY storing data

4.Speed of storing and processing data

Posted Date:-2022-05-11 04:02:15

Question:

 What does commodity Hardware in Hadoop world mean?

1.Very cheap hardware

2.Industry standard hardware

3.Discarded hardware

4.Low specifications Industry grade hardware

Posted Date:-2022-05-11 04:00:09

Question:

 What is the optimal size of a file for distributed cache?

1.<=10 MB

2.>=250 MB

3.<=100 MB

4.<=35 MB

Posted Date:-2022-05-12 09:04:21

Question:

 What should be an upper limit for counters of a Map Reduce job?

1. ~5s

2. ~15

3.~150

4.~50

Posted Date:-2022-05-12 09:44:32

Question:

 Which of the following are NOT big data problem(s)?

1.Parsing 5 MB XML file every 5 minutes

2.Processing IPL tweet sentiments

3.Processing online bank transactions

4.both (a) and (c)

Posted Date:-2022-05-11 04:01:11

Question:

 Which of the following are not possible in Hive?

1.Creating Synonym

2.Writing Update Statements

3.Creating Indexes

4.Both (a) and (b)

Posted Date:-2022-05-12 10:01:17

Question:

 Which of the following class is responsible for converting inputs to key-value Pairs of Map Reduce

1.FileInputFormat

2.InputSplit

3.RecordReader

4.Mapper

Posted Date:-2022-05-12 09:45:33

Question:

 Which of the following is NOT the component of Flume? (B)

1.Sink

2.Database

3.Source

4.Channel

Posted Date:-2022-05-12 08:44:45

Question:

 Which of the following is/are true about combiners?

1.Combiners can be used for mapper only job

2. Combiners can be used for any Map Reduce operation

3.Mappers can be used as a combiner class

4.Combiners are primarily aimed to improve Map Reduce performance

Posted Date:-2022-05-12 08:56:05

Question:

 Which of the following type of joins can be performed in Reduce side join operation?

1.Equi Join

2.Left Outer Join

3.Full Outer Join

4.All of the above

Posted Date:-2022-05-12 09:43:31

Question:

 Who will initiate the mapper?

1.Task tracker

2.Job tracker

3.Combiner

4.Reducer

Posted Date:-2022-05-12 10:02:12

Question:

A Map reduce job can be written in:

1.Java

2.Ruby

3.Python

4.Any Language which can read from input stream

Posted Date:-2022-05-12 09:47:23

Question:

Data can be supplied to PigUnit tests from:

1.HDFS Location

2.Within Program

3.Both (a) and (b)

4.None of the above

Posted Date:-2022-05-12 09:56:13

Question:

From the options listed below, select the suitable data sources for the flume.

1.Publicly open web sites

2.Local data folders

3.Remote web servers

4.Both (a) and (c)

Posted Date:-2022-05-12 08:40:51

Question:

Hadoop is a framework that allows the distributed processing of:

1. Small Data Sets

2. Semi-Large Data Sets

3.Large Data Sets

4.Large and Small Data sets

Posted Date:-2022-05-12 10:04:54

Question:

Hadoop is open source.

1.ALWAYS True

2.True only for Apache Hadoop

3.True only for Apache and Cloudera Hadoop

4.ALWAYS False

Posted Date:-2022-05-11 04:24:58

Question:

Hive can be used for real time queries.

1.True

2.False

3.True if a data set is small

4.True for some distributions

Posted Date:-2022-05-12 08:17:15

Question:

Hive queries response time is in order of

1.Hours at least

2.Minutes at least

3.Seconds at least

4.Milliseconds at least

Posted Date:-2022-05-12 10:14:27

Question:

Name Node monitors block replication process

1.TRUE

2.FALSE

3.Depends on file type

4.All of the above

Posted Date:-2022-05-12 10:08:20

Question:

NameNode tries to keep the first copy of data nearest to the client machine.

1.ALWAYS true

2.ALWAYS False

3.True if the client machine is the part of the cluster

4.True if the client machine is not the part of the cluster

Posted Date:-2022-05-12 08:27:04

Question:

Number of mappers is decided by the

1.Mappers specified by the programmer

2.Available Mapper slots

3.Available heap memory

4.Input Splits

Posted Date:-2022-05-12 09:05:59

Question:

Parameters could be passed to Pig scripts from:

1.Parent Pig Scripts

2.Shell Script

3.Configuration File

4.All the above except (a)

Posted Date:-2022-05-12 09:53:47

Question:

Pig is a:

1.Programming Language

2.Data Flow Language

3.Query Language

4.Database

Posted Date:-2022-05-12 09:48:20

Question:

Pig is good for:

1.Data Factory operations

2.Creating multiple datasets from a single large dataset

3.Implementing complex SQLs

4.Both (A) and (B)

Posted Date:-2022-05-12 09:50:10

Question:

Read the statement:

NameNodes are usually high storage machines in the clusters.

1.True

2.False

3.Depends on cluster size

4.True if co-located with Job tracker

Posted Date:-2022-05-12 08:35:58

Question:

Sliding window operations typically fall in the category of__________________.

1.OLTP Transactions

2.Big Data Batch Processing

3.Big Data Real Time Processing

4.Small Batch Processing

Posted Date:-2022-05-11 04:16:27

Question:

The mechanism used to create replica in HDFS is____________.

1.Gossip protocol

2.Replicate protocol

3.HDFS protocol

4.Store and Forward protocol

Posted Date:-2022-05-12 08:22:28

Question:

The term Big Data first originated from:

1.Stock Markets Domain

2.Banking and Finance Domain

3.Genomics and Astronomy Domain

4.Social Media Domain

Posted Date:-2022-05-11 04:03:16

Question:

What decides number of Mappers for a MapReduce job?

1.File Location

2.mapred.map.tasks parameter

3.Input file size

4.Input Splits

Posted Date:-2022-05-12 10:07:05

Question:

What is HBase used as?

1.Tool for Random and Fast Read/Write operations in Hadoop

2.Faster Read only query engine in Hadoop

3.MapReduce alternative in Hadoop

4.Fast MapReduce layer in Hadoop

Posted Date:-2022-05-11 04:17:35

Question:

What is Hive used as?

1. Hadoop query engine

2. MapReduce wrapper

3.Hadoop SQL interface

4.All of the above

Posted Date:-2022-05-11 04:18:36

Question:

What is the default HDFS block size?

1.32 MB

2.64 KB

3.128 KB

4.64 MB

Posted Date:-2022-05-12 08:18:39

Question:

What is the default HDFS replication factor?

1.4

2.1

3.3

4.2

Posted Date:-2022-05-12 08:19:16

Question:

Where is the HDFS replication factor controlled?

1.mapred-site.xml

2.yarn-site.xml

3.core-site.xml

4.hdfs-site.xml

Posted Date:-2022-05-12 08:28:17

Question:

Which of following is the return data type of Filter UDF?

1.String

2.Integer

3.Boolean

4.None of the above

Posted Date:-2022-05-12 09:59:25

Question:

Which of following statement(s) are correct?

1.Master and slaves files are optional in Hadoop 2.x

2.Master file has list of all name nodes

3.Core-site has hdfs and MapReduce related common properties

4.hdfs-site file is now deprecated in Hadoop 2.x

Posted Date:-2022-05-12 10:11:00

Question:

Which of following statement(s) are true about distcp command? (A)

1.It invokes MapReduce in background

2.It invokes MapReduce if source and destination are in the same cluster

3.It canâ€™t copy data from the local folder to hdfs folder d)

4.You canâ€™t overwrite the files through distcp command

Posted Date:-2022-05-12 08:43:50

Question:

Which of the following are example(s) of Real Time Big Data Processing?

1.Complex Event Processing (CEP) platforms

2.Stock market data analysis

3.Bank fraud transactions detection

4. both (a) and (c)

Posted Date:-2022-05-11 04:15:17

Question:

Which of the following are NOT metadata items?

1.HDFS block locations

2.Replication factor of files

3.Access Rights

4.File Records distribution

Posted Date:-2022-05-12 10:06:09

Question:

Which of the following are NOT true for Hadoop?

1.Itâ€™s a tool for Big Data analysis

2.It supports structured and unstructured data analysis

3.It aims for vertical scaling out/in scenarios

4.Both (a) and (c)

Posted Date:-2022-05-11 04:19:46

Question:

Which of the following are the Big Data Solutions Candidates?

1.Processing 30 minutes Flight sensor data

2.Interconnecting 50K data points (approx. 1 MB input file)

3.Processing User clicks on a website

4.All of the above

Posted Date:-2022-05-12 10:03:44

Question:

Which of the following are the core components of Hadoop?

1.HDFS

2.Map Reduce

3.HBase

4.Both (a) and (b)

Posted Date:-2022-05-11 04:23:23

Question:

Which of the following are true for Hadoop Pseudo Distributed Mode?

1.It runs on multiple machines

2.Runs on multiple machines without any daemons

3.Runs on Single Machine with all daemons

4.Runs on Single Machine without all daemons

Posted Date:-2022-05-12 10:09:41

Question:

Which of the following Batch Processing instance is NOT an example of 
Big Data Batch Processing?

1.Processing 10 GB sales data every 6 hours

2.Processing flights sensor data

3.Web crawling app

4.Trending topic analysis of tweets for last 15 minutes

Posted Date:-2022-05-11 04:14:18

Question:

Which of the following can be used to control the number of part files  in a map reduce program output directory?

1. Number of Mappers

2.Number of Reducers

3.Counter

4.Partitioner

Posted Date:-2022-05-12 08:53:42

Question:

Which of the following constructs are valid Pig Control Structures?

1.f-else

2.For Loop

3.Until Loop

4.None of the above

Posted Date:-2022-05-12 09:57:58

Question:

Which of the following Hadoop config files is used to define the heap size?

1.hdfs-site.xml

2.core-site.xml

3.hadoop-env.sh

4.Slaves

Posted Date:-2022-05-12 08:30:01

Question:

Which of the following is NOT a type of metadata in NameNode?

1.List of files

2.Block locations of files

3.No. of file records

4.File access control information

Posted Date:-2022-05-12 08:20:09

Question:

Which of the following is not a valid Hadoop config file?

1.mapred-site.xml

2.hadoop-site.xml

3.core-site.xml

4.Masters

Posted Date:-2022-05-12 08:30:56

Question:

Which of the following is the correct representation to access â€˜â€™Skillâ€ from the Bag {â€˜Skillsâ€™,55, (â€˜Skillâ€™, â€˜Speedâ€™), {2, (â€˜Sanâ€™, â€˜Mateoâ€™)}}

1.$3.$1

2.$3.$0

3.$2.$0

4.$2.$1

Posted Date:-2022-05-12 09:51:26

Question:

Which of the following is the correct sequence of MapReduce flow?

1.Combine ??Reduce ??Map

2.Map ??Combine ??Reduce

3.Reduce ??Combine ??Map

4.None of These

Posted Date:-2022-05-12 08:46:22

Question:

Which of the following is the highest level of Data Model in Hive?

1.Table

2.View

3.Database

4.Partitions

Posted Date:-2022-05-12 10:12:57

Question:

Which of the following is true for Hive?

1.Hive is the database of Hadoop

2.Hive supports schema checking

3.Hive doesnâ€™t allow row level updates

4.Hive can replace an OLTP system

Posted Date:-2022-05-12 10:11:56

Question:

Which of the following is/are correct?

1.NameNode is the SPOF in Hadoop 1.x

2.NameNode is the SPOF in Hadoop 2.x

3.NameNode keeps the image of the file system also

4.Both (a) and (c)

Posted Date:-2022-05-12 08:21:08

Question:

Which of the following operations canâ€™t use Reducer as combiner also?

1.Group by Minimum

2.Group by Maximum

3.Group by Count

4.Group by Average

Posted Date:-2022-05-12 08:54:54

Question:

Which of the following writable can be used to know the value from a mapper/reducer?

1.Text

2.IntWritable

3.Nullwritable

4.String

Posted Date:-2022-05-12 09:46:21

Posted on by Our Team

bidezones.com /Apache Hadoop/Hadoop Multiple Choice Question Sample Test,Sample questions

Question: Distributed Cache can be used in

Question: Managed tables in Hive:

Question: Maximum size allowed for small dataset in replicated join is:

Question: Read the statement and select the correct options: distcp command ALWAYS needs fully qualified hdfs paths.

Question: Reduce side join is useful for

Question: The schema of a relation can be examined through:

Question: What does â€œVelocityâ€ in Big Data mean?

Question: What does commodity Hardware in Hadoop world mean?

Question: What is the optimal size of a file for distributed cache?

Question: What should be an upper limit for counters of a Map Reduce job?

Question: Which of the following are NOT big data problem(s)?

Question: Which of the following are not possible in Hive?

Question: Which of the following class is responsible for converting inputs to key-value Pairs of Map Reduce

Question: Which of the following is NOT the component of Flume? (B)

Question: Which of the following is/are true about combiners?

Question: Which of the following type of joins can be performed in Reduce side join operation?

Question: Who will initiate the mapper?

Question: A Map reduce job can be written in:

Question: Data can be supplied to PigUnit tests from:

Question: From the options listed below, select the suitable data sources for the flume.

Question: Hadoop is a framework that allows the distributed processing of:

Question: Hadoop is open source.

Question: Hive can be used for real time queries.

Question: Hive queries response time is in order of

Question: Name Node monitors block replication process

Question: NameNode tries to keep the first copy of data nearest to the client machine.

Question: Number of mappers is decided by the

Question: Parameters could be passed to Pig scripts from:

Question: Pig is a:

Question: Pig is good for:

Question: Read the statement: NameNodes are usually high storage machines in the clusters.

Question: Sliding window operations typically fall in the category of__________________.

Question: The mechanism used to create replica in HDFS is____________.

Question: The term Big Data first originated from:

Question: What decides number of Mappers for a MapReduce job?

Question: What is HBase used as?

Question: What is Hive used as?

Question: What is the default HDFS block size?

Question: What is the default HDFS replication factor?

Question: Where is the HDFS replication factor controlled?

Question: Which of following is the return data type of Filter UDF?

Question: Which of following statement(s) are correct?

Question: Which of following statement(s) are true about distcp command? (A)

Question: Which of the following are example(s) of Real Time Big Data Processing?

Question: Which of the following are NOT metadata items?

Question: Which of the following are NOT true for Hadoop?

Question: Which of the following are the Big Data Solutions Candidates?

Question: Which of the following are the core components of Hadoop?

Question: Which of the following are true for Hadoop Pseudo Distributed Mode?

Question: Which of the following Batch Processing instance is NOT an example of Big Data Batch Processing?

Question: Which of the following can be used to control the number of part files in a map reduce program output directory?

Question: Which of the following constructs are valid Pig Control Structures?

Question: Which of the following Hadoop config files is used to define the heap size?

Question: Which of the following is NOT a type of metadata in NameNode?

Question: Which of the following is not a valid Hadoop config file?

Question: Which of the following is the correct representation to access â€˜â€™Skillâ€ from the Bag {â€˜Skillsâ€™,55, (â€˜Skillâ€™, â€˜Speedâ€™), {2, (â€˜Sanâ€™, â€˜Mateoâ€™)}}

Question: Which of the following is the correct sequence of MapReduce flow?

Question: Which of the following is the highest level of Data Model in Hive?

Question: Which of the following is true for Hive?

Question: Which of the following is/are correct?

Question: Which of the following operations canâ€™t use Reducer as combiner also?

Question: Which of the following writable can be used to know the value from a mapper/reducer?

More MCQS[bidezones.com ]

Question:
Distributed Cache can be used in

Question:
Managed tables in Hive:

Question:
Maximum size allowed for small dataset in replicated join is:

Question:
Read the statement and select the correct options: distcp command ALWAYS needs fully qualified hdfs paths.

Question:
Reduce side join is useful for

Question:
The schema of a relation can be examined through:

Question:
What does â€œVelocityâ€ in Big Data mean?

Question:
What does commodity Hardware in Hadoop world mean?

Question:
What is the optimal size of a file for distributed cache?

Question:
What should be an upper limit for counters of a Map Reduce job?

Question:
Which of the following are NOT big data problem(s)?

Question:
Which of the following are not possible in Hive?

Question:
Which of the following class is responsible for converting inputs to key-value Pairs of Map Reduce

Question:
Which of the following is NOT the component of Flume? (B)

Question:
Which of the following is/are true about combiners?

Question:
Which of the following type of joins can be performed in Reduce side join operation?

Question:
Who will initiate the mapper?

Question:
A Map reduce job can be written in:

Question:
Data can be supplied to PigUnit tests from:

Question:
From the options listed below, select the suitable data sources for the flume.

Question:
Hadoop is a framework that allows the distributed processing of:

Question:
Hadoop is open source.

Question:
Hive can be used for real time queries.

Question:
Hive queries response time is in order of

Question:
Name Node monitors block replication process

Question:
NameNode tries to keep the first copy of data nearest to the client machine.

Question:
Number of mappers is decided by the

Question:
Parameters could be passed to Pig scripts from:

Question:
Pig is a:

Question:
Pig is good for:

Question:
Read the statement: NameNodes are usually high storage machines in the clusters.

Question:
Sliding window operations typically fall in the category of__________________.

Question:
The mechanism used to create replica in HDFS is____________.

Question:
The term Big Data first originated from:

Question:
What decides number of Mappers for a MapReduce job?

Question:
What is HBase used as?

Question:
What is Hive used as?

Question:
What is the default HDFS block size?

Question:
What is the default HDFS replication factor?

Question:
Where is the HDFS replication factor controlled?

Question:
Which of following is the return data type of Filter UDF?

Question:
Which of following statement(s) are correct?

Question:
Which of following statement(s) are true about distcp command? (A)

Question:
Which of the following are example(s) of Real Time Big Data Processing?

Question:
Which of the following are NOT metadata items?

Question:
Which of the following are NOT true for Hadoop?

Question:
Which of the following are the Big Data Solutions Candidates?

Question:
Which of the following are the core components of Hadoop?

Question:
Which of the following are true for Hadoop Pseudo Distributed Mode?

Question:
Which of the following Batch Processing instance is NOT an example of Big Data Batch Processing?

Question:
Which of the following can be used to control the number of part files in a map reduce program output directory?

Question:
Which of the following constructs are valid Pig Control Structures?

Question:
Which of the following Hadoop config files is used to define the heap size?

Question:
Which of the following is NOT a type of metadata in NameNode?

Question:
Which of the following is not a valid Hadoop config file?

Question:
Which of the following is the correct representation to access â€˜â€™Skillâ€ from the Bag {â€˜Skillsâ€™,55, (â€˜Skillâ€™, â€˜Speedâ€™), {2, (â€˜Sanâ€™, â€˜Mateoâ€™)}}

Question:
Which of the following is the correct sequence of MapReduce flow?

Question:
Which of the following is the highest level of Data Model in Hive?

Question:
Which of the following is true for Hive?

Question:
Which of the following is/are correct?

Question:
Which of the following operations canâ€™t use Reducer as combiner also?

Question:
Which of the following writable can be used to know the value from a mapper/reducer?