Big Data Analytics
Big Data Analytics
Big Data Analytics
A. Open-source
B. Real-time
C. Java-based
Question 2. What are the challenges faced by optimization of Big Data analysis ?
B. Both data and cost-effective ways to mine data to make business sense out of it
Question 3. Which among the Listed step is not used for deployment of big data solution
A. Data Ingestion
B. Data Processing
C. Data dissemination
D. Data Storage
Question 4. Which among the following is used to provide multiple inputs to Hadoop?
A. MultipleInputs class
B. MultipleInput Format
C. FileInput Format
D. DBInput Format
A. Spreads data
B. Analyse data.
C. Organize data
D. Collect data
Question 6. Which analytics tool lets users create charts and dashboards to share online?
A. Apache Spark
B. Plotly
C. Lumify
D. None
B. Tasks
C. Maps
D. Records
A. Job node
B. Data node
C. Task node
D. Name node
Question 9. Studying the Forms of Big Data, which one of these is not included?
A. Structured
B. Unstructured
C. Processed
D. Semi-Structured
Question 10. Which among the following has the world’s largest Hadoop cluster?
A. Apple
B. Datamatics
C. Facebook
B. Key- Everything up to tab character Value- Remaining part of the line after tab character
Question 12. Identify the framework used for performing remote procedure calls and data
serialization.
A. Drill
B. BigTop
C. Avro
D. Chukwa
Question 13. Which part of the MapReduce is responsible for processing one or more chunks of data
and producing the output results?
A. Maptask
B. Mapper
C. Task execution
B. Shareware
D. Commercial
A. Grunt
B. FS
C. HDFS
Question 16. ______ is interpolated into the quotes to correctly handle spaces within the schema.
A. $SCHEMA
B. $ROW
C.$SCHEMASPACES
D.$NAMESPACES
Question 17. Identify slave/worker node that holds the user data in the form of Data Blocks.
A. Data Block
B. NameNode
C. DataNode
D. Replication
Question 18. Predictive analytics relies on capturing relationships between explanatory variables and
the ___.
A. Predicted variables
B. Descriptive variables
C. Prescriptive variables
D. All of the mentioned above
₹Question 19. Apart from HBaseAdmin which is the other important class in this package that
provide DDL functionalities.
A. HTableDescriptor
B. HDescriptor
C. HTable
D. HTabDescriptor
Question 20. To register a “watch” on a znode data, you need to use the ___ commands to access
the current content or metadata.
A. stat
B. put
C. receive
D. gets
A. master-worker fashion
C. worker/slave fashion
Question 22. Which industries employ the use of so-called "Big Data" in their day to day operations?
A. Weather forecasting
B. Marketing
C. Healthcare
Question 23. Which of the following scenario may not be a good fit for HDFS?
A. HDFS is not suitable for scenarios requiring multiple/simultaneous writes to the same file
B. HDFS is suitable for storing data related to applications requiring low latency data access
C. HDFS is suitable for storing data related to applications requiring low latency data access
Question 24.______ was designed to overcome the limitations of the other Hive file formats.
A. ORC
B. OPC
C. ODC
Question 25._ ____ is general-purpose computing model and runtime system for distributed data
analytics.
A. Mapreduce
B. Drill
C. Oozie
A. Commercial
B. Shareware
Question 28. Fault Tolerance in RDD is achieved using which of the following:
C. Lazy-evaluation
Question 29. Which algorithm is not the solution for multiclass classification problem?
A. Naive Bayes
B. Random Forests
C. Logistic Regression
D. Decision Trees