Answer: D
Option A is incorrect. Hive is an open-source data warehouse and analytics package that runs on top of a Hadoop cluster. Hive scripts use a SQL-like language called HiveQL (Hive Query Language) that abstracts programming models and supports typical data warehouse interactions. Hive lets you avoid the complexity of writing Tez jobs based on directed acyclic graphs (DAGs) or MapReduce programs in a lower-level language such as Java. Hive extends the SQL paradigm by including serialization formats. You can also customize query processing by creating a table schema that matches your data, without touching the data itself. In contrast to SQL (which only supports primitive value types such as dates, numbers, and strings), values in Hive tables can be structured elements, such as JSON objects, any user-defined data type, or any function written in Java.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hive.html
Option B is incorrect. Apache HBase is an open-source, non-relational, distributed database developed as part of the Apache Software Foundation's Hadoop project. HBase runs on top of the Hadoop Distributed File System (HDFS) to provide non-relational database capabilities for the Hadoop ecosystem.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hbase.html
Option C is incorrect. Apache HCatalog is a tool that allows you to access Hive metastore tables within Pig, Spark SQL, and/or custom MapReduce applications. HCatalog has a REST interface and command line client that allows you to create tables or do other operations.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hcatalog.html
Option D is correct. Presto is a fast SQL query engine designed for interactive analytic queries over large datasets from multiple sources.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-presto.html
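To make "multiple sources" concrete, here is a minimal sketch of what a cross-source Presto query can look like. Presto addresses tables as catalog.schema.table, so a single statement can join data held in different systems. The catalog names (hive, mysql), the schema and table names, the column names, and the helper build_federated_query are all hypothetical and chosen only for illustration; the catalogs actually available depend on how the connectors are configured on the cluster.

```python
# Illustrative only: every catalog, schema, table, and column name below is an
# assumption, not something taken from the question or from a real cluster.
def build_federated_query(hive_table: str, mysql_table: str, limit: int = 10) -> str:
    """Return a Presto SQL string joining a Hive table with a MySQL table.

    Both arguments are given as "schema.table"; the catalog prefix is added here.
    """
    return (
        "SELECT c.region, count(*) AS orders\n"
        f"FROM hive.{hive_table} AS o\n"            # data stored in the Hive catalog
        f"JOIN mysql.{mysql_table} AS c ON o.customer_id = c.id\n"  # data in MySQL
        "GROUP BY c.region\n"
        "ORDER BY orders DESC\n"
        f"LIMIT {limit}"
    )


query = build_federated_query("weblogs.orders", "crm.customers")
print(query)
```

The resulting string could be submitted through the Presto CLI or any Presto client. The point is only that one interactive SQL statement spans two data sources, which is the behavior the question is asking about.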
Note:
Interactive data exploration is the job of engines such as Presto and Apache Drill. The question is not about data science; it is about analyzing data from multiple data sources. If the workload involved running machine learning, Apache Zeppelin would come into the picture. For interactive data exploration on EMR, the usual answer is Presto.