I tried to setup connection for Hive (HDP) using Teradata Studio, while setting up connection profile,TD Studio asks for JDBC and WebHCat information and forces the connection info for both to be setup.
I am curious to know the purpose for both the connection types? If I can connect using JDBC then why does WebHCat information is required? and vice versa. Would it be good if user has choice to select any one? or disable one of the mode of connectivity?
Currently, the DSE and Navigator are populated by WebHCat and JDBC is used for running queries. Getting rid of WebHCat dependencies if JDBC credentials are provided is on our roadmap for this year.
Studio uses webHcat to gather the catalog information on the Hadoop database and table objects for the Data Source Explorer. We have a JIRA opened to use HiveQL to gather that information. We refer to the HiveQL connection as JDBC.