Apache ZooKeeper is a centralized service that facilitates the coordination of distributed applications using a simple hierarchal architecture. In addition to supporting centralized administration of configuration information, ZooKeeper offers a number of features out of the box, including naming, synchronization, and group services. ZooKeeper services can be replicated onto server clusters, called ensembles, which allow for a scalable environment with a high-level of availability.
The driver supports Apache ZooKeeper on both 32-bit and 64-bit versions of the following operating systems:
Windows
Linux
AIX
The driver supports retrieving connection information from Apache ZooKeeper services using the methods described in this section.
To retrieve connection information from an Apache ZooKeeper service:
Set the Zookeeper Discovery option to 1 (Enabled).
Set the Zookeeper Namespace option to specify the name of the ZooKeeper name space to which you want to retrieve configuration information. The default is /hiveserver2.
Set the Host Name connection option to specify a list of the member servers for the ZooKeeper ensemble to which you want to connect. The value for this option takes the following form:
One of the primary benefits of ZooKeeper is that it provides centralized management of configuration settings. To take advantage of this feature, the driver transparently retrieves configuration information from the ZooKeeper service at connection. This information is then used to determine the behavior of the driver when establishing the session with the Hive server. However, as a result, settings for certain connection options are overridden by settings provided by the service. The following table describes the type of information typically retrieved by the driver and affected connection options.
Table 5. Configuration information retrieved from Apache ZooKeeper