Apache Hive

Apache Hive, data warehouse infrastructure on Hadoop

Hive is a software that works on Hadoop clusters creating a layer that allows the developer to abstract from the management of HDFS and MapReduce files through SQL-based data query operations, with the HiveQL language.

Editor de consultas SQL de Apache Hive

Hive can perform queries of not too much complexity, it does not allow transactional operations, and by providing a language similar to the SQL of relational databases to work with large amounts of data, this software is very suitable for data warehouse and analytics environments. For these reasons, Apache Hive is defined as a data warehouse infrastructure on top of Hadoop.

Hive was initially developed by Facebook, although it has evolved as an open source project of Apache, within the Hadoop ecosystem, and is currently used by large companies such as Netflix or Amazon in Amazon Elastic MapReduce or AWS.

Apache Hive is installed as a tool within a Hadoop installation and obviously needs Hadoop clusters to be running in order to be able to work on them.

Queries to Hive can be launched either directly from a command line environment or from applications via standard data connectors such as JDBC or ODBC. It should be noted that the abstraction layer provided by Hive, while it can greatly simplify the development of data-driven applications, is not as efficient as the direct use of MapReduce and HDFS file management, as the interpreter increases application latency considerably.

Log in to post comments

Otros productos software del fabricante

Apache Hadoop

Arquitectura de apache Hadoop

The Hadoop software library is a framework that enables distributed processing of large datasets using clusters of computers or servers, using simple programming models.

Hadoop is designed to scale…

Apache Spark

Spark is an open source framework from Apache Software Foundation for distributed processing of large amounts of data on clusters of computers, designed for use in Big Data environments, and created to enhance the capabilities of its…

Prueba Semrush gratis 14 días!

Empresas especializadas

Featured software

Globalgest ERP

GlobalGest ERP

Globalgest ERP is a cloud-based enterprise resource planning software designed for construction, engineering, environmental, photovoltaic and general facilities companies...

LANSA BI

LANSA BI is a business intelligence tool that seamlessly integrates with IBM DB2 databases and is specially designed to provide analytics for IBM i/AS400 applications.
Its native integration with DB2 enables real-time data analysis and business intelligence report…

Semrush

Semrush Semrush is a web tool for SEO and SEM analysis, focused on the search for keywords (Keyword Research) and competitive analysis.
This web tool, pay per use, provides a user-friendly analysis using data giving access to organic positioning and pay per click for the top 20 positioned keywords in the search results (SERP) of local versions of Google and Bing search engines for key countries and to more than 71…

Promoted Resources

Today's Top Picks for Our Readers:

Recommended by