Hive

(29)
4.2 out of 5 stars

Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.

Work for Hive?

Learning about Hive?

We can help you find the solution that fits you best.

Find the Right Product

Hive Reviews

Write a Review
Filter Reviews
Filter Reviews
  • Ratings
  • Company Size
  • User Role
  • User Industry
Ratings
Company Size
User Role
User Industry
Showing 32 Hive reviews
LinkedIn Connections
Hive review by User in Internet
User in Internet
Validated Reviewer
Verified Current User
Invitation from G2 Crowd
Reviewed On

Hive is useful but can be quirky

What do you like best?

The best thing about HIVE is that anyone that is familiar with SQL can take advantage of HIVE's ability to run map reduce jobs. Newer version of HIVE is getting better at supporting windowing functions and fleshing out any inconsistencies. So far the documentation is good enough for getting me through my tasks and there is still on-going support for this product, which is a pretty good sign to me.

What do you dislike?

Older versions of HIVE sucks. There are lots of limitations that will force you to write HiveQL queries that are not straight forward and, even potentially, inefficient. For example, no support for window functions and no equality comparisons on joins can make your life very difficult so you will need to fall back to using some whacky full joins or self joins to accomplish the same task.

What business problems are you solving with the product? What benefits have you realized?

We are using HIVE as a data warehouse. One of the benefits of HIVE is that it can break your SQL queries into a series of map reduce jobs, so its supposed to speed up your queries if given enough compute nodes.

Sign in to G2 Crowd to see what your connections have to say about Hive
Headshots
Hive review by Consultant in Information Technology and Services
Consultant in Information Technology and Services
Validated Reviewer
Verified Current User
Invitation from G2 Crowd
Reviewed On

Hive review for its use in big data applications

What do you like best?

Hive is great for handling logs in big data projects. We are using the same in our project and it is great for using joins and grouping which is very difficult and tricky in map reduce. It has a lot of udf packages and it is very easy to add new udfs. We were also using bucketing and clustering to optimize the query. Concept of external tables and the way we can manipulate data even when table is deleted from hive is really amazing. Lot of connectors available in the market for different softwares.

What do you dislike?

The thing which I dislike is latency and the way it saves data. While inserting data I have to wait a lot of few records. Compiler execution plan is very immature as it does not do proper query optimization. Though the community is working fast for overcoming quickly but I think it will take time for hive to be

Recommendations to others considering the product

Yes, I would highly recommend this product as this helps to solve a lot of problems mainly for logs. It has lots of connectors and so compatibility issues are not there. You can use it with hbase , tableau etc.

So, its worth using.

What business problems are you solving with the product? What benefits have you realized?

We are using hive mainly for saving our logs. it helps us to keep track of what records are inserted, which records have failed and what are relationship between them. we are using tableau for analyzing data .

What Other Non-Relational Databases solution do you use?

Thanks for letting us know!
Hive review by <span>Dhharvi S.</span>
Dhharvi S.
Validated Reviewer
Invitation from G2 Crowd
Reviewed On

Big Data with SQL

What do you like best?

Hive provides an ease to the user who wants to store bulk data, in a tabular manner.

It works on the same queries like SQL, making it easy for using the traditional database system.

Because of this reason, people need not have to study some new language and can still adapt to the Big Data Culture.

Also it has features like partition, and bucketing, helping in segregation of data.

Data can directly be loaded into hive, by HDFS, using the CSV files of the same format, or from Hbase by making a pointer to the Hbase table, providing a link within Hadoop.

What do you dislike?

For small amount of data also, it runs map reduce job, which consumes some time, and thus is not efficient for the same.

We do not have a concept of primary key in Hive, so we can have redundant entries.

Also till the older version, update and delete were not possible, and now also in the new version, if we want to use the update and delete commands, the performance of the tool gets degraded.

Recommendations to others considering the product

For storing bulk amount of data in a tabular manner, and where there's no need need of primary key, or just in case, if redundant data is received, it will not cause a problem.

What business problems are you solving with the product? What benefits have you realized?

We are using Hive for storing logs, of data, being generated, in our business.

Further we will be using these logs for reconciliation purpose, helping in keeping a track of data.

Hive review by User in Higher Education
User in Higher Education
Validated Reviewer
Invitation from G2 Crowd
Reviewed On

HIve Review - Data Science Perspective

What do you like best?

Hive is the best out there for answering ad-hoc queries in parallel paradigm. It works very well with Hadoop Echo system (mainly integrates perfectly with HDFS).

- Easy to use as it implements most of SQL functions.

What do you dislike?

- Needs more optimization for complex queries (like caching, auto-partitioning,etc ...) to speed up the latency of the queries.

- Tuning the hive parameters is really challenging for the users. The default settings don't work with the large queries.

- Hive is perfect if 90-95% of the queries are read-only. It is not suitable for applications with heavily updates

What business problems are you solving with the product? What benefits have you realized?

Get quick insights from big data in case of the customers' data don't fit on one machine. It helps a lot for data preparation (i.e. creating temporary tables), that can be consumed by other machine learning solutions like Spark to build machine learning models that add more business values.

Hive review by <span>Pavlo S.</span>
Pavlo S.
Validated Reviewer
Verified Current User
Invitation from G2 Crowd
Reviewed On

Hive as the base for BI tools to get data from Hadoop

What do you like best?

The Hive is intended to simplify your experience with Hadoop and allows developers and business analyst apply their SQL knowledge to query data, build reports, build etl etc.

What do you dislike?

As the open source software it has common issues with support. Also Hive doesn't support many features that traditional SQL has.

Recommendations to others considering the product

The Hive and Hadoop is not a database in classical understanding, and the purpouse is to proceed the big volumes of data. But in case you'll try to query some small table you'll notice that it can take x1000 more time to get resulting the data.

Hive doesn't work with single record and it should not be considered as persisnent arrea for biling like systems.

What business problems are you solving with the product? What benefits have you realized?

The main purpouse of using Hive is to building reports and do analysis of data that is stored in the Hadoop file system. As for now it is the only one framework that can be used by all most popular BI tools to read the data from the HDFS.

Hive review by <span>Pradeepkumar K.</span>
Pradeepkumar K.
Validated Reviewer
Verified Current User
Invitation from G2 Crowd
Reviewed On

For all the batch operations!

What do you like best?

The syntax of hive! Its almost SQL so its easy to use. External tables, partitions, buckets, UDFs all the features I like to use with hive. ORC data format occupying lesser space and retrieving the data much faster.

Learning curve looks easier as it is similar to SQL but hold on! you must learn all the features of hive before writing a big hql to join multiple hundreds GBs tables and fetch results. Otherwise if you write it like a regular SQL it may take hours to process. So hive is always at its best when you set the optimization parameters before you run your scripts. Also its complex datatypes make hive more useful than other RDBMS.

What do you dislike?

Hive is comparatively slower than its competitors. Its easy to use but that comes with the cost of processing, If you are using it just for batch processing then hive is well and fine.

Recommendations to others considering the product

If you are looking for some easy to use big data product to run queries and generate reports on batch mode then Hive is the tool!

What business problems are you solving with the product? What benefits have you realized?

Generating datasets from huge files for reporting purposes.

Kate avatar
Kate from G2 Crowd

Learning about Hive?

I can help.
* We monitor all Hive reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.