Apache Hama is a great application for parallel computing. The main feature that they have is for bulk data, that means even though the processing can be pretty slow for millions of data entries Apache Hama will do it efficiently. They support Bulk synchronous programming, vertex-centric programming, and neuron-centric programming. There are manuals available for these programming models which will help you get started.
What do you dislike?
Apache Hama should support more programming models like POSIX. They should have a better interface for the models that they readily support. The manuals available should be better and clearer. The manuals they provide are not easy to implement and understand. Even an expert would not be able to do it. All in all you might find some bugs all over but they are minor. They should be quick to fix them as soon as possible for users to use their product.
Recommendations to others considering the product
If you want a lot of parallel processing on large amounts of data then it is a good idea to produce bulk synchronous programming models and use Apache Hama for the computations, it will be pretty efficient to use this software tool for that data related problems.
What business problems are you solving with the product? What benefits have you realized?
We have a big data center with servers collecting large amounts of data and we need to extensively work on this data to provide intel to the business division. Apache Hama helps us to analyse and optimize the data with parallel computing.
* We monitor all Apache Hama reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.