What do you like best?
The ETL tool is always two steps ahead when it comes to Big Data awareness. Besides, it's fully embeddable, scalable and customizable.
The platform has a very smart architecture: it's designed to be extensible and to give users the ease of use they need. It also offers a large number of tools and frameworks to build on whatever new technologies bring in terms of data visualization and consumption.
What do you dislike?
In the latest releases, Pentaho has been tackling one of its biggest weaknesses: the distance between the end user and the data side of things. This still has to be improved, especially if you consider the concurrency, but as you can tell from their public activities, they're working on it as I write.
Recommendations to others considering the product
When trying to get help, follow this order:
- Support portal Knowledge Base - if applicable;
- Community Forums - check also your country for local communities;
- jira.pentaho.com - to file or read cases, not to ask for support.
What business problems are you solving with the product? What benefits have you realized?
I've worked on multiple implementations since 2008 and can give some examples of problems solved, such as:
- Determining when water meters start to produce faulty measurements/billing, for a state-level water company;
- Integrating large, critical data sets from multiple suppliers with different formats through a single pipeline;
- Integrating multiple data-mining tools using Pentaho in order to implement fraud detection at an insurance company.
The benefits I can outline are:
- The enormous community that will back you up;
- The reliability of the tool, backed by a very responsive support team;
- The fact that when it comes to sustainability, open source DOES matter, even if you're a big deployer.