AWS Data Pipeline

(2)
1.8 out of 5 stars

AWS Data Pipeline is a web service that helps you process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals.

Work for AWS Data Pipeline?
Db2 on cloud 2x

Learning about AWS Data Pipeline?

We can help you find the solution that fits you best.

Find the Right Product

AWS Data Pipeline Reviews

Write a Review
Filter Reviews
Filter Reviews
Ratings
Company Size
User Role
User Industry
Showing 2 AWS Data Pipeline reviews
LinkedIn Connections
AWS Data Pipeline review by <span>Sheehan A.</span>
Sheehan A.
Validated Reviewer
Invitation from G2 Crowd
Reviewed On

Powerful, but needs better GUI Tools

What do you like best?

AWS Data Pipeline makes it very easy to get started and move data between various sources. If you're already using AWS services such as S3 or Redshift, Data Pipeline heavily reduces the lines of code / applications required to move data between AWS data sources.

You can simply run data pipeline on a schedule, specify where you want to move data between, and tell it to run processing in a series of steps if you need to filter your data. It is a good and simple way to do basic data movement.

Pipeline retries, gives you errors, and has a dashboard that shows you all of the jobs that are in queue.

What do you dislike?

Data Pipeline can be a black box at times. Error messages are not good, and it is difficult to understand what exactly failed since it is an amazon service. The scheduler doesn't give timely notifications at times, so it is hard to determine the true state of the data pipeline

Recommendations to others considering the product

If you are a small company and don't want to spend resources building your own pipeline, use AWS. It is out of the box, gets the job done, and you can learn to work around some of its deficiencies. It provides a great way to schedule the movement of data.

What business problems are you solving with the product? What benefits have you realized?

We have various ETL jobs that run on data pipeline. We move data between various sources into a data warehouse for final reporting. Data pipeline is used to move all data into a single data lake, and then run pre-processing steps before loading into a warehouse.

Sign in to G2 Crowd to see what your connections have to say about AWS Data Pipeline
Headshots
AWS Data Pipeline review by User in Internet
User in Internet
Validated Reviewer
Invitation from G2 Crowd
Reviewed On

Json blob is a horrible way to organize jobs

What do you like best?

The only thing I can appreciate is once it is setup then it is fine.

What do you dislike?

It really should be redesigned as a json blob is a terrible way to organize SQL jobs. Why can't things be scheduled like a calendar or meeting request?

What business problems are you solving with the product? What benefits have you realized?

We use it for our ETL process.

What Relational Databases solution do you use?

Thanks for letting us know!

There are not enough reviews of AWS Data Pipeline for G2 Crowd to provide buying insight. Below are some alternatives with more reviews:

1
Microsoft SQL Logo
Microsoft SQL
(944)
SQL Server 2017 brings the power of SQL Server to Windows, Linux and Docker containers for the first time ever, enabling developers to build intelligent applications using their preferred language and environment. Experience industry-leading performance, rest assured with innovative security features, transform your business with AI built-in, and deliver insights wherever your users are with mobile BI.
2
MySQL Logo
MySQL
(518)
MySQL is an open source database solution.
3
Oracle Database 12c Logo
Oracle Database 12c
(286)
Helps customers lower IT costs and deliver a higher quality of service by enabling consolidation onto database clouds.
4
PostgreSQL Logo
PostgreSQL
(209)
PostgreSQL is a powerful, open source object-relational database system.
5
SAP HANA Logo
SAP HANA
(124)
SAP HANA converges database and application platform capabilities in-memory to transform transactions, analytics, text analysis, predictive and spatial processing so businesses can operate in real-time.
6
Db2 Logo
Db2
(107)
IBM® Db2® is the database that offers enterprise-wide solutions handling high-volume workloads. It is optimized to deliver industry-leading performance while lowering costs.
7
SQLite Logo
SQLite
(105)
SQLite is a software library that implements a self-contained, serverless, zero-configuration, transactional SQL database engine
8
Teradata Database Logo
Teradata Database
(63)
The Teradata Database easily and efficiently handles complex data requirements and simplifies management of the data warehouse environment.
9
MariaDB Logo
MariaDB
(53)
MariaDB is a high performance, open source database that helps the world's busiest websites deliver more content, faster.
10
SAP HANA, express edition Logo
SAP HANA, express edition
(26)
SAP HANA, express edition is a streamlined version of SAP HANA that can run on laptops and other resource-constrained hosts, such as a cloud-hosted virtual machine. SAP HANA, express edition is free to use for in-memory databases up to 32GB of RAM.
Show more
Kate avatar
Kate from G2 Crowd

Learning about AWS Data Pipeline?

I can help.
* We monitor all AWS Data Pipeline reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.