AWS Data Pipeline

1.8 out of 5 stars (2 reviews)

AWS Data Pipeline is a web service that helps you process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals.

AWS Data Pipeline Reviews

Showing 2 AWS Data Pipeline reviews
AWS Data Pipeline review by Sheehan A. (Validated Review)

Powerful, but needs better GUI Tools

What do you like best?

AWS Data Pipeline makes it very easy to get started and move data between various sources. If you're already using AWS services such as S3 or Redshift, Data Pipeline greatly reduces the amount of code and the number of applications required to move data between AWS data sources.

You can simply run Data Pipeline on a schedule, specify the source and destination of the data you want to move, and tell it to run processing in a series of steps if you need to filter your data. It is a good and simple way to do basic data movement.

Data Pipeline retries failed jobs, reports errors, and has a dashboard that shows all of the jobs in the queue.
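
As a rough illustration of the setup described above (a schedule, a source and a destination, a follow-up filtering step, and retries), here is a minimal Python sketch that builds a pipeline definition as a plain dictionary. The bucket paths, object ids, and shell command are hypothetical, and the field names are assumptions about the pipeline definition format, so check them against the AWS documentation before relying on them. Nothing here contacts AWS.

```python
import json


def build_definition(source, target, period="1 day", max_retries=3):
    """Build a pipeline definition as a plain dict; nothing is sent to AWS."""
    sched = {"ref": "RunSchedule"}
    objects = [
        # When the pipeline runs (field names assumed, see lead-in).
        {"id": "RunSchedule", "type": "Schedule",
         "period": period, "startAt": "FIRST_ACTIVATION_DATE_TIME"},
        # The compute resource the activities run on.
        {"id": "Worker", "type": "Ec2Resource", "schedule": sched},
        # Where data is read from and written to (hypothetical paths).
        {"id": "Source", "type": "S3DataNode",
         "schedule": sched, "directoryPath": source},
        {"id": "Target", "type": "S3DataNode",
         "schedule": sched, "directoryPath": target},
        # Step 1: copy the data, retrying on failure.
        {"id": "CopyStep", "type": "CopyActivity", "schedule": sched,
         "runsOn": {"ref": "Worker"},
         "input": {"ref": "Source"}, "output": {"ref": "Target"},
         "maximumRetries": str(max_retries)},
        # Step 2: a placeholder filtering step that runs after the copy.
        {"id": "FilterStep", "type": "ShellCommandActivity", "schedule": sched,
         "runsOn": {"ref": "Worker"},
         "command": "echo 'filter the copied data here'",
         "dependsOn": {"ref": "CopyStep"}},
    ]
    return {"objects": objects}


def unresolved_refs(definition):
    """Return (object id, field, missing id) for each reference to an undefined object."""
    known = {obj["id"] for obj in definition["objects"]}
    missing = []
    for obj in definition["objects"]:
        for field, value in obj.items():
            if isinstance(value, dict) and "ref" in value and value["ref"] not in known:
                missing.append((obj["id"], field, value["ref"]))
    return missing


definition = build_definition("s3://example-bucket/raw/", "s3://example-bucket/clean/")
print(json.dumps(definition, indent=2))
```

Checking references locally with `unresolved_refs` before uploading is one way to catch a mistyped object id early, since a definition like this is otherwise just nested JSON.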

What do you dislike?

Data Pipeline can be a black box at times. The error messages are poor, and because it is an Amazon service, it is difficult to understand exactly what failed. The scheduler sometimes fails to give timely notifications, so it is hard to determine the true state of the pipeline.

Recommendations to others considering the product

If you are a small company and don't want to spend resources building your own pipeline, use AWS Data Pipeline. It works out of the box, gets the job done, and you can learn to work around some of its deficiencies. It provides a great way to schedule the movement of data.

What business problems are you solving with the product? What benefits have you realized?

We have various ETL jobs that run on Data Pipeline. We move data from various sources into a data warehouse for final reporting. Data Pipeline is used to move all data into a single data lake and then run pre-processing steps before loading it into the warehouse.

AWS Data Pipeline review by User in Internet (Validated Review)
What do you like best?

The only thing I can appreciate is that once it is set up, it is fine.

What do you dislike?

It really should be redesigned; a JSON blob is a terrible way to organize SQL jobs. Why can't things be scheduled like a calendar entry or a meeting request?

What business problems are you solving with the product? What benefits have you realized?

We use it for our ETL process.

There are not enough reviews of AWS Data Pipeline for G2 Crowd to provide buying insight. Below are some alternatives with more reviews:

1. Microsoft SQL (703 reviews): Microsoft SQL enables the user to build mission-critical applications using high-performance, in-memory security technology across OLTP, data warehousing, business intelligence and analytics.
2. MySQL (456 reviews): MySQL is an open source database solution.
3. Oracle Database 12c (263 reviews): Helps customers lower IT costs and deliver a higher quality of service by enabling consolidation onto database clouds.
4. PostgreSQL (194 reviews): PostgreSQL is a powerful, open source object-relational database system.
5. SAP HANA (116 reviews): SAP HANA converges database and application platform capabilities in-memory to transform transactions, analytics, text analysis, predictive and spatial processing so businesses can operate in real-time.
6. SQLite (98 reviews): SQLite is a software library that implements a self-contained, serverless, zero-configuration, transactional SQL database engine.
7. DB2 (87 reviews): IBM® DB2® is the database that offers enterprise-wide solutions handling high-volume workloads. It is optimized to deliver industry-leading performance while lowering costs.
8. Teradata Database (61 reviews): The Teradata Database easily and efficiently handles complex data requirements and simplifies management of the data warehouse environment.
9. MariaDB (52 reviews): MariaDB is a high performance, open source database that helps the world's busiest websites deliver more content, faster.
10. Informix Enterprise Edition (25 reviews): IBM® Informix® Enterprise Edition enables you to store, access and query all your data with first-rate efficiency and agility. Real-time analytics, always-on transactions and seamless data integration get you to answers faster and speed time to market. And with the exception of Informix Warehouse Accelerator, this database includes all Informix features on all supported platforms to provide you easier app development and deployment with unlimited scalability. Full grid and replication capabilities ensure always-on data access, and optional storage compression helps maximize resources.
* We monitor all AWS Data Pipeline reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.