Step 1 :
Download latest PDI version from http://pentaho.com/download.
Choose 32 Bit or 64 Bit based on OS requirements.
Pentaho Kettle Data Integration Installation for Windows, Pentaho Data Integration ( PDI ) - Generic Design Guidelines
Tuesday, September 24, 2013
Friday, September 20, 2013
Pentaho Data Integration ( PDI ) - Generic Design Guidelines
Design for Failure Handling
Recommended to ensure that the data source is available before a process is kicked off. One basic design principle is that the ETL job needs to be able to fail gracefully when a data availability test fails.
Kettle contains following features to do this.
Recommended to ensure that the data source is available before a process is kicked off. One basic design principle is that the ETL job needs to be able to fail gracefully when a data availability test fails.
Kettle contains following features to do this.
- Test a repository connection.
- Ping a host to check whether it's available.
- Wait for a SQL command to return success/failure based on a row count condition.
- Check for empty folders.
- Check for the existence of a file, table, or column.
- Compare files or folders.
- Set a timeout on FTP and SSH connections.
- Create failure/success outputs on every available job step.
Pentaho Data Integration ( PDI ) - Overview
Pentaho Data Integration is a flexible tool that allows collecting data from disparate sources such as databases, files, and applications, and turning the data into a unified format that is accessible and relevant to end users. Pentaho Data Integration provides the Extraction, Transformation, and Loading (ETL) engine that facilitates the process of capturing the right data, cleansing the data, and storing the data using a uniform and consistent format.
Common Uses of Pentaho Data Integration Include:
Common Uses of Pentaho Data Integration Include:
Subscribe to:
Posts (Atom)
-
This step can be used to perform various types of aggregations such as sum, average, min, max e.t.c. Input data always need to be sorted for...
-
The Get Filenames step allows you to retrieve information associated with filenames in the file system. The obtained file name is added to ...
-
Error Message Error connecting to database [ORA_TEST_JDBC] : org.pentaho.di.core.exception.KettleDatabaseException: Error occured while tr...