Data Warehouse Guide
You are here: Home Datastage vs Informatica

Datastage vs Informatica

E-mail Print PDF

The differences mentioned below are generic ones. At some places the corresponding version of the software is mentioned, which means it might be applicable to only that version. However, please be advised that with release of new version of the software some of the features and behaviours go for a change.

Integration with the repository

Datastage is closely integrated with its repository (universe database). Informatica is not. With the introduction of repository server, they have isolated the server calls with repository calls to reduce the load. This has both advantages and disadvantages.

Transformation Engine

Datastage is more powerful transformation engine by using functions (Oconv and IConv) and routines. We can do almost any transformation. Informatica is more visual, programmer friendly.

Lookups

Lookups are much faster in Datastage, because the way the hash files are built. You can tune the hash files to get an optimal performance.

Job performance

Datastage best practices calls for landing the data in between transformations and have smaller simpler job. The job when compiled generates a Basic routine and bigger the routine is, slower the job performs. For a simple project, you might end up having 3-4 times more jobs in Datastage than in Informatica.

Handling heterogeneous sources

Datastage does not perform very well with heterogeneous sources. You might end up extracting data from all the sources and putting them intoa hash and start your transformation. This may not be the case with Informatica.
Deployment facility: Ability to handle initial deployment, major & minor releases and patches with ease.

  • Informatica: Yes.
  • DataStage: No

Support for looping the source row (For While Loop).

  • Informatica: Supports for comparing immediate previous record
  • DataStage; Does not support.

Slowly Changing Dimension.

  • Informatica: Supports Full History, Recent Values, Current & Previous Values.
  • DataStage: Supports only through Custom scripts. Does not have a wizard to do this.

Rejected records.

  • Informatica: Cab be captured.
  • DataStage: Cannot be captured (Cab be captured in a separate file).

Debugging Facility.

  • Informatica: Does not Support.
  • DataStage: Supports basic debugging facilities for testing.

Metadata repository can be stored in RDBMS

  • Informatica: Yes.
  • DataStage: No.

Support And Maintenance: Command line operation.

  • Informatica: Yes (pmcmd).
  • DataStage: Yes (dsjob).
 
Interview Questions Data minining blog