Some useful tips of how to debug parallel jobs in datastage.
Enable the following environment variables in DataStage Administrator:
* APT_PM_PLAYER_TIMING – shows how much CPU time each stage uses
* APT_PM_SHOW_PIDS – show process ID of each stage
* APT_RECORD_COUNTS – shows record counts in log
* APT_CONFIG_FILE – switch configuration file (one node, multiple nodes)
* OSH_DUMP – shows OSH code for your job. Shows if any unexpected settings were set by the GUI.
* APT_DUMP_SCORE – shows all processes and inserted operators in your job
* APT_DISABLE_COMBINATION – do not combine multiple stages in to one process. Disabling this will make it easier to see where your errors are occurring.
Use a Copy stage to dump out data to intermediate peek stages or sequential debug files. Copy stages get removed during compile time so they do not increase overhead.
Use row generator stage to generate sample data.
Look at the phantom files for additional error messages: c:\datastage\project_folder\&PH&
To catch partitioning problems, run your job with a single node configuration file and compare the output with your multi-node run. You can just look at the file size, or sort the data for a more detailed comparison (Unix sort + diff commands).
Translate to your Language
Labels
- BI
- Big Data
- BO
- BO Universe Context
- BusinessObjects Context
- BusinessObjects Universe Context
- Cognos
- Command Line DataStage Job Export
- DataStage
- DataStage Command Line Compile
- DataStage Job Compile in command line
- DataStage SCD
- DataStage Server JobCompile Script
- DB
- DW
- DW-SCD
- Optimizing BO Universe and Reports
- Optimizing Business Objects Universe and Reports
- Other
- Q&A
- SAP BO
- SCD
Disclaimer Statement
Total Pageviews
Category
- BI
- Big Data
- BO
- BO Universe Context
- BusinessObjects Context
- BusinessObjects Universe Context
- Cognos
- Command Line DataStage Job Export
- DataStage
- DataStage Command Line Compile
- DataStage Job Compile in command line
- DataStage SCD
- DataStage Server JobCompile Script
- DB
- DW
- DW-SCD
- Optimizing BO Universe and Reports
- Optimizing Business Objects Universe and Reports
- Other
- Q&A
- SAP BO
- SCD
Track
Follow us on FaceBook
About
Powered by Blogger.
Category
- BI
- Big Data
- BO
- BO Universe Context
- BusinessObjects Context
- BusinessObjects Universe Context
- Cognos
- Command Line DataStage Job Export
- DataStage
- DataStage Command Line Compile
- DataStage Job Compile in command line
- DataStage SCD
- DataStage Server JobCompile Script
- DB
- DW
- DW-SCD
- Optimizing BO Universe and Reports
- Optimizing Business Objects Universe and Reports
- Other
- Q&A
- SAP BO
- SCD
Contributors
Blog Archive
-
▼
2010
-
▼
September
- Informatica PowerCenter 7 Architecture and Perform...
- Informatica PowerCenter 7 Architecture and Perform...
- DataStage Performance Tuning
- DataStage Performance Tuning
- BusinessObjects Enterprise XI session timeout
- BusinessObjects Enterprise XI session timeout
- 10 Don't Do on DataStage
- 10 Don't Do on DataStage
- The Deciding factors for ETL Tools
- The Deciding factors for ETL Tools
- End-to-End Data Governance Webinar by RalphKimbal
- End-to-End Data Governance Webinar by RalphKimbal
- Business Objects Tips and Tricks
- Business Objects Tips and Tricks
- Business Objects Tips and Tricks
- DataStage Convert 32bit(2GbHash Limit) Hash to 64bit
- DataStage Convert 32bit(2GbHash Limit) Hash to 64bit
- DataStage Convert 32bit(2GbHash Limit) Hash to 64bit
- DataStage tip for beginners
- DataStage tip for beginners
- Debugging DataStage parallel jobs
- Debugging DataStage parallel jobs
- Data Generation Using DataStage
- Data Generation Using DataStage
- Data Generation Using DataStage
- DataStage and Slowly Changing Dimensions
- DataStage and Slowly Changing Dimensions
- DataStage and Slowly Changing Dimensions
- DOS Batch Script to Export DataStage Jobs Automati...
- DOS Batch Script to Export DataStage Jobs Automati...
- DataStage Server JobCompile Script
- DataStage Server JobCompile Script
- BusinessObjects Context
- BusinessObjects Context
- Optimizing Business Objects Universe and Reports
- Optimizing Business Objects Universe and Reports
- Optimizing Business Objects Universe and Reports
-
▼
September
Popular Posts
-
Netezza dosen't have any isnumeric check function, so in order to find out whether the column has non-numeric, use the following logic n...
-
If you have table like below GROUP_NAME GROUP_ID PASS_FAIL COUNT GROUP1 5 FAIL 382 GROUP...
-
@echo off :: ----------------------------------------------------------------- :: DataStageExport.bat :: -----------------------------------...
-
Netezza has 3 internal planner Fact Relationship Planner (factrel_planner) Snowflake Planner Star Planner In the course of a query pla...
-
What is Big Data? Big data is data that exceeds the processing capacity of traditional database systems. The data is too big, moves too fast...
-
Source :-dwh-ejtl.blogspot.com Q1) Tell me what exactly, what was your role? A1) I worked as ETL Developer. I was also involved in requireme...
-
One of the regular viewer of this blog requested me to explain the important's of the ETL mapping document. What is ETL Mapping Document...
-
I am sure most of you heard about market buzz words nosql,newsql... and it often make our DW developers to get confused on this new terms.Mo...
-
In data warehousing, a conformed dimension is a dimension that has the same meaning to every fact table in the structure. Conformed dim...
-
This note assumes some familiarity with the DataStage transformation engine. DataStage is normally used to process multiple input files and/...
Sunday, September 26, 2010
Debugging DataStage parallel jobs
by Unknown |  at 8:01 PM
- Netezza ISNUMERIC Data Check Logic
- How to do PIVOT in netezza SQL
- Datastage DSX Files Export Script
- Netezza Optimizer Parameters
- Big Data and Hadoop Questions and Answers
- Informatica Job Interview Question & Answers
- What is ETL Mapping Document ?A Real Time Example
- NOSQL 101
- Conformed Dimensions with example
- Data Generation Using DataStage
0 comments: