
Error Handling

Identifying errors and defining an error handling strategy is a critical part of any ETL design.

The two types of errors in an ETL process are Data Errors and Process Errors.

To handle data errors we can use the Row Error Logging feature. The errors are captured into error tables, where they can be analysed, corrected and reprocessed.

To handle process errors we can configure an email task to send a notification whenever a session fails.

Row Error Logging: When a session is configured with this option, the Integration Service logs error information to relational tables or to an error log file. On the first run it creates the table or file; from then on it appends to the existing table or file. The error log contains information such as the source name, row ID, row data and transformation error code, which can be used to determine the cause and source of an error.

By default the Integration Service does not write the dropped rows to the session log or create a reject file, so verbose tracing can be enabled to write them to the session log. Performance decreases because rows are processed one at a time.
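Once the row errors land in relational tables, they can be analysed with plain SQL. The sketch below uses a hypothetical, simplified error table called ETL_ROW_ERRORS with illustrative column names; in Informatica the actual relational error log tables are the PMERR_* tables, whose exact structure should be taken from the product documentation.

    -- Summarise logged row errors by source and error code, so the most
    -- frequent causes can be analysed and corrected first.
    SELECT source_name,
           transformation_error_code,
           COUNT(*) AS error_rows
    FROM   etl_row_errors            -- illustrative table name
    GROUP BY source_name, transformation_error_code
    ORDER BY error_rows DESC;

    -- Pull the full row data for one error code so the rows can be
    -- corrected and reprocessed.
    SELECT row_id, row_data, error_message
    FROM   etl_row_errors
    WHERE  transformation_error_code = :err_code;   -- bind the code under investigation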

Archive for the ‘ETL Exception & Error Handling’ Category

For every rule there is an exception; for each exception there are more exceptions…

To implement an ETL process there are many steps to follow. One such step is creating a mapping document. This mapping document describes the data mapping between the source systems and the target, and the rules of data transformation.

For example: the table/column map between source and target, rules to identify unique rows, not-null attributes, unique values, ranges of attributes, transformation rules, etc.

Without going into further details of the document, let's analyze the very next step. It seems obvious and natural to start development of the ETL process. The ETL developer is all fired up, comes up with a design document and starts developing, and in a few days the code is ready for data loading.
But unexpectedly (?) the code starts having issues every few days. Issues are found and fixed. And then it fails again. What's happening? The analysis was done properly; rules were chalked out and implemented according to the mapping document. But why are issues popping up? Was something missed?
Maybe not! Isn't it normal to have more issues in the initial lifetime of a process?
 
Maybe yes! You have surely missed 'Source System Data Profiling'. The business analyst has told you the rules about how the data is structured in the source system and how it is supposed to behave; but he or she has not told you the 'ifs and buts', the EXCEPTIONS to those rules.
 
To be realistic, it is not possible for anyone to simply recite all the rules and exceptions to you like a parrot. You have to collaborate and dig out the truth. The choice is yours: either do data profiling on the source system and try to break every rule the analyst has told you, or wait for the process to go live and then wake up every night as the load fails. If you are lucky, you only have to deal with an unhappy user every morning when you get to the office.
 
Make the right choice; don't skip 'source system data profiling' before actually writing a single line of code. Question every rule. Try to find exceptions to the rules. Suppose there are at least 20 tables; each table has 30 columns on average, and each column holds around 100k values on average. The matrix of tables * columns * data values gives you the number of ways your assumptions may be wrong. It is like unit testing the source data even before loading it. There is a reason why machines alone cannot do your job; there is a reason why IT jobs pay more.
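A few ad hoc queries go a long way here. A minimal profiling sketch in SQL, assuming a hypothetical source table CUSTOMER and a COUNTRY_CODE column that the analyst claims is never null and always two characters:

    -- How many rows actually break the 'never null, always two characters' rule?
    SELECT COUNT(*)                                                   AS total_rows,
           SUM(CASE WHEN country_code IS NULL THEN 1 ELSE 0 END)      AS null_codes,
           SUM(CASE WHEN LENGTH(country_code) <> 2 THEN 1 ELSE 0 END) AS bad_lengths
    FROM   customer;

    -- Which distinct values occur, and how often?
    SELECT country_code, COUNT(*) AS occurrences
    FROM   customer
    GROUP BY country_code
    ORDER BY occurrences DESC;

Run queries like these for every rule in the mapping document; every non-zero violation count is an exception you would otherwise have discovered in production.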
 
Remember, ‘for every rule there is an exception; for each exception there are more exceptions…’

Posted in Data Profiling, Data Quality, ETL Exception & Error Handling, Source System
Analysis, Uncategorized | 1 Comment »

ETL Strategy to store data validation rules

Every time there is movement of data, the results have to be tested against the expected results. For every ETL process, test conditions for validating data are defined before or during the design and development phase itself. Some that are missed can be added later on.

Various test conditions are used to validate data when the ETL process is migrated from DEV to QA to PRD. These test conditions may exist only in the developer's or tester's mind, or be documented in Word or Excel. With time the test conditions get lost, ignored or scattered around too much to be really useful.

In production, the ETL process running successfully without error is a good thing, but it does not really mean much by itself. You still need rules to validate the data processed by the ETL. At this point you need the data validation rules again!

A better ETL strategy is to store the ETL business rules in a RULES table, keyed by target table and source system. The rules can be stored as SQL text. This creates a repository of all the rules in a single location, which can be called by any ETL process or auditor at any phase of the project life cycle.

There is also no need to re-write or rethink rules. Any or all of these rules can be made optional, tolerances can be defined, and the rules can be called immediately after the process runs or the data can be audited at leisure.
This data validation/auditing system will basically contain:
a table that contains the rules,
a process to call them dynamically, and
a table to store the results from the execution of the rules (a minimal sketch follows below).
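A minimal sketch of such a system in SQL, using illustrative (hypothetical) table and column names; the real design would be adapted to the warehouse at hand:

    -- Repository of validation rules, stored as SQL text per source system / target table.
    CREATE TABLE etl_validation_rules (
        rule_id        INTEGER PRIMARY KEY,
        source_system  VARCHAR(30),
        target_table   VARCHAR(30),
        rule_desc      VARCHAR(200),
        rule_sql       VARCHAR(4000),      -- query that returns the number of violations
        tolerance      INTEGER DEFAULT 0,  -- violations allowed before the rule fails
        is_active      CHAR(1) DEFAULT 'Y'
    );

    -- Results from each execution of the rules.
    CREATE TABLE etl_validation_results (
        rule_id        INTEGER,
        run_date       DATE,
        violation_cnt  INTEGER,
        passed_flag    CHAR(1)
    );

    -- Example rule: the orders fact loaded from the CRM feed must never
    -- contain a null customer key.
    INSERT INTO etl_validation_rules
    VALUES (1, 'CRM', 'FACT_ORDERS',
            'No null customer keys after load',
            'SELECT COUNT(*) FROM fact_orders WHERE customer_key IS NULL',
            0, 'Y');

The calling process simply reads the active rules, executes each rule_sql dynamically, compares the returned count against the tolerance and writes a row into etl_validation_results.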

Benefits:

Rules can be added dynamically with no change to the code.

Rules are stored permanently.

Tolerance levels can be changed without ever changing the code.

Business rules can be added or validated by business experts without worrying about the ETL code.

NOTE: This post is applicable to all ETL tools and databases, such as Informatica, DataStage, Syncsort DMExpress, Sunopsis, Ab Initio, SQL Server Integration Services (SSIS)/DTS, Oracle, Sybase, MS SQL Server, RDB, etc.

Posted in ETL Exception & Error Handling, ETL Testing, Uncategorized | 1 Comment »

Introduction to error and exception management.

Monday, July 3rd, 2006


ETL is all about the transportation, transformation and organization of data. Any time something moves (in fact, even if you are perfectly stationary and things around you move), accidents are bound to happen. So any ETL specialist who believes that their code is perfect and nothing can happen obviously lives in a fool's paradise.

The next obvious thing is to design for managing accidents, like making a safer car or factory. As an ETL specialist, if you don't do it you are no different than anyone else. As in any country, there are laws for accidents and for accidents caused by criminal negligence; the latter being the worse.

How many times have I seen people put ETL code into production without actually designing processes to prevent, manage or report accidents. Writing code is one thing; writing production-worthy code is another. Do ask yourself or your developers, "Is the code production worthy?"

Next, the basic definitions:

ERRORS: A programmatic error that causes the program to fail or makes it run for an uncontrolled length of time.
EXCEPTIONS: Program code written to handle expected or unexpected errors gracefully, so that the program either continues to run by logging the error and bypassing the erroneous condition, or logs the error and exits gracefully with an error message.
A more detailed description will come with the topic 'Unhandled exceptions result in Errors'.
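As a set-based SQL illustration of "logging the error and bypassing the erroneous condition", assuming hypothetical STG_ORDERS, FACT_ORDERS and ETL_LOAD_ERRORS tables:

    -- Log the rows that would break the load (here: missing or negative amounts)...
    INSERT INTO etl_load_errors (src_table, row_key, error_desc, logged_at)
    SELECT 'STG_ORDERS', order_id, 'NULL OR NEGATIVE AMOUNT', CURRENT_TIMESTAMP
    FROM   stg_orders
    WHERE  order_amount IS NULL OR order_amount < 0;

    -- ...and load only the rows that satisfy the rule, so the process keeps
    -- running gracefully instead of failing on the bad data.
    INSERT INTO fact_orders (order_id, order_amount)
    SELECT order_id, order_amount
    FROM   stg_orders
    WHERE  order_amount IS NOT NULL AND order_amount >= 0;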

Note: The topic of errors and exceptions is relevant to Informatica, DataStage, Ab Initio, Oracle Warehouse Builder, PL/SQL, SQL*Loader (SQLLDR), Transact-SQL or any other ETL tools.

Posted in ETL Exception & Error Handling, ETL Strategy | No Comments »

Multiple executions of an ETL process against the same set of data.

Every ETL designer, developer and tester should always ask this question: "What will happen if I run the ETL process multiple times against the same data set?"

Answer 1: I get the same result set.

Answer 2: I get multiple result sets.

If you go back to the original article, What is ETL & What ETL is not!, you will immediately come to the conclusion that Answer 2 is incorrect, as ETL is not allowed to create data.

Why would the process run more than once against the same set of data? For many reasons, the most common being an operator's mistake, an accidental kickoff, an old data file remaining in the directory, a staging table loaded more than once, an intentional rerun of the ETL process after correcting some data in the source data set, and so on. Without going into further detail, I would advise ETL folks to always build into the process ways to prevent this, using one or a combination of the following methods:
1. Identify the primary key (logical/physical) and use update-else-insert logic (see the sketch after this list).
2. Delete the target data set before processing again (based on the logical/physical primary key).
3. Prevent multiple runs by flagging processed dates.
4. Mark processed records with processed flags after commit.
5. Prevent multiple loads into the staging area.
6. Identify duplicate records in the staging area before the data gets processed.
7. more…
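Method 1, update-else-insert, maps naturally onto a SQL MERGE. A minimal sketch, assuming hypothetical STG_CUSTOMER and DIM_CUSTOMER tables keyed on customer_id:

    -- Rerunning the load against the same staging data updates the existing
    -- rows instead of inserting duplicates.
    MERGE INTO dim_customer tgt
    USING stg_customer src
    ON (tgt.customer_id = src.customer_id)
    WHEN MATCHED THEN
        UPDATE SET tgt.customer_name = src.customer_name,
                   tgt.last_update   = CURRENT_TIMESTAMP
    WHEN NOT MATCHED THEN
        INSERT (customer_id, customer_name, last_update)
        VALUES (src.customer_id, src.customer_name, CURRENT_TIMESTAMP);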

So do this experiment in the development or test environment: run the ETL process more than once and check the result! If you get result 2 (copies of rows, with no way to distinguish or retire the old rows), the designer or the developer is wrong, and if the process has passed QA or testing, then the tester is wrong as well.
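A quick way to check for result 2 after the repeated run is a duplicate count on the business key (using the hypothetical dim_customer table from the sketch above):

    -- Any business key appearing more than once, with no way to tell the
    -- copies apart, means the process is not rerun-safe.
    SELECT customer_id, COUNT(*) AS copies
    FROM   dim_customer
    GROUP BY customer_id
    HAVING COUNT(*) > 1;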

Bottom line:
A test case that checks multiple runs is a must in the life cycle of an ETL process.
