need some info ON .done file

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

India2000
Participant
Posts: 274
Joined: Sun Aug 22, 2010 11:07 am

need some info ON .done file

Post by India2000 »

I have a scenario where the ETL process uses the remove command to delete a .done file and then triggers the next job in the sequence, which creates a new .done file before loading into the table. Why and when do we use the .done command? This load depends on a job that processes files from another ETL project. Is it used to check whether the previous ETL process has completed? I want to know more about this. Please help.
qt_ky
Premium Member
Posts: 2895
Joined: Wed Aug 03, 2011 6:16 am
Location: USA

Post by qt_ky »

I don't think .done is a command but it could be a naming convention someone developed. It sounds like you may have to trace through the jobs and scripts and events that create the trigger files to understand it better.
Choose a job you love, and you will never have to work a day in your life. - Confucius
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Sounds like a pretty standard semaphore technique but you're the only one with access to 'the code' and thus the ability to track it down. They're typically used to signal that a file (or set of files) has transferred completely. Rather than polling for the actual file and risking processing it before it has (or they have) finished transferring, an empty semaphore file is sent last and is what is actually polled for; when it arrives it gets deleted and the 'real' files are processed.
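
For illustration, the sending side of this technique might look roughly like the sketch below. It is only an assumption about how such a process could be written, not the code from the poster's project: the function name `publish` and the file name `orders.csv` are hypothetical. The point it shows is the ordering, where the data file is fully written and closed before the empty semaphore file is created.

```python
from pathlib import Path


def publish(data_path: Path, payload: bytes, suffix: str = ".done") -> Path:
    """Write the data file completely, then create the empty semaphore file last.

    `publish` and the default ".done" suffix are illustrative assumptions.
    """
    # write_bytes opens, writes, and closes the 'real' file before returning.
    data_path.write_bytes(payload)
    # Same name as the data file with the fixed extension appended.
    marker = data_path.with_name(data_path.name + suffix)
    # An empty file: its arrival is the signal, its content does not matter.
    marker.touch()
    return marker
```

A consumer that only ever looks for `orders.csv.done` will therefore never see a half-written `orders.csv`.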

Many times the semaphore or 'trigger' files have the same name as their matching transfer file, just with a fixed extension (like .done) appended to it. You can poll for "*.done" files and when one is found strip off the extension to know which file has arrived.
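
A minimal sketch of that polling idea, assuming the trigger file is named after the data file with ".done" appended. The function names, the landing directory, and the polling interval are hypothetical and would need to match whatever the real jobs and scripts use.

```python
import time
from pathlib import Path

SUFFIX = ".done"  # assumed trigger-file extension


def claim_ready_files(landing_dir: Path) -> list[Path]:
    """Return data files whose trigger file has arrived, deleting each trigger."""
    ready = []
    for marker in sorted(landing_dir.glob("*" + SUFFIX)):
        if marker.name == SUFFIX:
            continue  # a bare ".done" has no matching data file name
        # Strip the extension to learn which file has arrived.
        data_file = marker.with_name(marker.name[: -len(SUFFIX)])
        if data_file.exists():
            marker.unlink()  # delete the semaphore so the file is claimed once
            ready.append(data_file)
    return ready


def poll(landing_dir: Path, interval_s: float = 30.0, max_polls: int = 10) -> list[Path]:
    """Check for trigger files up to max_polls times; return [] if none arrive."""
    for _ in range(max_polls):
        ready = claim_ready_files(landing_dir)
        if ready:
            return ready
        time.sleep(interval_s)
    return []
```

The data file itself is never polled for, so a file that is still being transferred is simply ignored until its trigger shows up.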
-craig

"You can never have too many knives" -- Logan Nine Fingers
kommven
Charter Member
Posts: 125
Joined: Mon Jul 12, 2004 12:37 pm

Post by kommven »

With the parallel capabilities of today's hardware and software, a file can be read and written at the same time, and this can lead to a dirty read.

To avoid such situations, .done files are used.
On some platforms, header and trailer records are used.
In most ER applications, header and detail tables with joinable records are used.
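
As a rough illustration of the header/trailer alternative, the sketch below checks that a file is complete before it is processed. The record layout is purely an assumption made for the example (pipe-delimited, a header starting with `H|`, a trailer of the form `T|<detail count>`); real layouts vary by platform, and `is_complete` is a hypothetical name.

```python
def is_complete(lines: list[str]) -> bool:
    """True if a header and trailer are present and the trailer count matches.

    Assumed layout: first line "H|...", last line "T|<number of detail rows>".
    """
    if len(lines) < 2:
        return False  # cannot hold both a header and a trailer
    header, *details, trailer = lines
    if not header.startswith("H|") or not trailer.startswith("T|"):
        return False  # trailer missing: the file is probably still being written
    try:
        expected = int(trailer.split("|")[1])
    except ValueError:
        return False  # trailer present but its count is not a number
    return expected == len(details)
```

A reader that sees no trailer, or a count that disagrees with the rows received, can treat the file as partial and retry later.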

These architectural solutions are engineered according to the usage and the atomicity required at the row, table, file, or application level...