Real time data processing using DataStage
Posted: Thu Jun 04, 2009 12:25 am
Hi,
I have a project requirement wherein I need to process information sent from the source system in real time. For example ,when a new customer registers, this new customers data is sent immmediately to ETL. ETL should then trigger the job and process the file immediately. There could be a case where more than one file appears per minute.
One approach to achieve this could be by using a shell script to check for the arrival of a file. This script will run 24x7 and will invoke an instance of the job as soon as a file arrives.
Is this approach advisable?
However, I have also read about some plugin stages like MQ connector and Webservices stages.
Will these stages provide me with any additional functionality?
What are the advantages of these stages?
How exactly do these stages work?
I have a project requirement wherein I need to process information sent from the source system in real time. For example ,when a new customer registers, this new customers data is sent immmediately to ETL. ETL should then trigger the job and process the file immediately. There could be a case where more than one file appears per minute.
One approach to achieve this could be by using a shell script to check for the arrival of a file. This script will run 24x7 and will invoke an instance of the job as soon as a file arrives.
Is this approach advisable?
However, I have also read about some plugin stages like MQ connector and Webservices stages.
Will these stages provide me with any additional functionality?
What are the advantages of these stages?
How exactly do these stages work?