Hi @AnilKumarPoda - There are numerous ETL tool available that can be used like SSIS, Azure Data Factory, Informatica, DOMO etc. Each one has their own function. ETL process will generally involve 2-3 stage but it can be expanded as per specific Architecture.
Staging Layer - Containing Raw Data,
Intermediate layer - Containing Merged Data,
Final Layer - For consumption into Reports.
@AnilKumarPoda
The easiest way to understand how ETL works is to understand what happens in each step of the process.
Extract
During data extraction, raw data is copied or exported from source locations to a staging area. You can extract data from a variety of data sources, which can be structured or unstructured. Those sources include but are not limited to:
In the staging area, the raw data undergoes data processing. Here, the data is transformed and consolidated for its intended analytical use case. This phase can involve the following tasks:
Filtering, cleansing, de-duplicating, validating, and authenticating the data.
Performing calculations, translations, or summarizations based on the raw data. This can include changing row and column headers for consistency, converting currencies or other units of measurement, editing text strings, and more.
Conducting audits to ensure data quality and compliance
Removing, encrypting, or protecting data governed by industry or governmental regulators
Formatting the data into tables or joined tables to match the schema of the target data warehouse.
Load
In this last step, the transformed data is moved from the staging area into a target data warehouse. Typically, this involves an initial loading of all data, followed by periodic loading of incremental data changes and, less often, full refreshes to erase and replace data in the warehouse.