ETL HighlightsThe ETL (or
Extract Transform Load) modules is one of the most
challenging aspects of a data warehousing project.
Many data warehousing projects stall at this stage.
ZogoTech's approach is much simpler than others and
helps ensures a successful deployment.
Extraction
In this phase, the Estudias ETL Manager
retrieves information from your student information
system (Datatel Colleague, SCT Banner, PeopleSoft, etc)
and puts it into text files used in the Load and
Transform stages.
Transformation (Pre-load)
In this stage, minor cleanups (such as converting the
semester formats to a common format) are applied to the
downloaded files. Most transformations are
actually done after the information is loaded, so this
is sometimes called ELT rather than ETL.
-
Files are merged to
remove duplicates and to make sure that more
recent records take
precedence over older records.
-
The database schema
is examined and any missing schema elements
(domains, function, tables, and indexes) are
created.
-
Record level cleansing is performed and CSV
files that can be bulk-loaded are created.
Load
At this stage, information is loaded into the
Estudias database.
-
Preprocesses common queries to improve
performance and scalabilit
-
Table and database
level cleansing is performed via a series of SQL
statements.
-
Creates, designs and
processes OLAP Cubes.
Transformation (Post-load)
Running the transformations after the
database is loaded means it's easy to add
new transformations using standard SQL
rather than a proprietary language.
Also see: Detailed look at ZogoTech's
Open
Architecture
|
|
|
|