Integration of metadata from data platforms to enable traceability of report information back to systems of record for a large US healthcare company

Client: A large US Healthcare company

Business Problem: The client wanted to provide data lineage/traceability from their Cognos dashboards through to their systems-of-record.


Lucid extended the Collibra metamodel to represent BI metadata, including cubes and filters. Collibra Connect templates were developed to:

  • Extract metadata on dashboards, reports, cubes and framework models from IBM Cognos using the Cognos SDK, and parse the report catalog XML to obtain report label and sort information
  • Extract metadata from Netezza using SQL queries against system tables
  • Extract metadata from the Informatica PowerCenter repository and ‘stitch’ the various elements together to derive summarized lineage for loading into Collibra DGC
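
The ‘stitching’ step can be pictured as collapsing hop-by-hop source-to-target edges into end-to-end pairs. The following is a minimal Python sketch of that idea only, not the Collibra Connect template built for the client; the function name `summarize_lineage`, the edge list and the asset names are all hypothetical.

```python
from collections import defaultdict


def summarize_lineage(edges):
    """Collapse hop-by-hop (source, target) edges into end-to-end pairs.

    An origin is an asset with no upstream edge; a final target is an asset
    with no downstream edge. Returns a sorted list of (origin, final_target).
    """
    downstream = defaultdict(set)
    has_upstream = set()
    nodes = set()
    for src, tgt in edges:
        downstream[src].add(tgt)
        has_upstream.add(tgt)
        nodes.update((src, tgt))

    summary = set()
    for origin in nodes - has_upstream:
        # Depth-first walk from each origin; 'seen' guards against cycles.
        stack, seen = [origin], {origin}
        while stack:
            node = stack.pop()
            nexts = downstream.get(node, ())
            if not nexts and node != origin:
                summary.add((origin, node))
            for nxt in nexts:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
    return sorted(summary)


# Hypothetical column-level hops: source system -> staging -> warehouse -> report
edges = [
    ("SRC.CLAIMS.AMOUNT", "STG.CLAIMS.AMOUNT"),
    ("STG.CLAIMS.AMOUNT", "DW.FACT_CLAIM.PAID_AMT"),
    ("DW.FACT_CLAIM.PAID_AMT", "Report.Paid Amount"),
]
print(summarize_lineage(edges))
```

In this sketch the intermediate staging and warehouse hops drop out, leaving a single summarized relation from the source column to the report item.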

There were also breaks in the lineage caused by stored procedures and views against legacy databases. External design metadata had to be leveraged to supplement the metadata from the core systems.
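
One way to picture a lineage break, and how supplemental design metadata closes it, is sketched below in Python. This is an illustrative assumption rather than the project's actual logic: the helper `find_breaks`, the asset names and the set of systems-of-record are hypothetical.

```python
def find_breaks(edges, systems_of_record):
    """Return assets with no upstream edge that are not systems-of-record."""
    targets = {tgt for _, tgt in edges}
    nodes = {node for edge in edges for node in edge}
    return sorted(n for n in nodes - targets if n not in systems_of_record)


# Hypothetical harvested lineage: the view hides its legacy source table.
harvested = [("DW.V_MEMBER.ID", "Report.Member ID")]
# Hypothetical edge supplied from external design documents.
supplemental = [("LEGACY.MEMBER.MBR_ID", "DW.V_MEMBER.ID")]
systems_of_record = {"LEGACY.MEMBER.MBR_ID"}

print(find_breaks(harvested, systems_of_record))
print(find_breaks(harvested + supplemental, systems_of_record))
```

The first call reports the view column as a break because nothing upstream of it was harvested; once the supplemental edge is merged in, the chain ends at a system-of-record and no breaks remain.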

Custom workflows were developed to manage the load of supplemental design metadata, supplied as Excel sheets based on custom templates, and to support review and approval before publishing to the final shared metadata storage area in DGC.
Custom workflows were also developed to enable write-back of definitions from DGC Glossary to the linked framework model assets in the Cognos Framework Model.

Technology: Collibra DGC ver 5.0.2, Collibra Connect with Collibra DGC Connector v1.3, IBM Cognos ver 10.2.2, Informatica PowerCenter ver 9.6.1, IBM Netezza ver 7.2.1


About the Author: Site Admin