As part of streamlining our data and Analytics proposition to make it a best-in-class, highly performant system, we have added a Data Warehouse layer to our system architecture.
The Clarissa Data Warehouse aggregates the raw, unstructured data from customer’s back-end platform and end-user client devices along with content metadata to build out sessions for viewing or navigation that will be used to drive the insights and get displayed on the dashboards tools that are also provided. Reducing the number of sources that can access raw logs of end-subscribers is a good privacy-compliant practice.
The Clarissa Data Warehouse is a powerful “single-source-of-truth” data set for operators taking unstructured data from customer’s back-end platform and the end-user client devices to create anonymized, encrypted sessions for viewing, navigation, and quality across the subscriber base. In addition to powering the Clarissa dashboards, the Clarissa Data Warehouse can also provide a normalised dataset for the customer to do further analysis and processing.
Output of Clarissa Data Warehouse
We offer 2 distinct data sets from the Clarissa Data Warehouse – identifiable data sets and aggregate data sets. The identifiable data sets are key to generating the insights and allowing investigation into specific household/s and/or device/s. However, the value of identifiable data diminishes very quickly as what a household or device did last week is not as relevant next month.
However, understand trends is crucial and that’s where the Clarissa aggregate tables comes in to support our customers. Each of the products are mapped to specific aggregate tables that are relevant for the applicable features. There are tables relating to Content Insights (Viewing sessions), Operational Insights (QoE, Error Analytics) and App Insights Navigation and User journeys).