
Tuesday, 12 May 2015

Write Optimized DSO

Overview

The concept of the Write-Optimized DSO was introduced in BI 7.0. Unlike a Standard DSO, a Write-Optimized DSO has only one relational table, the active table, and no SIDs are generated. As a result, loading data from a DataSource into a Write-Optimized DSO takes less time and uses less disk space.

Business Case

A data store is required for holding data at a detailed level, with immediate reporting or further-update capability. No overwrite functionality is required.

Limitation of Standard DSO

  • A Standard DSO allows you to store information at a detailed level; however, the activation process is mandatory.
  • Reporting or further update is not possible until activation is complete.
     

Write Optimized DSO - Properties

  • Primarily designed for initial staging of source system data.
  • Business rules are only applied when the data is updated to additional InfoProviders.
  • Data is stored in its most granular form.
  • Can be used for faster uploads.
  • Records with the same key are not aggregated but inserted as new records, since every record receives a new technical key.
  • Data is available in the active version immediately for further processing.
  • There is no change log table or activation queue.
  • Data is saved quickly.
  • Data is stored at request level, as in the PSA table.
  • Every record has a new technical key; only inserts are performed.
  • It allows parallel loads, which saves data-loading time.
  • It can be included in a process chain without an activation step.
  • It supports archiving.

Write-Optimized DSO - Semantic Keys

The semantic key identifies error records or duplicate records in the incoming data.
Semantic keys protect data quality: all subsequent records with the same key are written into the error stack along with the incorrect data records.
To process the error or duplicate records, a semantic group is defined in the DTP.
Note: if we are sure there are no incoming duplicate or error records, a semantic group need not be defined.
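The routing described above can be sketched in plain Python (an illustration only, not actual BW code; the record fields and validity rule are invented for the example): once a record with a given semantic key fails, every later record with the same key follows it into the error stack so the whole sequence can be corrected and reprocessed together.

```python
# Sketch of semantic-key routing to the error stack (not BW code).
def load(records, semantic_key, is_valid):
    """records: list of dicts; semantic_key: tuple of field names."""
    active_table, error_stack = [], []
    failed_keys = set()
    for rec in records:
        key = tuple(rec[f] for f in semantic_key)
        if key in failed_keys or not is_valid(rec):
            # bad record, or a follower of a bad record: hold for correction
            failed_keys.add(key)
            error_stack.append(rec)
        else:
            active_table.append(rec)
    return active_table, error_stack

recs = [
    {"doc": "A1", "amount": 100},
    {"doc": "A2", "amount": -1},   # invalid record
    {"doc": "A2", "amount": 50},   # same key -> follows into the error stack
]
ok, errors = load(recs, ("doc",), lambda r: r["amount"] >= 0)
# ok contains only document A1; both A2 records land in the error stack
```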

 

Write-Optimized DSO - Data Flow

1. Construct the data flow model.
2. Create the DataSource.
3. Create the transformation.
4. Create the InfoPackage.
5. Create the DTP.

 

Write-Optimized DSO - Settings

If we do not select the checkbox "Do not check Uniqueness of Data", the data coming from the source is checked for duplicates; i.e. if a record with the same semantic key already exists in the DSO, the current load is terminated.
If we select the checkbox, duplicate records are loaded as new records; the semantic keys have no relevance in this case.
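The effect of this setting can be sketched in plain Python (a conceptual illustration only, not BW code; the field names are invented): with the uniqueness check on, a duplicate semantic key terminates the load, while with the check off the duplicate is simply accepted as a new record.

```python
# Sketch of the "Do not check Uniqueness of Data" setting (not BW code).
def load(existing_keys, records, semantic_key, check_uniqueness=True):
    loaded = []
    for rec in records:
        key = tuple(rec[f] for f in semantic_key)
        if check_uniqueness and key in existing_keys:
            # duplicate semantic key found: the whole load terminates
            raise RuntimeError("duplicate semantic key %r: load terminated" % (key,))
        existing_keys.add(key)
        loaded.append(rec)
    return loaded

dso_keys = {("C1",)}           # a record for customer C1 already exists
try:                            # check enabled: the duplicate aborts the load
    load(dso_keys, [{"cust": "C1"}], ("cust",))
    terminated = False
except RuntimeError:
    terminated = True
# checkbox selected (check disabled): the duplicate is loaded as a new record
accepted = load(dso_keys, [{"cust": "C1"}], ("cust",), check_uniqueness=False)
```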


When is a Write-Optimized DSO Recommended?

  • For faster data loads, DSOs can be configured to be write-optimized.
  • When access to the source system is available only for a short duration.
  • It can be used as a first staging layer.
  • In cases where delta is not enabled in the DataSource, we first load the data into a Write-Optimized DSO, and a delta load can then be done into a Standard DSO.
  • When we need to load large volumes of data into InfoProviders, a WO DSO helps in executing complex transformations.
  • A Write-Optimized DSO can be used to fetch history at request level, instead of going to the PSA archive.

Functionality

  • It contains only one table, the active data table (DSO key: Request ID, Data Package Number and Record Number).
  • It does not have a change log table or an activation queue.
  • Every record in a Write-Optimized DSO has a new technical key, and delta handling works record-wise.
  • In a Write-Optimized DSO, data is stored at request level, as in the PSA table.
  • In a Write-Optimized DSO, no SIDs are generated.
  • Reporting on a Write-Optimized DSO is possible but not good practice, as it affects the performance of the DSO.
  • In a Write-Optimized DSO, BEx reporting is switched off.
  • A Write-Optimized DSO can be included in an InfoSet or MultiProvider.
  • Data load performance is better because there is no activation step involved; the system generates a unique technical key instead.
  • The technical key consists of the Request GUID field (0REQUEST), the Data Package field (0DATAPAKID) and the Data Record Number field (0RECORD).
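The insert-only behaviour that follows from the generated technical key can be sketched in plain Python (an illustration only, not BW code; the customer field and package size are invented): because every record gets a fresh (request, package, record-number) key, two records with the same semantic key never collide, and both are inserted.

```python
# Sketch of the generated technical key and insert-only loading (not BW code).
import uuid

def load_request(active_table, records, package_size=2):
    """Insert records under a generated technical key; never overwrite."""
    request_id = uuid.uuid4().hex               # stands in for 0REQUEST
    for n, rec in enumerate(records):
        tech_key = (request_id,
                    n // package_size,          # stands in for 0DATAPAKID
                    n % package_size)           # stands in for 0RECORD
        assert tech_key not in active_table     # the key is always new
        active_table[tech_key] = rec            # always an insert

table = {}
load_request(table, [{"cust": "C1"}, {"cust": "C1"}, {"cust": "C2"}])
# three rows are stored, including both duplicate 'C1' records
```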

 

Points to Remember

  • Generally a Write-Optimized DSO is not preferred for reporting, but if we want to use it for reporting, it is recommended to define a semantic key in order to ensure the uniqueness of the data.
  • Write-Optimized DSOs can force a check of the semantic key for uniqueness when data is stored.
  • If this option is active and duplicate records (with regard to the semantic key) are loaded, these are logged in the error stack of the Data Transfer Process (DTP) for further evaluation.
  • If we need to use the error stack in our flow, we need to define the semantic key at the DSO level.
  • A semantic group definition is necessary for parallel loads.

Reporting

If we want to use a write-optimized DataStore object in BEx queries (not preferred), it is recommended to:
1. have a semantic key and
2. ensure that the data is unique.
Here the technical key is not visible for reporting, so the object looks like any regular DSO.

Use

Data that is loaded into write-optimized DataStore objects is available immediately for further processing.
They can be used in the following scenarios:

  You use a write-optimized DataStore object as a temporary storage area for large sets of data if you are executing complex transformations for this data before it is written to the DataStore object. The data can then be updated to further (smaller) InfoProviders. You only have to create the complex transformations once for all data.

  You use write-optimized DataStore objects as the EDW layer for saving data. Business rules are only applied when the data is updated to additional InfoProviders.
The system does not generate SIDs for write-optimized DataStore objects and you do not need to activate them. This means that you can save and further process data quickly. Reporting is possible on the basis of these DataStore objects. However, we recommend that you use them as a consolidation layer, and update the data to additional InfoProviders, standard DataStore objects, or InfoCubes.

Structure

Since the write-optimized DataStore object only consists of the table of active data, you do not have to activate the data, as is necessary with the standard DataStore object. This means that you can process data more quickly.
The loaded data is not aggregated; the history of the data is retained. If two data records with the same logical key are extracted from the source, both records are saved in the DataStore object. The record mode responsible for aggregation remains, however, so that the aggregation of data can take place later in standard DataStore objects.
The system generates a unique technical key for the write-optimized DataStore object. The standard key fields are not necessary with this type of DataStore object. If there are standard key fields anyway, they are called semantic keys so that they can be distinguished from the technical keys. The technical key consists of the Request GUID field (0REQUEST), the Data Package field (0DATAPAKID) and the Data Record Number field (0RECORD). Only new data records are loaded to this key.
You can specify that you do not want to run a check to ensure that the data is unique. If you do not check the uniqueness of the data, the DataStore object table may contain several records with the same key. If you do not set this indicator (that is, the uniqueness of the data is checked), the system generates a unique index on the semantic key of the InfoObject. This index has the technical name "KEY".
Since write-optimized DataStore objects do not have a change log, the system does not create deltas (in the sense of a before image and an after image). When you update data into the connected InfoProviders, the system only updates the requests that have not yet been posted.
Use in BEx Queries
For performance reasons, SID values are not created for the characteristics that are loaded. The data is still available for BEx queries. However, in comparison to standard DataStore objects, you can expect slightly worse performance because the SID values have to be created during reporting.
If you want to use write-optimized DataStore objects in BEx queries, we recommend that they have a semantic key and that you run a check to ensure that the data is unique. In this case, the write-optimized DataStore object behaves like a standard DataStore object. If the DataStore object does not have these properties, you may experience unexpected results when the data is aggregated in the query.

 ********************************************************************************

A Write-Optimized DSO is used when a data storage object is required for storing records at the lowest granularity, such as addresses, and when overwrite functionality is not needed. It consists only of the table of active data, so no data activation is necessary, which speeds up data processing. The data store object is available immediately for further processing; it is used as a temporary storage area for large sets of data.
The Write-Optimized DSO has been primarily designed for the initial staging of source system data, from where the data can be transferred to a Standard DSO or an InfoCube.
 
  1. The PSA receives data unchanged from the source system.
  2. Data is posted at document level; after loading into the Standard DSOs, the data is deleted.
  3. Data is posted to the corporate-memory Write-Optimized DSO from the pass-through Write-Optimized DSO.
  4. Data is distributed from the Write-Optimized "pass-through" DSO to Standard DSOs as per business requirements.
Write-Optimized DSO Properties:
  • It is used for initial staging of source system data.
  • Data stored is of the lowest granularity.
  • Data loads can be faster since there is no separate activation step.
  • Every record has a technical key, so records are never aggregated; new records are inserted every time.
Creation Of Write-Optimized DSO:
Step 1)
  1. Go to transaction code RSA1
  2. Click the OK button.
Step 2)
  1. Navigate to the Modelling tab -> InfoProvider.
  2. Right-click on the InfoArea.
  3. Click on "Create DataStore Object" in the context menu.
Step 3)
  1. Enter the Technical Name.
  2. Enter the Description.
  3. Click on the “Create” button.
Step 4)
Click on the Edit button of “Type of DataStore Object”.
Step 5)
Choose the Type “Write-Optimized”.
Technical keys include the Request ID, Data Package and Record Number. No additional objects can be included under this.
Semantic keys are similar to key fields; however, here uniqueness is not used for overwrite functionality. They are instead used in conjunction with the setting "Do not check uniqueness of data".
The purpose of the semantic key is to identify error records or duplicate records in the incoming data.
Duplicate records are written into the error stack in subsequent order. These records in the error stack can be handled or re-loaded by defining a semantic group in the DTP.
Semantic groups need not be defined if there is no possibility of duplicate or error records.
If we do not select the checkbox "Allow Duplicate Data Record", the data coming from the source is checked for duplicates, i.e. if a record with the same semantic key already exists in the DSO, the current load is terminated.
If we select the checkbox, duplicate records are loaded as new records; the semantic keys have no relevance in this case.
Step 6)
Activate the DSO.

Monday, 27 April 2015

Introduction to SAP BI

Business Intelligence (BI) is an application used to give meaning to an organization's raw data. The raw data is cleansed, stored and enriched with business logic so that enterprise users can make better business decisions. This data can be presented in the form of reports and displayed as tables, charts, etc., which makes it easier to analyse and to base business decisions on.
During all business activities, companies create data about customers, suppliers and internal activities. Based on this data, employees of various departments such as HR, Finance, Accounting and Marketing prepare their work plans.

Business Intelligence spans a varied set of tools: the data warehouse consolidates and loads the data from the different source systems, while reporting tools such as Query Designer, Web Application Designer and Analyzer are used to create reports which display the data consolidated by the data warehouse for analysis.
SAP BI is an SAP product which focuses on providing its customers/organizations with a user-friendly and useful representation of data that helps with analysis and business decision making.

In summary, Business Intelligence tools transform raw data into reports which are used for decision making and business forecasting.

Why do we need Datawarehouse & BI ?

Organizations have different kinds of data such as finance, human resource, customer and supplier data, which can be stored in different kinds of storage units such as a DBMS, Excel sheets, SAP R/3 systems, etc. Even the company's internal data is often distributed across many different systems and is not particularly well formatted.
A data warehouse helps to organize this data. It brings together heterogeneous data sources, which mostly differ in their level of detail. Using BI tools, one can then derive meaningful reports.

What makes SAP BI more effective BI tool?

  • Single point of access to all information is possible through BI. The data from various sources can be accessed in a single place (i.e. BI).
  • Data collected from various sources is presented in the form of reports, which makes high-level analysis of the data efficient.
  • SAP BI provides an easy-to-use GUI and better formatting.
  • Some of the key functionality that sets SAP BI apart is its ability to analyze multidimensional data sources in both web and MS Office environments, flexible dashboards, mobility and a flexible, scalable BI platform.
  • SAP BI is known for its strong query performance while requiring little administration.
  • Mobile BI for end users on the go.
  • Easy integration with other platforms.
SAP BI / Data Warehouse vs. OLTP Systems:
OLTP (Online Transaction Processing):
These systems hold detailed day-to-day transaction data which keeps changing. For example, R/3 or any other operational database.
OLAP (Online Analytical Processing):
These systems hold data for analysis purposes. Their input comes from the OLTP systems; the data from the OLTP systems is used to prepare the data for analysis.
Business Intelligence is an OLAP system.


OLTP Systems (Operative Environment) vs. DWH/OLAP Systems (Informative Environment):

  • Target: efficiency through automation of business processes vs. generating knowledge (competitive advantage)
  • Priorities: high availability, higher data volume vs. simple-to-use, flexible access to data
  • View of data: detailed vs. frequently aggregated
  • Age of data: current vs. historical
  • Database operations: add, modify, delete, update and read vs. read only
  • Typical data structures: relational (flat tables, high normalization) vs. multidimensional structures
  • Integration of data from various modules/applications: minimal vs. comprehensive
  • Dataset: 6-18 months vs. 2-7 years
  • Archiving: yes vs. yes