What is a Prepare Instance
This article describes the structure of a Prepare Instance and outlines how to map, cleanse, transform, and consolidate data from an Ingest Instance into a single version of truth in your data warehouse.
The Prepare Instance is the area in Jet Analytics Data Integration where you map data from your Ingest Instance, then cleanse, transform, and consolidate it into a unified data warehouse. It receives raw data from the Ingest Instance and applies business logic to produce clean, structured, and reliable data ready for delivery and reporting.
The Prepare Instance supports various storage types depending on your use case, including SQL Server, Azure Synapse Dedicated SQL Pool, and Snowflake.
Connect to the Prepare Instance in Jet Analytics Data Integration
Once the Jet Analytics Data Integration application is installed and you have signed in, it will connect and list the Prepare Instances that you have access to in the Solution Explorer on the left-hand side of the main window. For more information on how to grant access to instances, see Product Support.
If any changes are made within the portal to instances or data sources, make sure to refresh in Jet Analytics Data Integration by clicking the Refresh button or by selecting View and then Refresh.
Create the Prepare Instance Data Storage within Jet Analytics Data Integration
Before you can deploy and execute tables in a data area within a Prepare Instance, you must create the data storage. To do this, right-click on the instance in Jet Analytics Data Integration, select Edit Instance, and then click Create Storage. Jet Analytics will run the necessary scripts to create a database with the configured name.
Transfer Data from Ingest to Prepare
The Prepare Instance receives data transferred from the Ingest Instance. You can configure how data is copied from the Ingest storage to the Prepare storage using one of the following methods:
- ADO.net: The application-native method, which uses the ADO.net framework on the Ingest server to move data.
- Azure Data Factory: Auto-generates an ADF pipeline to transfer data from the Ingest Instance to the Prepare storage. Recommended when a significant amount of data is transferred frequently or when the Ingest server becomes overloaded. Not supported when using Snowflake storage.
Once data is transferred, the Prepare Instance extracts only the latest valid records from the Ingest storage, filtering out deleted or updated rows flagged by the Ingest Instance.