
Azure Data Factory

This page walks through the setup of Azure Data Factory in K using the direct connect method.

Integration details

Scope                   Included  Comments
Metadata                YES       See below
Lineage                 YES
Usage                   YES
Sensitive Data Scanner  N/A

Known limitations

  • Not all sources and destinations are included in the metadata extraction. Improvements are planned to provide wider coverage.

  • Sources implemented:

    • SNOWFLAKE


Step 1) Enabling Azure Data Factory Admin APIs to be accessible to an AD Group

This step is performed by the Azure Data Factory Admin

  • Under Azure services click on Data factories

 

  • Locate the Data Factory that you would like to connect to K

  • Click on Overview to copy the below details for a later step:

    • Factory name

    • Resource group name

    • Subscription ID
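The three values copied above are the ones Azure Resource Manager uses to address a Data Factory. As a minimal sketch, they could be recorded and sanity-checked as follows. The class name `DataFactoryDetails` and the placeholder values are hypothetical and are not part of K; only the resource ID layout comes from Azure.

```python
import uuid
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFactoryDetails:
    """The three values copied from the Data Factory Overview page."""

    factory_name: str
    resource_group: str
    subscription_id: str

    def __post_init__(self) -> None:
        # Subscription IDs are GUIDs; uuid.UUID raises ValueError on a malformed value.
        uuid.UUID(self.subscription_id)
        if not self.factory_name or not self.resource_group:
            raise ValueError("factory_name and resource_group must not be empty")

    def resource_id(self) -> str:
        """Return the Azure Resource Manager ID of the Data Factory."""
        return (
            f"/subscriptions/{self.subscription_id}"
            f"/resourceGroups/{self.resource_group}"
            f"/providers/Microsoft.DataFactory/factories/{self.factory_name}"
        )


# Placeholder values for illustration only.
details = DataFactoryDetails(
    factory_name="example-factory",
    resource_group="example-rg",
    subscription_id="00000000-0000-0000-0000-000000000000",
)
print(details.resource_id())
```

Keeping the values together like this makes it harder to mix up details when more than one Data Factory is being connected.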

       


Step 2) Registering Azure Data Factory App in Azure AD

This step is performed by the Azure AD Admin


Step 3) Update your Azure Data Factory access control

To ensure K can connect to your Azure Data Factory, you will need to give the Service Application the correct role assignment on the Data Factory.

  • Follow Step 1 to navigate to the Data Factory you wish to profile. You will need to perform the following steps for each Data Factory you wish to profile.

    Open a Data Factory

  • Click on Access control (IAM) in the panel and click Add

 

 

  • Select Data Factory Contributor

  • Click Select Member.

    In the side panel, add the Service Application you created in Step 2. Click Select to add the Service Application.

    Click Review + Assign to finish adding the permission.
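Where many Data Factories need the same permission, the portal steps above could also be scripted with the Azure CLI. The following is a sketch only, assuming the Azure CLI is installed and signed in with rights to assign roles. The function name `role_assignment_command` and all IDs are hypothetical placeholders; the code only builds the command and does not run it.

```python
import shlex


def role_assignment_command(app_id: str, data_factory_resource_id: str) -> list[str]:
    """Build an Azure CLI call equivalent to the portal role assignment."""
    return [
        "az", "role", "assignment", "create",
        "--assignee", app_id,                  # Service Application from Step 2
        "--role", "Data Factory Contributor",  # role selected in the portal steps
        "--scope", data_factory_resource_id,   # limits the role to one Data Factory
    ]


# Placeholder IDs for illustration only.
cmd = role_assignment_command(
    app_id="11111111-1111-1111-1111-111111111111",
    data_factory_resource_id=(
        "/subscriptions/00000000-0000-0000-0000-000000000000"
        "/resourceGroups/example-rg"
        "/providers/Microsoft.DataFactory/factories/example-factory"
    ),
)
# Print a shell-safe command line to review before running it manually.
print(shlex.join(cmd))
```

Scoping the assignment to the individual Data Factory, rather than the resource group or subscription, mirrors what the portal does when the role is added from the Data Factory's own Access control (IAM) page.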

 


 

Step 4) Add Azure Data Factory as a New Source

  • Select Platform Settings in the side bar

  • In the pop-out side panel, under Integrations click on Sources

  • Click Add Source and select AZURE_DATA_FACTORY
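It can be useful to confirm outside of K that the Service Application is able to read the Data Factory. The sketch below uses only the Python standard library: it requests an Azure Resource Manager token with the OAuth 2.0 client credentials flow and then lists the pipelines in the factory. It assumes the app registration from Step 2 yields a tenant ID, client ID and client secret, which this page does not list. All function names are hypothetical and this is not how K itself connects.

```python
import json
import urllib.parse
import urllib.request

ARM = "https://management.azure.com"


def token_request(tenant_id: str, client_id: str, client_secret: str) -> urllib.request.Request:
    """Build the client credentials token request for Azure Resource Manager."""
    body = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "scope": f"{ARM}/.default",
    }).encode()
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    return urllib.request.Request(url, data=body, method="POST")


def list_pipelines_request(subscription_id: str, resource_group: str,
                           factory_name: str, token: str) -> urllib.request.Request:
    """Build the request that lists pipelines in one Data Factory."""
    url = (
        f"{ARM}/subscriptions/{subscription_id}/resourceGroups/{resource_group}"
        f"/providers/Microsoft.DataFactory/factories/{factory_name}"
        "/pipelines?api-version=2018-06-01"
    )
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


def check_access(tenant_id: str, client_id: str, client_secret: str,
                 subscription_id: str, resource_group: str, factory_name: str) -> list[str]:
    """Return pipeline names from the first page of results; raises HTTPError if access is denied."""
    with urllib.request.urlopen(token_request(tenant_id, client_id, client_secret)) as resp:
        token = json.load(resp)["access_token"]
    request = list_pipelines_request(subscription_id, resource_group, factory_name, token)
    with urllib.request.urlopen(request) as resp:
        return [pipeline["name"] for pipeline in json.load(resp)["value"]]
```

Calling `check_access(...)` with the values from Steps 1 and 2 should return a list of pipeline names if the role assignment from Step 3 is in place. An HTTP 401 or 403 error suggests the credentials or the role assignment need to be revisited.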


Step 5) Schedule Azure Data Factory source load

  • Select Platform Settings in the side bar

  • In the pop-out side panel, under Integrations click on Sources

  • Locate your new Azure Data Factory Source and click on the Schedule Settings (clock) icon to set the schedule


Step 6) Manually run an ad hoc load to test Azure Data Factory

  • Next to your new Source, click on the Run manual load icon

    Confirm how you want the source to be loaded

  • After the source load is triggered, a pop-up bar will appear taking you to the Monitor tab in the Batch Manager page. This is the usual page you visit to view the progress of source loads.