Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Scroll ignore
scroll-viewporttrue
scroll-pdftrue
scroll-officetrue
scroll-chmtrue
scroll-docbooktrue
scroll-eclipsehelptrue
scroll-htmltrue
scroll-epubtrue

Open in new tab

This page will walkthrough the setup of Azure Data Factory in K using the direct connect method

Integration details

Scope

Included

Comments

Metadata

Status
colourGreen
titleYES

See below

Lineage

Status
colourGreen
titleYES

Usage

Status
colourGreen
titleYES

Sensitive Data Scanner

Status
titleN/A

Note

Known limitations

  • Not all sources and destinations are included in the metadata extraction. Improvements are planned to provide wider coverage

  • Sources Implemented

    • SNOWFLAKE

...

Step 1) Enabling Azure Data Factory Admin APIs to be accessible to an AD Group

...

  • Locate the Data Factory that you would like to connect to K

  • Click on Overview to copy the below details for a later step:

    • Factory name

    • Resource group name

    • Subscription ID

      Image RemovedImage Added

...

Step 2) Registering Azure Data Factory App in Azure AD

...

To ensure your Azure Data Factory can connect to K, you will need to provide the Azure Data Factor Factory with the correct Role Assignment

  • Follow Step 1 to navigate to your Data Factory you wish to profile. You will need to perform the following steps for each Data Factory you wish to profile.

    Open a Data Factory

    Image Added
  • Click on Access control (IAM) in the panel and click Add

...

Image Added

  • Select Data Factory Contributor

...

  • Click Select Member.

    In the side panel add the the Service Application you created in Step 2. Click Select to add the Service Application.

    Click Review + Assign to finish adding the permission.

...

...

Step 4) Add Azure Data Factory as a New Source

...

  • Select Platform Settings in the side bar

  • In the pop-out side panel, under Integrations click on Sources

  • Click Add Source and select AZURE_DATA_FACTORY

...

  • Select Direct Connect and add your Azure Data Factory details and click Next

  • Fill in the Source Settings and click Save & Next

    • Name: Give the Azure Data Factory source a name in K. If you have multiple ADFs, each one will need to have a unique name

    • Host: Enter the url e.g. adf.azure.com

    • Timeout: Default is 10, sometimes it may take longer for the API to respond, so we recommend increasing it to 20

    • Add Update the Host / Mapping details. See Database mapping - see Host / Database Mapping for more details. This step can be completed after the initial load via the guided workflow.

    • Select Enable Workspace Filtering if you wish to load only select Workspaces

  • Add Connection Details and click Save & Next

    • Tenant ID: Add the Directory (tenant) ID copied from step 2

    • Client ID : Add the Application (client) ID copied from Step 2

    • Client Secret: Add the Secret ID copied from Step 2

  • Test your connection and click Next

  • If you selected Enabled Workspace Filtering select the Workspaces you want to load. If you have a lot of workspaces this may take a bit of time to load.

  • Click Finish Setup

...