Thursday, December 17, 2020

what is self-hosted integration runtime and pipeline in Azure data factory ?

 Install a self-hosted integration runtime 

The integration runtime is a customer-managed data integration infrastructure used by  Azure Data Factory to provide data integration capabilities across different network environments.


Create a pipeline:

With ADF, existing data processing services can be composed into data pipelines that are highly available and managed in the cloud.

These data pipelines can be scheduled to ingest, prepare , transform, analyze and publish data and ADF manages and orchestrates the complex data and processing dependencies.

https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/move-sql-azure-adf

How to create and configure a self-hosted integration runtime ?

Azure Data Factory service: to copy data from the server onto Azure Storage.

why to install the self-hosted integration runtime?

You have to install the self-hosted integration runtime on the server so that it can be accepted as a data source in Azure Data Factory.

create and configure a self-hosted integration runtime

The  integration runtime ( IR ) is the compute infrastructure that Azure Data Factory uses to provide data-integration capabilities across different network environments.

A self-hosted  integration runtime can run copy activities between a cloud data store and a data store in a private network. It can dispatch transform activities against compute resources in an on-prem network or an Azure virtual network. The installation of a self-hosted integration runtime needs an on-prem machine or a virtual machine inside a private network.









No comments:

Post a Comment