Data Engineering in Microsoft Fabric: Part 4 – Orchestration

Abdul Alim

Data Analyst, Data Crafters

Data orchestration is an automated way to manage and organize data from different sources, making it ready for analysis. It’s also known as data pipeline orchestration or data workflow orchestration.

This process covers many parts of data management, such as ensuring data quality, enforcing data governance rules, and automating routine tasks. It also handles moving and transforming data as part of ETL (Extract, Transform, Load) operations. Data orchestration helps organizations use their data effectively by giving them a single view for analysis, reporting, and decision-making. Depending on their needs, organizations might use several orchestration tools together.
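To make the ETL idea concrete, here is a minimal sketch in plain Python. It is not Microsoft Fabric code: the CSV text, the `orders` table, and the function names are all hypothetical stand-ins, and an in-memory SQLite database plays the role of the target system.

```python
import csv
import io
import sqlite3

# Hypothetical raw input standing in for an export from a source system.
RAW_CSV = """order_id,amount,country
1,10.50,us
2,,de
3,7.25,US
"""

def extract(text):
    """Read rows from CSV text into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Drop rows with a missing amount and normalize types and casing."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # simple data-quality rule: skip incomplete rows
        cleaned.append(
            (int(row["order_id"]), float(row["amount"]), row["country"].upper())
        )
    return cleaned

def load(conn, rows):
    """Write the cleaned rows into the (hypothetical) target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER, amount REAL, country TEXT)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()

def run_etl(text):
    """Chain the three stages against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract(text)))
    return conn

conn = run_etl(RAW_CSV)
total = conn.execute("SELECT COUNT(*), SUM(amount) FROM orders").fetchone()
print(total)
```

An orchestration tool takes on the role of `run_etl` here: it decides when each stage runs and what happens when one of them fails.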

Benefits of Data Orchestration

  • Integration of Data: Combines data from multiple sources, as part of the Extract, Transform, Load (ETL) process.
  • Automation and Efficiency: Uses automated tools and technologies to manage and coordinate data across different systems and applications, reducing the need for manual intervention and increasing efficiency.
  • Job Scheduling and Workflow Management: Schedules tasks, manages workflows, and handles dependencies among various jobs.
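The dependency handling mentioned in the last point can be illustrated with a small sketch. It uses Python's standard `graphlib` module rather than any orchestration product, and the job names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical jobs: each key maps to the set of jobs it depends on.
DEPENDENCIES = {
    "load_sales": set(),
    "load_customers": set(),
    "build_report": {"load_sales", "load_customers"},
    "send_email": {"build_report"},
}

def run_in_order(dependencies, jobs):
    """Run each job only after all of its dependencies have finished."""
    executed = []
    for name in TopologicalSorter(dependencies).static_order():
        jobs[name]()
        executed.append(name)
    return executed

# Each job simply prints its name; a real job would move or transform data.
JOBS = {name: (lambda n=name: print(f"running {n}")) for name in DEPENDENCIES}
order = run_in_order(DEPENDENCIES, JOBS)
```

The two load jobs may run in either order, but `build_report` always runs after both of them and `send_email` always runs last.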

Demonstration: Creating a Pipeline with Dataflow and Stored Procedure Activities in Microsoft Fabric

In this section, we will create a data pipeline using dataflow and stored procedure activities.

  1.  Begin by navigating to the appropriate workspace and opening the Microsoft Fabric Data Engineering experience.

Image (1)

  2.  Choose ‘+ New’ and then select ‘Data Pipeline’ to create a new pipeline.

Image (2)

  3.  Give the pipeline a name and click the Create button.

Image (3)

  4.  You’ll encounter an interface similar to the following image. In this interface, choose ‘Pipeline activity’.

Image (4)

  5.  After choosing the Pipeline activity, you’ll see a list of activities. From there, select Dataflow.

Image (5)

  6.  Within the General tab, rename the dataflow activity.  

Image (6)

  7.  In the settings tab, choose the appropriate workspace and the previously created Dataflow Gen2.

Image (7)

  8.  Next, select the Dataflow activity among the graph nodes and click its ‘Add activity’ icon.

Image (8)

  9.  Next, choose the stored procedure from the list of activities.

Image (9)

  10.  Select the Stored Procedure activity and assign it a name under the General tab.

Image (10)

  11.  In the Settings tab, choose a suitable connection and the stored procedure created in the previous blog post.

Image (11)

  12.  Select the ‘Import’ button to load the stored procedure parameters, then enter a number in the value field.

Image (12)

  13.  Then select the ‘Run’ button to run the pipeline.

Image (13)

  14.  After clicking the ‘Run’ button, you’ll be prompted to save first. Choose ‘Save and run’ to save the pipeline and start the run.

Image (14)

  15.  After running the pipeline, you will be able to view its status.

Image (15)

  16.  When the run completes, the pipeline displays a ‘succeeded’ status along with its two activities.

Image (16)

  17.  You can also set a schedule for this data pipeline. To set up a scheduled run, select the Schedule button from the Home menu bar.

Image (17)

  18.  Turn on ‘Scheduled run’ and set your preferred time and time zone. After entering the details, select the Apply button to save the schedule.

Image (18)
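The pipeline built in the steps above can be modeled in a few lines of Python. This is only a conceptual sketch, not the Microsoft Fabric API: the `Activity` class, the status strings, and both stand-in functions are hypothetical. It also assumes the Stored Procedure activity is connected so that it runs only after the Dataflow activity succeeds.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

@dataclass
class Activity:
    name: str
    action: Callable[[], None]

def run_pipeline(activities: List[Activity]) -> Dict[str, str]:
    """Run activities in sequence; skip the rest after the first failure."""
    statuses: Dict[str, str] = {}
    failed = False
    for activity in activities:
        if failed:
            statuses[activity.name] = "Skipped"
            continue
        try:
            activity.action()
            statuses[activity.name] = "Succeeded"
        except Exception:
            statuses[activity.name] = "Failed"
            failed = True
    return statuses

# Hypothetical stand-ins for the two activities configured in the steps above.
def refresh_dataflow() -> None:
    print("refreshing dataflow")

def call_stored_procedure(value: int = 5) -> None:
    # 'value' mirrors the number entered for the stored procedure parameter.
    print(f"calling stored procedure with value={value}")

result = run_pipeline([
    Activity("Dataflow", refresh_dataflow),
    Activity("Stored procedure", call_stored_procedure),
])
print(result)
```

If the first activity raised an error, the second would be marked as skipped. That matches the intent of chaining the two activities: the stored procedure should not run against data the dataflow failed to refresh.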

In this article, we covered what data orchestration is and what benefits it brings. We then put it into practice in Microsoft Fabric by creating a data pipeline with Dataflow and Stored Procedure activities, running it, and scheduling it to run automatically. Orchestration ties these steps together so that data is moved and transformed in the right order without manual intervention.
