Data Flow Formatters – Flatten, Parse and Stringify in Azure Data Factory


Introduction

I recently worked on a data pipeline project where I had to process JSON files and transform the data into CSV files. Azure Data Factory provides data flow formatters transformations to process the data in the pipeline. In this article, we will discuss Flatten, Parse, and Stringify transformation.

Flatten Transformation

The flatten transformation takes array values inside hierarchical structures such as JSON and converts them into individual rows.

We will use the below JSON file in the pipeline.It contains columns id,firstname,lastname,gender,age,address complex type which has streetAddress,city and state.

The JSON file is available in the data lake folder: