Choosing a File Format – Data Engineering with Fabric


Microsoft Fabric has OneLake Storage at the center of all services.  This storage is based upon existing Azure Data Lake Storage and can be accessed with tools that you are familiar with.  Since the invention of computers, many different file formats have been created.  Understanding the pros and cons of each file type is important.

Business Problem

Our manager has asked us to understand when to use the following file formats:  AVRO, CSV, DELTA, JSON, ORC, PARQUET, and TEXT.

Technical Solution

The preferred format for files in the OneLake is the delta lake tables.  However, the OneLake is just a