Version: 2.8

1 Introduction

Parquet Utils enables EVL Tool to read and write Parquet files. Used standalone (i.e. without EVL Tool), it provides the scripts ‘csv2parquet’ and ‘parquet2csv’, and it can provide metadata from Parquet files.

It is useful when you need to convert various sources directly to or from a Parquet file, especially on a regular basis.

1.1 Parquet Utils used standalone

[Figure: Parquet Utils standalone]

Parquet Utils on its own, without any other EVL Microservice, contains these scripts:

  • csv2evd – generate EVL data definition file (EVD file) based on CSV
  • csv2parquet – convert CSV to Parquet
  • parquet2csv – convert Parquet to CSV
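To illustrate the general idea of deriving a data definition from a CSV file, here is a minimal Python sketch. It is not the csv2evd implementation and does not produce the EVD format, which this section does not describe. The function names `infer_type` and `infer_schema` and the type labels `int`, `float`, and `string` are hypothetical, chosen only for this example.

```python
import csv
import io


def infer_type(values):
    """Return the narrowest of 'int', 'float', 'string' that fits all non-empty values."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        # Nothing to inspect, so fall back to the most general type.
        return "string"
    for name, cast in (("int", int), ("float", float)):
        try:
            for v in non_empty:
                cast(v)
            return name
        except ValueError:
            continue
    return "string"


def infer_schema(csv_text):
    """Map each header name to an inferred type (assumes the first row is a header)."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    columns = zip(*data) if data else [[] for _ in header]
    return {name: infer_type(list(col)) for name, col in zip(header, columns)}


# Hypothetical sample data, used only to exercise the sketch.
sample = "id,price,name\n1,9.5,apple\n2,12,pear\n"
schema = infer_schema(sample)
print(schema)
```

A real tool would also need to handle dates, nullability, delimiters, and quoting rules. This sketch only shows why a definition step based on the CSV contents can precede a conversion to a typed format such as Parquet.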

1.2 Parquet Utils in combination with other Microservices

[Figure: Parquet Utils combinations]

In combination with other EVL Microservices, such as Avro Utils, EVL Data Anonymization, or EVL Data Hub, you can also read and write:

  • various file formats: Avro, JSON, QVD, xlsx, and XML,
  • database tables: MySQL, PostgreSQL, Oracle, SQLite, Teradata, or any other via ODBC.

1.3 Parquet Utils with EVL Tool

Together with EVL Tool (an ETL tool), it can be used, for example, in a Parquet-based architecture:

[Figure: Parquet-based architecture]