Working with Parquet files
Apache Parquet is a columnar storage format available for most of the data processing frameworks in the Hadoop ecosystem: Hive, Pig, Spark, Drill, Arrow, Apache Impala, Cascading, Crunch, Tajo, and many more! In Parquet, the data are compressed column by column. This means that commands like these: