Parquet
|
Structured
|
Snappy, gzip; Snappy is currently the default
|
Yes.
|
Yes: CREATE TABLE, INSERT, LOAD DATA, and query.
|
Text
|
Unstructured
|
LZO, gzip, bzip2, Snappy
|
Yes. For CREATE TABLE with no STORED AS clause, the default file format is uncompressed text, with values separated by ASCII 0x01 characters (typically represented as Ctrl-A).
|
Yes: CREATE TABLE, INSERT, LOAD DATA, and query. If LZO compression is used, you must create the table and load data in Hive. If other kinds of compression are used, you must load data through LOAD DATA, through Hive, or by placing the files manually in HDFS.
|
Avro
|
Structured
|
Snappy, gzip, deflate, bzip2
|
Yes, in Impala 1.4.0 and higher. In earlier releases, create the table using Hive.
|
No. Use LOAD DATA on data files that are already in Avro format, or use INSERT in Hive.
|