Constructor Detail

ParquetSerDe

Method Detail

setBlockSizeBytes

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon
S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this
value for padding calculations.

Parameters:

blockSizeBytes - The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from
Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose
uses this value for padding calculations.

getBlockSizeBytes

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon
S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this
value for padding calculations.

Returns:

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from
Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data
Firehose uses this value for padding calculations.

withBlockSizeBytes

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon
S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this
value for padding calculations.

Parameters:

blockSizeBytes - The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from
Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose
uses this value for padding calculations.

Returns:

Returns a reference to this object so that method calls can be chained together.
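
The size limits described above are stated in MiB, while the parameter name suggests a value in bytes. The following is a minimal sketch of that conversion and of the stated 64 MiB minimum. It assumes the parameter is an integer byte count. `BlockSizeSketch` and its members are hypothetical names used for illustration and are not part of the SDK.

```java
// Hypothetical helper for illustration only; it is not part of the SDK.
final class BlockSizeSketch {
    static final int MIB = 1024 * 1024;

    // Default and minimum as stated in the description above.
    static final int DEFAULT_BLOCK_SIZE_BYTES = 256 * MIB;
    static final int MIN_BLOCK_SIZE_BYTES = 64 * MIB;

    // Converts a size in MiB to bytes and rejects values below the stated minimum.
    static int blockSizeBytes(int mebibytes) {
        long bytes = (long) mebibytes * MIB; // widen first to avoid int overflow
        if (bytes < MIN_BLOCK_SIZE_BYTES) {
            throw new IllegalArgumentException("block size must be at least 64 MiB");
        }
        if (bytes > Integer.MAX_VALUE) {
            throw new IllegalArgumentException("block size does not fit in an int");
        }
        return (int) bytes;
    }
}
```

Under these assumptions, a value such as `BlockSizeSketch.blockSizeBytes(128)` could then be passed to `withBlockSizeBytes` or `setBlockSizeBytes`.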

setPageSizeBytes

The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms
of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

Parameters:

pageSizeBytes - The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit
(in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

withPageSizeBytes

The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms
of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

Parameters:

pageSizeBytes - The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit
(in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

Returns:

Returns a reference to this object so that method calls can be chained together.
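
The difference between the `set` and `with` variants is that the `with` methods return the object itself, so calls can be chained. The sketch below illustrates that pattern with a stand-in class. `SerDeSketch` is a hypothetical name and not the SDK class; it performs no validation, and the `Integer` parameter type is an assumption.

```java
// Stand-in class for illustration only; it mirrors the set/with naming
// pattern described above but is not the SDK's ParquetSerDe.
final class SerDeSketch {
    private Integer blockSizeBytes;
    private Integer pageSizeBytes;

    void setBlockSizeBytes(Integer blockSizeBytes) {
        this.blockSizeBytes = blockSizeBytes;
    }

    Integer getBlockSizeBytes() {
        return blockSizeBytes;
    }

    // Same effect as the setter, but returns this so calls can be chained.
    SerDeSketch withBlockSizeBytes(Integer blockSizeBytes) {
        setBlockSizeBytes(blockSizeBytes);
        return this;
    }

    void setPageSizeBytes(Integer pageSizeBytes) {
        this.pageSizeBytes = pageSizeBytes;
    }

    Integer getPageSizeBytes() {
        return pageSizeBytes;
    }

    SerDeSketch withPageSizeBytes(Integer pageSizeBytes) {
        setPageSizeBytes(pageSizeBytes);
        return this;
    }

    public static void main(String[] args) {
        // 128 MiB blocks and 1 MiB pages, both expressed in bytes.
        SerDeSketch serDe = new SerDeSketch()
                .withBlockSizeBytes(128 * 1024 * 1024)
                .withPageSizeBytes(1024 * 1024);
        System.out.println(serDe.getBlockSizeBytes() + " " + serDe.getPageSizeBytes());
    }
}
```

With the setter form, the same configuration takes one statement per property, because the setters return nothing.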

setCompression

The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY
for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

Parameters:

compression - The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use
SNAPPY for higher decompression speed. Use GZIP if the compression ratio is
more important than speed.

getCompression

The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY
for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

Returns:

The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use
SNAPPY for higher decompression speed. Use GZIP if the compression ratio is
more important than speed.

withCompression

The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY
for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

Parameters:

compression - The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use
SNAPPY for higher decompression speed. Use GZIP if the compression ratio is
more important than speed.

Returns:

Returns a reference to this object so that method calls can be chained together.

withCompression

The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY
for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

Parameters:

compression - The compression codec to use over data blocks. The possible values are UNCOMPRESSED,
SNAPPY, and GZIP, with the default being SNAPPY. Use
SNAPPY for higher decompression speed. Use GZIP if the compression ratio is
more important than speed.

Returns:

Returns a reference to this object so that method calls can be chained together.
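
The guidance above on choosing a compression value can be sketched as a small enum. `CompressionSketch`, `parseOrDefault`, and `choose` are hypothetical names used for illustration and are not part of the SDK. Falling back to SNAPPY for a missing value reflects the documented default, but doing so on the client side is an assumption of this sketch.

```java
import java.util.Locale;

// Hypothetical enum for illustration only; the constant names are the
// three values listed in the description above.
enum CompressionSketch {
    UNCOMPRESSED, SNAPPY, GZIP;

    // Parses a value case-insensitively, falling back to the documented
    // default (SNAPPY) when no value is given. Unknown names are rejected
    // by Enum.valueOf with an IllegalArgumentException.
    static CompressionSketch parseOrDefault(String value) {
        if (value == null || value.trim().isEmpty()) {
            return SNAPPY;
        }
        return valueOf(value.trim().toUpperCase(Locale.ROOT));
    }

    // Follows the guidance above: GZIP when compression ratio matters
    // more than speed, SNAPPY otherwise.
    static CompressionSketch choose(boolean ratioMattersMoreThanSpeed) {
        return ratioMattersMoreThanSpeed ? GZIP : SNAPPY;
    }
}
```

Assuming one `withCompression` overload accepts the value as a string, the result of `choose(true).name()` could be passed to it directly.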