ParquetLoader
Streaming loader for Apache Parquet encoded files.
Loader | Characteristic |
---|---|
File Format | Parquet |
Data Format | Classic Table |
File Extension | .parquet |
MIME Type | N/A (application/octet-stream) |
File Type | Binary |
Supported APIs | load, parse, parseInBatches |
Please refer to the Parquet format page for information on which Parquet format features are supported.
Usage
Load a Parquet file as a table.
import {ParquetLoader} from '@loaders.gl/parquet';
import {load} from '@loaders.gl/core';
const data = await load(url, ParquetLoader, {parquet: options});
The ParquetLoader also supports streaming parsing through loadInBatches, in which case it yields "batches" of rows.
import {ParquetLoader} from '@loaders.gl/parquet';
import {loadInBatches} from '@loaders.gl/core';
const batches = await loadInBatches('geo.parquet', ParquetLoader, {parquet: options});
for await (const batch of batches) {
  // batch.data will contain a number of rows
  for (const feature of batch.data) {
    switch (feature.geometry.type) {
      case 'Polygon':
      ...
    }
  }
}
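The batch loop above can be wrapped in a small reusable helper. The following is a minimal sketch and not part of loaders.gl: `countGeometryTypes` and `fakeBatches` are hypothetical names, and the sketch assumes every batch exposes its rows as an array on `batch.data` holding GeoJSON-like features, as in the example above. The stand-in generator replaces the iterator returned by `loadInBatches` so that the snippet runs without a Parquet file.

```javascript
// Tally geometry types across an async iterable of batches.
// Assumption: each batch has a `data` array of GeoJSON-like features.
async function countGeometryTypes(batches) {
  const counts = {};
  for await (const batch of batches) {
    for (const feature of batch.data) {
      // Rows without a geometry are grouped under the key 'null'
      const type = feature.geometry ? feature.geometry.type : 'null';
      counts[type] = (counts[type] || 0) + 1;
    }
  }
  return counts;
}

// Hypothetical stand-in for the iterator returned by loadInBatches().
async function* fakeBatches() {
  yield {data: [{geometry: {type: 'Polygon'}}, {geometry: {type: 'Point'}}]};
  yield {data: [{geometry: {type: 'Polygon'}}, {geometry: null}]};
}

countGeometryTypes(fakeBatches()).then((counts) => console.log(counts));
```

In an application, the same helper would receive the result of `loadInBatches('geo.parquet', ParquetLoader, {parquet: options})` instead of `fakeBatches()`. Because it consumes one batch at a time, the full table never needs to be held in memory.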
Compressions
Some compression libraries are large, so they must be imported explicitly by the application
and passed to the ParquetLoader through the modules option.
import {ParquetLoader} from '@loaders.gl/parquet';
import {load} from '@loaders.gl/core';
import {ZstdCodec} from 'zstd-codec';
import lz4js from 'lz4js';
const data = await load(url, ParquetLoader, {
  modules: {
    'zstd-codec': ZstdCodec,
    'lz4js': lz4js
    // brotli - only needed for compression
  }
});
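The `parquet` and `modules` entries shown in the examples above are sibling keys of the same options object, so they can be combined in one call. The sketch below illustrates one way to assemble that object so that only the codecs the application actually imported are passed on. `makeParquetLoadOptions` is a hypothetical helper, not a loaders.gl API, and the placeholder objects stand in for the real `zstd-codec` and `lz4js` imports.

```javascript
// Build a loader options object carrying only the codecs that were imported.
// Keys follow the module names used above ('zstd-codec', 'lz4js').
function makeParquetLoadOptions(codecs = {}, parquetOptions = {}) {
  const modules = {};
  for (const [name, codec] of Object.entries(codecs)) {
    // Skip codecs the application chose not to bundle
    if (codec) {
      modules[name] = codec;
    }
  }
  return {parquet: parquetOptions, modules};
}

// Placeholders for illustration; a real application would use
// `import {ZstdCodec} from 'zstd-codec'` and `import lz4js from 'lz4js'`.
const ZstdCodec = {};
const lz4js = undefined; // pretend this codec was not bundled

const loadOptions = makeParquetLoadOptions({'zstd-codec': ZstdCodec, lz4js});
console.log(Object.keys(loadOptions.modules));
```

The resulting object would then be passed as the third argument: `await load(url, ParquetLoader, loadOptions)`.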
Data Format
For details, see the Parquet format documentation.
Options
Supports table category options such as batchType and batchSize.
Option | From | Type | Default | Description |
---|---|---|---|---|