sift_py.data_import.parquet
¶
| CLASS | DESCRIPTION |
|---|---|
ParquetUploadService |
|
ParquetUploadService
¶
ParquetUploadService(rest_conf: SiftRestConfig)
Bases: _RestService
| METHOD | DESCRIPTION |
|---|---|
flat_dataset_upload |
Uploads the Parquet file pointed to by |
upload |
Uploads the Parquet file pointed to by |
upload_from_url |
Uploads the Parquet file pointed to by |
| ATTRIBUTE | DESCRIPTION |
|---|---|
DETECT_CONFIG_PATH |
|
UPLOAD_PATH |
|
URL_PATH |
|
DETECT_CONFIG_PATH
class-attribute
instance-attribute
¶
flat_dataset_upload
¶
flat_dataset_upload(
asset_name: str,
path: Union[str, Path],
time_path: str,
time_format: TimeFormatType = ABSOLUTE_UNIX_NANOSECONDS,
complex_types_import_mode: ParquetComplexTypesImportModeType = BOTH,
run_name: Optional[str] = None,
run_id: Optional[str] = None,
relative_start_time: Optional[str] = None,
) -> DataImportService
Uploads the Parquet file pointed to by path to the specified asset. This function will
automatically generate the Parquet Config using the footer. See the options
below for what parameters can be overridden. Use upload if you need to specify a custom Parquet config.
Set time_path to specify which column contains timestamp information and time_format
to specify the time data format. Default is TimeFormatType.ABSOLUTE_UNIX_NANOSECONDS.
Override complex_types_import_mode to specify how to import complex types (maps and list). Default is both strings and bytes.
Override run_name to specify the name of the run to create for this data. Default is None.
Override run_id to specify the id of the run to add this data to. Default is None.
Override relative_start_time if a relative time format is used. Default is None.
upload
¶
upload(
path: Union[str, Path],
parquet_config: ParquetConfig,
show_progress: bool = True,
) -> DataImportService
Uploads the Parquet file pointed to by path using a custom Parquet config.
| PARAMETER | DESCRIPTION |
|---|---|
path
|
The path to the Parquet file. |
parquet_config
|
The Parquet config.
TYPE:
|
show_progress
|
Whether to show the status bar or not.
TYPE:
|
upload_from_url
¶
upload_from_url(
url: str, parquet_config: ParquetConfig
) -> DataImportService
Uploads the Parquet file pointed to by url using a custom Parquet config.