mecfs_bio.build_system.task.compressed_csv_to_parquet_task
Enable efficient SQL operations by converting CSV to parquet.
Classes:
-
CompressedCSVToParquetTask–Task for converting a gzipped CSV-type file to a parquet file
Attributes:
-
logger–
CompressedCSVToParquetTask
Bases: Task
Task for converting a gzipped CSV-type file to a parquet file Main use is for processing the SNP151 SNP database files
Methods:
Attributes:
-
csv_task(Task) – -
deps(list[Task]) – -
meta(Meta) – -
select_list(list[str] | None) – -
source_compression(str | None) – -
target_compression(str) – -
type_dict(Mapping[str, str] | None) –
create
classmethod
create(
csv_task: Task,
asset_id: str,
target_compression: str = "zstd",
source_compression: str | None = "gzip",
select_list: list[str] | None = None,
type_dict: Mapping[str, str] | None = None,
) -> CompressedCSVToParquetTask