mecfs_bio.build_system.task.join_dataframes_task
Task to perform a SQL-style join.
Classes:
-
JoinDataFramesTask–Task to load to dataframes, join them, and then write out the resulting dataframe.
Attributes:
-
logger–
JoinDataFramesTask
Bases: Task
Task to load to dataframes, join them, and then write out the resulting dataframe.
By default, writes as csv.
Methods:
-
create_from_result_df–Join a result dataframe to a reference dataframe.
-
execute–
Attributes:
-
backend(ValidBackend) – -
deps(list[Task]) – -
df_1_pipe(DataProcessingPipe) – -
df_2_pipe(DataProcessingPipe) – -
how(JoinStrategy) – -
left_on(Sequence[str]) – -
meta(Meta) – -
out_format(OutFormat) – -
out_pipe(DataProcessingPipe) – -
right_on(Sequence[str]) –
create_from_result_df
classmethod
create_from_result_df(
asset_id: str,
result_df_task: Task,
reference_df_task,
how: JoinStrategy,
left_on: Sequence[str],
right_on: Sequence[str],
out_format: OutFormat = CSVOutFormat(sep=","),
df_1_pipe: DataProcessingPipe = IdentityPipe(),
df_2_pipe: DataProcessingPipe = IdentityPipe(),
out_pipe: DataProcessingPipe = IdentityPipe(),
backend: ValidBackend = "polars",
)
Join a result dataframe to a reference dataframe.