Pipeline defines a set of steps that can be executed together.
Attributes
| Name | Type | Description |
|---|---|---|
| pipeline_id | uuid.UUID | technical id of the pipeline. |
| experiment_id | uuid.UUID | technical id of the experiment linked to the pipeline. |
| key | str | logical string unique id of the pipeline. |
| steps | list | list of steps connected between by their inputs and outputs. |
| template_id | uuid.UUID | template id associated to the pipeline. |
| variables | dict | dictionary of variable name with they wizata_dsapi.VarType. |
| createdById | uuid.UUID | unique identifier of creating user. |
| createdDate | int | timestamp of created date. |
| updatedById | uuid.UUID | unique identifier of updating user. |
| updatedDate | int | timestamp of updated date. |
Methods
add_model()
add a model step
| Name | Type | Default | Description |
|---|---|---|---|
| config | MLModelConfig | model configuration to define how pipeline should train and use your model. | |
| input_df | str, dict or Pipeline I/O defining input dataframe properties | ||
| output_df | None | str, dict or Pipeline I/O defining input dataframe properties |
add_plot()
add a writer step
| Name | Type | Default | Description |
|---|---|---|---|
| script | script configuration to define how pipeline should execute the plot script. | ||
| df_name | str | None | str deprecated usage |
| input_df | PipelineIO | None | str, dict or Pipeline I/O defining input dataframe properties |
add_query()
add a query step
| Name | Type | Default | Description |
|---|---|---|---|
| request | Request | request definition to add. | |
| df_name | str | query_df | output name ot use for the dataframe - use df_output for more features. |
| use_template | bool | True | by default, if pipeline is link to a template, the query will be too. set to false to disable forcing it. |
| output_df | PipelineIO | None | output df - can set a mapping. |
add_transformation()
add a transformation script
| Name | Type | Default | Description |
|---|---|---|---|
| script | name, Script or ScriptConfig. | ||
| inputs | list | None | list of Pipeline I/O or dict or str for dataframe input names. |
| outputs | list | None | list of Pipeline I/O or dict or str for dataframe output names. |
| input_df_names | list | None | deprecated support. |
| output_df_names | list | None | deprecated support. |
add_writer()
add a writer step
| Name | Type | Default | Description |
|---|---|---|---|
| config | WriteConfig | writer configuration to define how pipeline should write data into platform. | |
| input_df | str, dict or Pipeline I/O defining input dataframe properties |
api_id()
Id of the pipeline
return: string formatted UUID of the Pipeline.
check_path()
validate that steps create a valid path.
return true if path is valid, otherwise raise errors
check_variables()
verify that variables dict is a valid { "name" : "VarType" } dictionary.
endpoint()
Name of the endpoints used to manipulate pipeline.
return: Endpoint name.
from_json()
load from JSON dictionary representation
| Name | Type | Default | Description |
|---|---|---|---|
| obj |
set_id()
specify the id_value neutrally
| Name | Type | Default | Description |
|---|---|---|---|
| id_value |
return:
to_json()
Convert to a json version of Execution definition.
By default, use DS API format.
| Name | Type | Default | Description |
|---|---|---|---|
| target | str | None |