Updating a Dataset

An Update Job re-calculates Pipeline Column values and should be run any time changes are made to a Dataset or its Tasks. This is called Updating the Dataset.

Update Jobs only re-calculate Pipeline Column values that may have changed since the last Update. You can manually Update a Dataset at any time using Pipelines > Update.

Automatic Updates

When a new Dataset is loaded (Load Data), an Update Job will be executed on all Tasks to generate initial values for all Pipeline Columns.

When additional data is loaded into an existing Dataset (Load Data), an Update Job will be executed on all Tasks to generate initial values in all Pipeline Columns for the newly loaded rows of the Dataset.
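
As a rough illustration of the behaviour described above, the following Python sketch models a Dataset whose Pipeline Columns are filled in only for newly loaded rows. It is a conceptual model, not the product's implementation: the Dataset class, the load_data method, and the name_upper Task are all hypothetical names invented for this example.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Task = Callable[[Row], object]


class Dataset:
    """Hypothetical model: each Task fills one Pipeline Column."""

    def __init__(self, tasks: Dict[str, Task]):
        self.tasks = tasks
        self.rows: List[Row] = []

    def load_data(self, new_rows: List[Row]) -> int:
        """Append rows, then compute initial values for the new rows only."""
        start = len(self.rows)
        self.rows.extend(dict(r) for r in new_rows)
        for row in self.rows[start:]:
            for column, task in self.tasks.items():
                row[column] = task(row)
        return len(self.rows) - start


# Hypothetical Task that derives a Pipeline Column from a "name" field.
tasks = {"name_upper": lambda row: str(row["name"]).upper()}

ds = Dataset(tasks)
ds.load_data([{"name": "ann"}])       # new Dataset: all rows are computed
ds.rows[0]["name_upper"] = "EDITED"   # marker to show row 0 is not recomputed
ds.load_data([{"name": "bob"}])       # additional data: only the new row is computed
```

In this sketch the second load leaves the first row's Pipeline Column untouched and computes a value only for the newly added row.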

When a new Task is added to a Template, an Update Immediately? checkbox will be selected by default, indicating that an Update will be run after the Task is created to generate the new Pipeline Column results for that Task.

Manual Updates

You can start an Update in two ways:

·      Use the Update option for each Dataset on the Datasets page (Datasets > Datasets).

·      Click Pipelines > Updates, then New Update Job, and select the Dataset.

o   To force an update of all Pipeline Columns, select the Update All Columns checkbox.

When you Edit a Dataset, you can also select Tasks which will be updated the next time an Update Job is run.
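
The difference between a normal Update, a forced Update of all columns, and marking individual Tasks for the next Update can be pictured with a small Python sketch. This is only a conceptual model under stated assumptions: the Dataset class, mark_for_update, run_update_job, and the update_all_columns flag are hypothetical names chosen to mirror the options described above, and the product's internal logic for detecting changes is not documented here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set

Row = Dict[str, object]


@dataclass
class Dataset:
    """Hypothetical model: each Task fills one Pipeline Column."""
    rows: List[Row]
    tasks: Dict[str, Callable[[Row], object]]
    stale: Set[str] = field(default_factory=set)  # Tasks marked for the next Update

    def mark_for_update(self, column: str) -> None:
        """Mimics selecting a Task while editing the Dataset."""
        if column not in self.tasks:
            raise KeyError(column)
        self.stale.add(column)


def run_update_job(ds: Dataset, update_all_columns: bool = False) -> List[str]:
    """Re-calculate marked columns, or every column when forced."""
    if update_all_columns:
        columns = list(ds.tasks)
    else:
        columns = [c for c in ds.tasks if c in ds.stale]
    for column in columns:
        task = ds.tasks[column]
        for row in ds.rows:
            row[column] = task(row)
    ds.stale.clear()
    return columns


ds = Dataset(
    rows=[{"qty": 2, "price": 5.0}],
    tasks={
        "total": lambda r: r["qty"] * r["price"],
        "is_bulk": lambda r: r["qty"] >= 10,
    },
)

first = run_update_job(ds, update_all_columns=True)  # forced: every column
ds.rows[0]["qty"] = 12
ds.mark_for_update("total")                          # only this Task is selected
second = run_update_job(ds)                          # re-calculates "total" only
```

After the second run, total reflects the new quantity while is_bulk still holds its earlier value, because that Task was not selected and the Update was not forced.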