Training dataset resource bug #86

vishalbollu · 2019-04-25T15:13:17Z

Closes #56

Checklist:

Run make test and make lint
Test end to end manually (e.g. build/push all images, restart local operator, and run cx refresh in an example folder)
Update documentation
Update examples and cx init
Alert team if dev environment changed
Cherry-pick into release branches if it's a bugfix
Delete the branch once it's merged

deliahu · 2019-04-25T16:27:49Z

docs/applications/resources/models.md

@@ -41,11 +41,24 @@ Train custom TensorFlow models at scale.
    start_delay_secs: <int>  # start evaluating after waiting for this many seconds (default: 120)
    throttle_secs: <int>  # do not re-evaluate unless the last evaluation was started at least this many seconds ago (default: 600)

-  compute:
+  compute:         # Resources for training and evaluations steps


Maybe mention it's the TensorFlow job? Like Resources for training and evaluations steps (TensorFlow)? Similarly # Resources for constructing training dataset (Spark)? We get Omer's stamp of approval on this

@ospillinger thoughts on the doc comments?

pkg/operator/api/userconfig/compute.go

deliahu · 2019-04-25T16:32:56Z

pkg/operator/api/userconfig/quantity.go

@@ -30,6 +30,15 @@ type Quantity struct {
 	UserString string
 }

+func MustNewQuantity(str string) Quantity {


Is this used?

deliahu · 2019-04-25T16:34:31Z

pkg/operator/workloads/data_job.go

@@ -251,12 +251,7 @@ func dataWorkloadSpecs(ctx *context.Context) ([]*WorkloadSpec, error) {
 		}
 		trainingDatasets = append(trainingDatasets, modelName)
 		trainingDatasetIDs.Add(dataset.GetID())
-		dependencyIDs := ctx.AllComputedResourceDependencies(dataset.GetID())


I think we still need to append the transformedColumn.Computes since transforming the data happens in the same step as preparing the dataset

vishalbollu added 4 commits April 23, 2019 15:58

Default spark compute resource to spark workloads

193040a

Add spark compute to model config and remove default

9122a03

Merge branch 'master' into training-dataset-resource-bug

5f93120

Update model config documentation

3089938

deliahu reviewed Apr 25, 2019

View reviewed changes

vishalbollu and others added 5 commits April 26, 2019 15:49

Refactoring variable and deleting unnecessary function

38b57b4

Remove duplicate tag field in doc

d023e84

Merge branch 'master' into training-dataset-resource-bug

758d88f

Declare spark compute validation object const as ptr

31b8750

Update models.md

087e2e4

deliahu approved these changes Apr 26, 2019

View reviewed changes

vishalbollu merged commit b4b1319 into master Apr 26, 2019

vishalbollu deleted the training-dataset-resource-bug branch April 26, 2019 23:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Training dataset resource bug #86

Training dataset resource bug #86

Uh oh!

vishalbollu commented Apr 25, 2019 •

edited

Loading

Uh oh!

deliahu Apr 25, 2019

Uh oh!

vishalbollu Apr 26, 2019 •

edited

Loading

Uh oh!

ospillinger Apr 26, 2019

Uh oh!

Uh oh!

deliahu Apr 25, 2019

Uh oh!

deliahu Apr 25, 2019

Uh oh!

Uh oh!

Training dataset resource bug #86

Training dataset resource bug #86

Uh oh!

Conversation

vishalbollu commented Apr 25, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Closes #56

Uh oh!

deliahu Apr 25, 2019

Choose a reason for hiding this comment

Uh oh!

vishalbollu Apr 26, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ospillinger Apr 26, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

deliahu Apr 25, 2019

Choose a reason for hiding this comment

Uh oh!

deliahu Apr 25, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vishalbollu commented Apr 25, 2019 •

edited

Loading

vishalbollu Apr 26, 2019 •

edited

Loading