
Added support for lifecycle.started for clusters #5150

Open

andrewnester wants to merge 8 commits into main from feat/lifecycle-started-clusters

Conversation

@andrewnester
Contributor

Changes

Adds lifecycle.started support for clusters in the direct deployment engine, mirroring the same feature for apps (#4672).

Why

Without this field, clusters defined in a bundle are always left in whatever state the API puts them in after creation.
Users have no way to declare "ensure this cluster is running after every deploy."

lifecycle.started: true guarantees the cluster is RUNNING after bundle deploy.
lifecycle.started: false creates the cluster but immediately terminates it, and subsequent deploys that detect drift (e.g., someone started the cluster manually) will stop it again.

Note: WaitAfterCreate always waits for RUNNING first, since real clusters start in the PENDING state and must be polled. For started=false, we wait for RUNNING and then terminate; this avoids races with the API, which would reject a terminate on a still-pending cluster.
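
For illustration, here is a minimal sketch of the wait-then-terminate flow described above, written against the databricks-sdk-go clusters API. The helper name, the polling interval, and the surrounding wiring are assumptions for this example, not the PR's actual code:

```go
package dresources

import (
	"context"
	"fmt"
	"time"

	"github.com/databricks/databricks-sdk-go"
	"github.com/databricks/databricks-sdk-go/service/compute"
)

// waitThenSettle waits for a freshly created cluster to reach RUNNING, then
// terminates it if the bundle declared lifecycle.started: false. Waiting first
// matters because a new cluster sits in PENDING, and the API rejects a
// terminate while the cluster is still starting up.
func waitThenSettle(ctx context.Context, w *databricks.WorkspaceClient, id string, desiredStarted bool) error {
	for {
		c, err := w.Clusters.GetByClusterId(ctx, id)
		if err != nil {
			return err
		}
		switch c.State {
		case compute.StateRunning:
			if desiredStarted {
				return nil // started=true: RUNNING is the desired end state
			}
			// started=false: creation left the cluster running, so stop it now.
			_, err := w.Clusters.Delete(ctx, compute.DeleteCluster{ClusterId: id})
			return err
		case compute.StateTerminated, compute.StateError:
			return fmt.Errorf("cluster %s reached state %s while waiting for RUNNING", id, c.State)
		}
		time.Sleep(10 * time.Second)
	}
}
```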

Tests

Added acceptance tests

@github-actions
Contributor

github-actions Bot commented Apr 30, 2026

Approval status: pending

/acceptance/bundle/ - needs approval

22 files changed
Suggested: @denik
Also eligible: @shreyas-goenka, @pietern, @janniklasrose, @lennartkats-db, @anton-107

/bundle/ - needs approval

8 files changed
Suggested: @denik
Also eligible: @shreyas-goenka, @pietern, @janniklasrose, @lennartkats-db, @anton-107

General files (require maintainer)

Files: libs/testserver/clusters.go, libs/testserver/handlers.go
Based on git history:

  • @denik -- recent work in bundle/deployplan/, bundle/direct/dresources/, libs/testserver/

Any maintainer (@anton-107, @denik, @pietern, @shreyas-goenka, @simonfaltum, @renaudhartert-db) can approve all areas.
See OWNERS for ownership rules.

@andrewnester force-pushed the feat/lifecycle-started-clusters branch from d14c9b4 to 75e051f May 1, 2026 12:22
@andrewnester temporarily deployed to test-trigger-is May 1, 2026 12:23 with GitHub Actions
Comment threads on bundle/direct/dresources/cluster.go (outdated)
@andrewnester requested a review from denik May 4, 2026 12:40
@andrewnester temporarily deployed to test-trigger-is May 4, 2026 12:40 with GitHub Actions
type ClusterState struct {
compute.ClusterSpec

ClusterId string `json:"cluster_id,omitempty"`
Contributor

Do we really need this? Surprising that the direct deployment framework does not directly provide the id to WaitAfterCreate.

Contributor Author

Yes, we do need this to pass the ClusterId. WaitAfterCreate accepts the state; in many cases the Id is already in the state, but not for clusters, where the Id/ClusterId is not part of compute.ClusterSpec.
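
For context, a minimal sketch of what that looks like, assuming the SDK's WaitGetClusterRunning helper; the method signature here is illustrative, not necessarily the engine's real interface:

```go
// The engine hands WaitAfterCreate only the typed state, so the ID has to live
// in ClusterState itself: compute.ClusterSpec has no cluster_id field.
func (r *ResourceCluster) WaitAfterCreate(ctx context.Context, state *ClusterState) error {
	_, err := r.client.Clusters.WaitGetClusterRunning(ctx, state.ClusterId, 20*time.Minute, nil)
	return err
}
```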

// ClusterRemote extends compute.ClusterDetails with a synthetic Lifecycle field so that
// RemoteType satisfies TestRemoteSuperset (every field in ClusterState exists in ClusterRemote).
// Lifecycle.Started is populated by DoRead from the cluster's running state.
type ClusterRemote struct {
Contributor

can we use the same struct for ClusterRemote and ClusterState?
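
One concrete reason a single struct is awkward: the two sides wrap different SDK types. A sketch of the shape the doc comment above implies (the Lifecycle fields here are assumptions inferred from that comment, not the PR's exact code):

```go
// Desired side: what the bundle declares; compute.ClusterSpec is the create payload.
type ClusterState struct {
	compute.ClusterSpec
	ClusterId string    `json:"cluster_id,omitempty"`
	Lifecycle Lifecycle `json:"lifecycle,omitempty"`
}

// Observed side: what the API reports; compute.ClusterDetails carries read-only
// fields (state, driver info, and so on) that never appear in a spec. The
// synthetic Lifecycle field exists so every ClusterState field has a remote
// counterpart, which is what TestRemoteSuperset checks.
type ClusterRemote struct {
	compute.ClusterDetails
	Lifecycle Lifecycle `json:"lifecycle,omitempty"`
}

type Lifecycle struct {
	Started bool `json:"started,omitempty"`
}
```

DoRead can then derive the synthetic field from the live state, e.g. remote.Lifecycle.Started = (details.State == compute.StateRunning).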

// lifecycle.started=true: fire Start; WaitAfterUpdate polls for RUNNING.
_, err := r.client.Clusters.Start(ctx, compute.StartCluster{ClusterId: id})
return nil, err
} else if !desiredStarted && alreadyRunning {
Contributor

Should we also call delete on other states? Like PENDING | RESTARTING | RESIZING | UNKNOWN | ERROR? And poll waiting for the state transition if the state is TERMINATING?

Contributor

Do we guarantee TERMINATED if started = false?

Contributor Author

It's a good question. I believe we should only explicitly manipulate the cluster if it's in a known good state remotely; if it's not, it's better not to.
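
A sketch of that conservative policy (illustrative; the PR's actual branching may differ): only the two unambiguous settled states trigger an action, and anything transitional or unhealthy is left untouched.

```go
switch remote.State {
case compute.StateTerminated:
	if desiredStarted {
		// lifecycle.started=true: fire Start; WaitAfterUpdate polls for RUNNING.
		_, err := r.client.Clusters.Start(ctx, compute.StartCluster{ClusterId: id})
		return err
	}
case compute.StateRunning:
	if !desiredStarted {
		_, err := r.client.Clusters.Delete(ctx, compute.DeleteCluster{ClusterId: id})
		return err
	}
default:
	// PENDING, RESTARTING, RESIZING, TERMINATING, ERROR, UNKNOWN: the cluster
	// is mid-transition or in a bad state, so forcing a start or terminate
	// here would race the API; leave it alone.
}
return nil
```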

Comment thread bundle/direct/dresources/cluster.go
// cluster_id is stored in state for informational purposes only; it must not appear in plan output.
// PrepareState never sets it (input has no ID), so after the first deploy ch.Old="<id>" while ch.New="",
// causing a spurious Skip entry. Drop it unconditionally so it never pollutes plan JSON.
if path == "cluster_id" {
Contributor

why do we need this?

Contributor Author

Why do we need cluster_id, or why do we need to skip it? We use cluster_id later on to wait for the cluster status. We need to skip it because cluster_id is not part of the bundle config, and we want a clean plan as a result anyway.
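
For illustration, the kind of filter this implies; the surrounding diff-walk types (FieldChange, changes) are hypothetical stand-ins, not the PR's actual plumbing:

```go
// Drop cluster_id from the computed changes before rendering the plan.
// PrepareState never sets it on the desired side, so after the first deploy
// the old value is "<id>" and the new value is "", which would otherwise
// surface as a spurious Skip entry in the plan JSON.
filtered := make(map[string]FieldChange, len(changes))
for path, ch := range changes {
	if path == "cluster_id" {
		continue // informational only: used to wait on cluster status, never user config
	}
	filtered[path] = ch
}
```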

