This document explains how to create a managed instance group (MIG) that uses Flex-start virtual machine (VM) instances. Flex-start VMs run uninterrupted for up to seven days and help you obtain high-demand resources, such as GPUs, at a discounted price. These features make Flex-start VMs a cost-effective solution for running short-duration workloads, such as model fine-tuning and batch inference workloads.
The steps in this document explain how to create a MIG that creates individual Flex-start VMs as resources become available. Thus, the MIG might initially create only a portion of the requested VMs, and then add the remaining VMs later as capacity permits. If you want to create Flex-start VMs all at once in a MIG, then see instead About MIG resize requests.
For other methods that you can use to create MIGs, see Basic scenarios for creating a MIG.
Before you begin
- Verify that you have sufficient quota for the resources that you want to request. For more information, see Allocation quotas.
-
If you haven't already, set up authentication.
Authentication verifies your identity for access to Cloud de Confiance by S3NS services and APIs. To run
code or samples from a local development environment, you can authenticate to
Compute Engine by selecting one of the following options:
Select the tab for how you plan to use the samples on this page:
Console
When you use the Cloud de Confiance console to access Cloud de Confiance by S3NS services and APIs, you don't need to set up authentication.
gcloud
-
Install the Google Cloud CLI, and then sign in to the gcloud CLI with your federated identity. After signing in, initialize the Google Cloud CLI by running the following command:
gcloud init - Set a default region and zone.
REST
To use the REST API samples on this page in a local development environment, you use the credentials you provide to the gcloud CLI.
Install the Google Cloud CLI, and then sign in to the gcloud CLI with your federated identity.
For more information, see Authenticate for using REST in the Cloud de Confiance authentication documentation.
-
Required roles
To get the permissions that
you need to create MIGs that use Flex-start VMs,
ask your administrator to grant you the
Compute Instance Admin (v1) (roles/compute.instanceAdmin.v1) IAM role on the project.
For more information about granting roles, see Manage access to projects, folders, and organizations.
This predefined role contains the permissions required to create MIGs that use Flex-start VMs. To see the exact permissions that are required, expand the Required permissions section:
Required permissions
The following permissions are required to create MIGs that use Flex-start VMs:
-
To create an instance template:
compute.instanceTemplates.createon the project -
To create a MIG:
compute.instanceGroupManagers.createon the project
You might also be able to get these permissions with custom roles or other predefined roles.
Create a MIG that uses Flex-start VMs
To create a MIG that uses Flex-start VMs, you must complete the following steps:
Create an instance template that's configured for creating Flex-start VMs
You must create an instance template that's configured for creating Flex-start VMs, as described in this section. After you create the template, you use it to create the MIG.
To create an instance template that's configured for creating Flex-start VMs, select one of the following options:
Console
In the Cloud de Confiance console, go to the Instance templates page.
Click Create instance template. The Create an instance template page appears.
In the Name field, enter a name for the instance template.
In the Location section, select the type of instance template that you want to create:
For a regional instance template, select Regional, and then select the region in which to create your template. Based on the machine type that you want your Flex-start VMs to use, specify a supported region.
For a global instance template, select Global.
In the Machine configuration section, select a supported machine series for Flex-start VMs.
In the Provisioning model section, do the following:
In the VM provisioning model list, select Flex-start.
To set a run duration for the VMs that are created through the instance template, in the Enter number of hours field, enter the number of hours that the VMs run for. The value must be between one hour (
1) and seven days (168). When the VMs reach the end of their run duration, Compute Engine deletes each VM individually relative to its own creation time.
Optional: To change the default value boot disk type or image, in the Boot disk section, click Change. Then, follow the prompts to change the boot disk.
Click Create.
gcloud
To create an instance template that's configured for creating
Flex-start VMs, use the
gcloud compute instance-templates create command.
The following command creates a regional instance template. If you want to
create a global instance template, then use the same command without the
--instance-template-region flag.
gcloud compute instance-templates create INSTANCE_TEMPLATE_NAME \
--image-project=IMAGE_PROJECT \
--image-family=IMAGE_FAMILY \
--instance-template-region=REGION \
--instance-termination-action=DELETE \
--machine-type=MACHINE_TYPE \
--maintenance-policy=TERMINATE \
--max-run-duration=RUN_DURATION \
--provisioning-model=FLEX_START \
--reservation-affinity=none
Replace the following:
INSTANCE_TEMPLATE_NAME: the name of the instance template to create.IMAGE_PROJECT: the image project that contains the image; for example,debian-cloud. For more information about the supported image projects, see Public images.IMAGE_FAMILY: an image family, which specifies the most recent, non-deprecated OS image. For example, if you specifydebian-12, you use the latest version in the Debian 12 image family. For more information about using image families, see Image families best practices.REGION: the region where to create the instance template. Based on the machine type that you want the Flex-start VMs to use, specify a supported region.MACHINE_TYPE: the GPU machine type or H4D machine type to use for the Flex-start VMs. If you specify an N1 machine type, then you must include the--acceleratorflag to define the number and type of GPUs to attach to your VMs.RUN_DURATION: the duration you want the requested VMs to run. When the VMs reach the end of their run duration, Compute Engine deletes each VM individually relative to its own creation time. You must format the run duration as the number of days, hours, minutes, or seconds followed byd,h,m, andsrespectively. For example, specify30mfor 30 minutes or1d2h3m4sfor one day, two hours, three minutes, and four seconds. The value must be between 10 minutes and seven days.
REST
To create an instance template that's configured for creating
Flex-start VMs, make one of the following POST requests:
To create a regional instance template:
regionInstanceTemplates.insertmethodTo create a global instance template:
instanceTemplates.insertmethod
For example, to create a regional instance template, make a request as follows:
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION/instanceTemplates
{
"name": "INSTANCE_TEMPLATE_NAME",
"properties": {
"disks": [
{
"boot": true,
"initializeParams": {
"sourceImage": "projects/IMAGE_PROJECT/global/images/IMAGE"
}
}
],
"machineType": "MACHINE_TYPE",
"networkInterfaces": [
{
"network": "global/networks/default"
}
],
"reservationAffinity": {
"consumeReservationType": "NO_RESERVATION"
},
"scheduling": {
"instanceTerminationAction": "DELETE",
"maxRunDuration": {
"seconds": RUN_DURATION
},
"onHostMaintenance": "TERMINATE",
"provisioningModel": "FLEX_START"
}
}
}
Replace the following:
PROJECT_ID: the ID of the project in which to create the instance template.REGION: the region where to create the instance template. Based on the machine type that you want the Flex-start VMs to use, specify a supported region.INSTANCE_TEMPLATE_NAME: the name of the instance template.IMAGE_PROJECT: the image project that contains the image; for example,debian-cloud. For more information about the supported image projects, see Public images.IMAGE: specify one of the following:A specific version of the OS image—for example,
debian-12-bookworm-v20240617.An image family, which must be formatted as
family/IMAGE_FAMILY. This value specifies the most recent, non-deprecated OS image. For example, if you specifyfamily/debian-12, the latest version in the Debian 12 image family is used. For more information about using image families, see Image families best practices.
MACHINE_TYPE: the GPU machine type to use for the VMs. If you specify an N1 machine type, then you must include theguestAcceleratorsfield to define the number and type of GPUs to attach to your VMs.RUN_DURATION: the duration, in seconds, you want the requested VMs to run. When the VMs reach the end of their run duration, Compute Engine deletes each VM individually relative to its own creation time. The run duration must be between600, which is 600 seconds (10 minutes), and604800, which is 604,800 seconds (seven days).
After you create the instance template, you can view the template's details to review its properties.
Create a regional or zonal MIG
Create a regional or zonal MIG as described in this section. You must not configure autoscaling and must turn off repairs. If your requested resources are unavailable, then Compute Engine keeps attempting to allocate resources until the request succeeds, you reduce the number of VMs in the MIG, or you delete your MIG.
To create regional or zonal MIG, select one of the following options:
Console
In the Cloud de Confiance console, go to the Instance groups page.
Click Create instance group. The Create instance group page appears.
In the Name field, enter a name for the MIG.
In the Instance template list, select the instance template that you created in the previous section.
In the Number of instances field, enter the number of Flex-start VMs to create in the MIG. If your workload requires specific VM names, then enter
0to not create any VMs. After you create the MIG, add VMs with specific names to it.In the Location section, specify whether you want to create a zonal or a regional MIG as follows:
To create a zonal MIG, select Single zone. Or, to create a regional MIG, select Multiple zones.
Select the Region and Zones for the MIG.
If you're creating a regional MIG, then do the following:
In the Target distribution shape field, select Any single zone.
In the dialog that appears, click Disable instance redistribution.
To delete the autoscaling configuration, in the Autoscaling section, do the following:
Click Configure autoscaling.
In the Autoscaling mode list, click Delete autoscaling configuration.
In the confirmation dialog, click Delete.
To turn off repairs, in the Action on failure section, in the Default action on failure list, select No action.
Click Create.
gcloud
To create a MIG, use the
gcloud compute instance-groups managed create command.
Based on the type of MIG that you want to create, include the following flags in the command:
To create a zonal MIG, include the
--zoneflag:gcloud compute instance-groups managed create INSTANCE_GROUP_NAME \ --default-action-on-vm-failure=do-nothing \ --size=SIZE \ --template=INSTANCE_TEMPLATE_URL \ --zone=ZONETo create a regional MIG, include the
--regionand--target-distribution-shapeflags:gcloud compute instance-groups managed create INSTANCE_GROUP_NAME \ --default-action-on-vm-failure=do-nothing \ --size=SIZE \ --template=INSTANCE_TEMPLATE_URL \ --region=REGION \ --target-distribution-shape=SHAPE
Replace the following:
INSTANCE_GROUP_NAME: the name of the MIG.INSTANCE_TEMPLATE_URL: the URL of the instance template that you created in the previous section. The URL can contain either the ID or the name of the instance template. Specify one of the following values:For a regional instance template:
projects/PROJECT_ID/regions/REGION/instanceTemplates/INSTANCE_TEMPLATE_IDFor a global instance template:
INSTANCE_TEMPLATE_ID
SIZE: the number of Flex-start VMs to create in the MIG. If your workload requires specific VM names, then specify0to not create any VMs. After you create the MIG, add VMs with specific names to it.ZONE: the zone where to create the zonal MIG. If you use a regional instance template to create the MIG, then you must specify a zone within the same region as the template.REGION: the region where to create the regional MIG. If you use a regional instance template to create the MIG, then you must specify the same region as the template's region.SHAPE: how your MIG distributes the Flex-start VMs across zones in your regional MIG. Specify one of the following values:To create VMs in multiple zones based on availability:
ANYTo create VMs in a single zone based on availability:
ANY_SINGLE_ZONE
For more information, see Regional MIG target distribution shape.
REST
To create a MIG, make a POST request to one of the following methods:
To create a zonal MIG, make the following request to the
instanceGroupManagers.insertmethod:POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/instanceGroupManagers { "name": "INSTANCE_GROUP_NAME", "versions": [ { "instanceTemplate": "INSTANCE_TEMPLATE_URL" } ], "instanceLifecyclePolicy": { "defaultActionOnFailure": "DO_NOTHING" }, "targetSize": SIZE }To create a regional MIG, make the following request to the
regionInstanceGroupManagers.insertmethod:POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION/instanceGroupManagers { "name": "INSTANCE_GROUP_NAME", "instanceTemplate": "INSTANCE_TEMPLATE_URL", "instanceLifecyclePolicy": { "defaultActionOnFailure": "DO_NOTHING" }, "targetSize": SIZE, "distributionPolicy": { "targetShape": "SHAPE" } }
Replace the following:
PROJECT_ID: the ID of the project in which to create the regional or zonal MIG.ZONE: the zone where to create the zonal MIG. If you use a regional instance template to create the MIG, then you must specify a zone within the same region as the template.REGION: the region where to create the regional MIG. If you use a regional instance template to create the MIG, then you must specify the same region as the template's region.INSTANCE_GROUP_NAME: the name of the MIG.INSTANCE_TEMPLATE_URL: the URL of the instance template that you created in the previous section. The URL can contain either the ID or the name of the instance template. Specify one of the following values:For a regional instance template:
projects/PROJECT_ID/regions/REGION/instanceTemplates/INSTANCE_TEMPLATE_IDFor a global instance template:
INSTANCE_TEMPLATE_ID
SIZE: the number of Flex-start VMs to create in the MIG. If your workload requires specific VM names, then specify0to not create any VMs. After you create the MIG, add VMs with specific names to it.SHAPE: how your MIG distributes the Flex-start VMs across zones in your regional MIG. Specify one of the following values:To create VMs in multiple zones based on availability:
ANYTo create VMs in a single zone based on availability:
ANY_SINGLE_ZONE
For more information, see Regional MIG target distribution shape.
What's next
Learn how to view info about MIGs and managed VMs.
Learn how to view the actual and forecasted usage of your VMs and GPUs.