Create a reservation for a single project
Stay organized with collections
Save and categorize content based on your preferences.
This document explains how to create a single-project reservation, which can be consumed only by virtual machine (VM) instances in the same project. To learn more about reservations, see Reservations of Compute Engine zonal resources.
For other methods of creating reservations, see instead the following pages:
To create a reservation that can be used by multiple projects, see Create a shared reservation.
To create a reservation when purchasing a resource-based commitment, see Purchase commitments with attached reservations. Commitments provide discounts, known as committed use discounts (CUDs), on your on-demand resource costs in exchange for purchasing a minimum level of resources or for spending a minimum amount.
Before you begin
- Review the requirements and restrictions for reservations.
-
If you haven't already, set up authentication.
Authentication verifies your identity for access to Google Cloud services and APIs. To run
code or samples from a local development environment, you can authenticate to
Compute Engine by selecting one of the following options:
Select the tab for how you plan to use the samples on this page:
Console
When you use the Google Cloud console to access Google Cloud services and APIs, you don't need to set up authentication.
gcloud
-
Install the Google Cloud CLI. After installation, initialize the Google Cloud CLI by running the following command:
gcloudinit
If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
- Set a default region and zone.
Terraform
To use the Terraform samples on this page in a local development environment, install and initialize the gcloud CLI, and then set up Application Default Credentials with your user credentials.
Install the Google Cloud CLI.
If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
If you're using a local shell, then create local authentication credentials for your user account:
gcloudauthapplication-defaultlogin
You don't need to do this if you're using Cloud Shell.
If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.
For more information, see Set up authentication for a local development environment.
REST
To use the REST API samples on this page in a local development environment, you use the credentials you provide to the gcloud CLI.
Install the Google Cloud CLI.
If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
For more information, see Authenticate for using REST in the Google Cloud authentication documentation.
-
Required roles
To get the permissions that
you need to create single-project reservations,
ask your administrator to grant you the
Compute Admin (roles/compute.admin)
IAM role on the project.
For more information about granting roles, see Manage access to projects, folders, and organizations.
This predefined role contains the permissions required to create single-project reservations. To see the exact permissions that are required, expand the Required permissions section:
Required permissions
The following permissions are required to create single-project reservations:
-
compute.reservations.createon the project -
To specify an instance template:
compute.instanceTemplates.useReadOnlyon the instance template
You might also be able to get these permissions with custom roles or other predefined roles.
Create a single-project reservation
To consume a reservation, a VM must have properties that exactly match that reservation. To specify the properties of the VMs that you want to reserve, select one of the following sections in this document:
Recommended: Specify an instance template
This section explains how to use an instance template to define the properties of a reservation. By using an instance template, you can define the properties of a reservation and the VMs that can consume the reservation in the same place.
-
This section explains how to use an existing VM to define the properties of a reservation. By using the properties of an existing VM, you can consume the reservation by creating VMs with properties that match the reference VM.
-
This section explains how to directly define the properties of a reservation. This method requires you to manually ensure that the properties of your VMs and reservations exactly match—any mismatched properties prevent consumption.
By default, a reservation can be automatically consumed by any VMs with properties that match it. If you want to control reservation consumption, do one or more of the following:
Create a reservation that can only be consumed by VMs that specifically target it, as explained in the following sections in this document. Then, configure VMs to consume that specific reservation.
Additionally, you can specify a compact placement policy when creating a single-project specific reservation. A compact placement policy specifies that VMs are to be located as close to each other as possible to reduce network latency.
Specify an instance template
To avoid errors when creating a reservation by specifying an instance template, make sure of the following:
Create your reservation in the same region and zone as the resources within the instance template. Any regional or zonal resources specified in an instance template—such as a machine type or a Persistent Disk volume—restrict the use of the template to the locations where those resources exist. For example, if your instance template specifies an existing Persistent Disk volume in zone
us-central1-a, then you can only create your reservation in the same zone. To check if an existing template specifies any resources that bind the template to a specific region or zone, view the details of the instance template and look for references to regional or zonal resources inside of it.If the instance template specifies a compact placement policy, then make sure that the template specifies a supported machine type for compact placement policies. Otherwise, creating the reservation fails.
To create a single-project reservation by specifying an instance template, select one of the following options:
Console
In the Google Cloud console, go to the Reservations page.
On the On-demand reservations tab (default), click Create reservation.
The Create a reservation page opens.
For Name, enter a name for your reservation.
For Region and Zone, select where you want to reserve resources.
In the Share type section, select Local if it isn't already selected.
Optional: To allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, in the Google Cloud services section, select Share reservation.
In the Use with VM instance section, select one of the following options:
To allow matching VMs to automatically consume this reservation, select Use reservation automatically if it isn't already selected.
To consume this reservation's resources only when creating matching VMs that specifically target this reservation by name, select Select specific reservation.
For Number of VM instances, enter the number of VMs that you want to reserve.
In the Machine configuration section, do the following:
To specify the properties of your VMs from an existing instance template, select Use instance template.
In the Instance template field, select the instance template of your choice. If you select a regional instance template, then you can only reserve resources within the region of the instance template.
In the Auto-delete section, you can enable the auto-delete option to let Compute Engine automatically delete the reservation at a specific date and time. Automatically deleting reservations can be useful to avoid unnecessary charges when you stop consuming the reservation.
To create the reservation, click Create.
The Reservations page opens. Creating the single-project reservation might take up to a minute to complete.
gcloud
To create a single-project reservation, use the
gcloud compute reservations create command.
To create a single-project reservation by specifying an instance template and without including any optional flags, run the following command:
gcloud compute reservations create RESERVATION_NAME \
--source-instance-template=projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME \
--vm-count=NUMBER_OF_VMS \
--zone=ZONE
Replace the following:
RESERVATION_NAME: the name of the reservation to create.PROJECT_ID: the ID of the project where you want to reserve resources and where the instance template exists.LOCATION: the location of the instance template. Specify one of the following values:For a global instance template:
global.For a regional instance template:
regions/REGION. ReplaceREGIONwith the region where the instance template is located. If you specify a regional instance template, then you can only reserve VMs within the same region as the template's region.
INSTANCE_TEMPLATE_NAME: the name of an existing instance template. If the instance template specifies an A3 Mega, A3 High, or A3 Edge machine type or a compact placement policy, then you must include the--require-specific-reservationflag. This indicates that only VMs that specifically target the reservation can consume it. For more information, see Consume VMs from a specific reservation.NUMBER_OF_VMS: the number of VMs to reserve.ZONE: the zone in which to reserve resources.
For example, to create a reservation for ten VMs in zone us-central1-a by
specifying a global instance template, run the following command:
gcloud compute reservations create my-reservation \
--source-instance-template=projects/example-project/global/example-instance-template \
--vm-count=10 \
--zone=us-central1-a
Optionally, you can do one or more of the following:
To specify that only VMs that specifically target this reservation can consume it, include the
--require-specific-reservationflag.gcloud compute reservations create RESERVATION_NAME \ --source-instance-template=projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME \ --require-specific-reservation \ --vm-count=NUMBER_OF_VMS \ --zone=ZONETo allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, include the
--reservation-sharing-policyflag set toALLOW_ALL.gcloud compute reservations create RESERVATION_NAME \ --source-instance-template=projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME \ --reservation-sharing-policy=ALLOW_ALL \ --vm-count=NUMBER_OF_VMS \ --zone=ZONETo enable Compute Engine to automatically delete the reservation, select one of the following methods:
To delete the reservation at a specific date and time, use the
gcloud beta compute reservations createcommand with the--delete-at-timeflag.gcloud beta compute reservations create RESERVATION_NAME \ --delete-at-time=DELETE_AT_TIME \ --source-instance-template=projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace
DELETE_AT_TIMEwith a date and time formatted as an RFC 3339 timestamp, which must be as follows:none YYYY-MM-DDTHH:MM:SSOFFSETReplace the following:
YYYY-MM-DD: a date formatted as a 4-digit year, 2-digit month, and a 2-digit day of the month, separated by hyphens (-).HH:MM:SS: a time formatted as a 2-digit hour using a 24-hour time, 2-digit minutes, and 2-digit seconds, separated by colons (:).OFFSET: the time zone formatted as an offset of Coordinated Universal Time (UTC). For example, to use the Pacific Standard Time (PST), specify-08:00. Alternatively, to use no offset, specifyZ.
To delete the reservation after a specific duration, use the
gcloud beta compute reservations createcommand with the--delete-after-durationflag.gcloud beta compute reservations create RESERVATION_NAME \ --delete-after-duration=DELETE_AFTER_DURATION \ --source-instance-template=projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace
DELETE_AFTER_DURATIONwith a duration in days, hours, minutes, or seconds. For example, specify30mfor 30 minutes, or1d2h3m4sfor 1 day, 2 hours, 3 minutes, and 4 seconds.
Go
import(
"context"
"fmt"
"io"
compute"cloud.google.com/go/compute/apiv1"
computepb"cloud.google.com/go/compute/apiv1/computepb"
"google.golang.org/protobuf/proto"
)
// Creates the reservation from given template in particular zone
funccreateReservation(wio.Writer,projectID,zone,reservationName,sourceTemplatestring)error{
// projectID := "your_project_id"
// zone := "us-west3-a"
// reservationName := "your_reservation_name"
// template: existing template path. Following formats are allowed:
// - projects/{project_id}/global/instanceTemplates/{template_name}
// - projects/{project_id}/regions/{region}/instanceTemplates/{template_name}
// - https://www.googleapis.com/compute/v1/projects/{project_id}/global/instanceTemplates/instanceTemplate
// - https://www.googleapis.com/compute/v1/projects/{project_id}/regions/{region}/instanceTemplates/instanceTemplate
ctx:=context.Background()
reservationsClient,err:=compute.NewReservationsRESTClient (ctx)
iferr!=nil{
returnerr
}
deferreservationsClient.Close()
req:=&computepb.InsertReservationRequest{
Project:projectID,
ReservationResource:&computepb.Reservation{
Name:proto.String(reservationName),
Zone:proto.String(zone),
SpecificReservation:&computepb.AllocationSpecificSKUReservation{
Count:proto.Int64(2),
SourceInstanceTemplate:proto.String(sourceTemplate),
},
},
Zone:zone,
}
op,err:=reservationsClient.Insert(ctx,req)
iferr!=nil{
returnfmt.Errorf("unable to create reservation: %w",err)
}
iferr=op.Wait(ctx);err!=nil{
returnfmt.Errorf("unable to wait for the operation: %w",err)
}
fmt.Fprintf(w,"Reservation created\n")
returnnil
}
Java
importcom.google.cloud.compute.v1.AllocationSpecificSKUReservation ;
importcom.google.cloud.compute.v1.Operation ;
importcom.google.cloud.compute.v1.Reservation ;
importcom.google.cloud.compute.v1.ReservationsClient ;
importjava.io.IOException;
importjava.util.concurrent.ExecutionException;
importjava.util.concurrent.TimeUnit;
importjava.util.concurrent.TimeoutException;
publicclass CreateReservationForInstanceTemplate{
publicstaticvoidmain(String[]args)
throwsIOException,ExecutionException,InterruptedException,TimeoutException{
// TODO(developer): Replace these variables before running the sample.
// Project ID or project number of the Cloud project you want to use.
StringprojectId="YOUR_PROJECT_ID";
// Name of the zone in which you want to create the reservation.
Stringzone="us-central1-a";
// Name of the reservation you want to create.
StringreservationName="YOUR_RESERVATION_NAME";
// The number of virtual machines you want to create.
intnumberOfVms=3;
// The URI of the instance template with GLOBAL location
// to be used for creating the reservation.
StringinstanceTemplateUri=
"projects/YOUR_PROJECT_ID/global/instanceTemplates/YOUR_INSTANCE_TEMPLATE_NAME";
// The URI of the instance template with REGIONAL location
// to be used for creating the reservation. For us-central1 region in this case.
// String instanceTemplateUri =
// "projects/YOUR_PROJECT_ID/regions/us-central1/instanceTemplates/YOUR_INSTANCE_TEMPLATE_NAME"
createReservationForInstanceTemplate(
projectId,reservationName,instanceTemplateUri,numberOfVms,zone);
}
// Creates a reservation in a project for the instance template.
publicstaticReservation createReservationForInstanceTemplate(
StringprojectId,StringreservationName,StringinstanceTemplateUri,
intnumberOfVms,Stringzone)
throwsIOException,ExecutionException,InterruptedException,TimeoutException{
// Initialize client that will be used to send requests. This client only needs to be created
// once, and can be reused for multiple requests.
try(ReservationsClient reservationsClient=ReservationsClient .create()){
Reservation reservation=
Reservation .newBuilder()
.setName(reservationName)
.setZone(zone)
.setSpecificReservation (
AllocationSpecificSKUReservation .newBuilder()
// Set the number of instances
.setCount(numberOfVms)
// Set the instance template to be used for creating the reservation.
.setSourceInstanceTemplate(instanceTemplateUri)
.build())
.build();
Operation response=
reservationsClient.insertAsync(projectId,zone,reservation).get(3,TimeUnit.MINUTES);
if(response.hasError ()){
returnnull;
}
returnreservationsClient.get(projectId,zone,reservationName);
}
}
}Node.js
// Import the Compute library
constcomputeLib=require('@google-cloud/compute');
constcompute=computeLib.protos.google.cloud.compute.v1;
// Instantiate a reservationsClient
constreservationsClient=newcomputeLib.ReservationsClient ();
// Instantiate a zoneOperationsClient
constzoneOperationsClient=newcomputeLib.ZoneOperationsClient ();
/**
* TODO(developer): Update/uncomment these variables before running the sample.
*/
// The ID of the project where you want to reserve resources and where the instance template exists.
constprojectId=awaitreservationsClient.getProjectId();
// The zone in which to reserve resources.
constzone='us-central1-a';
// The name of the reservation to create.
// reservationName = 'reservation-01';
// The number of VMs to reserve.
constvmsNumber=3;
/**
* The name of an existing instance template.
* TODO(developer): Uncomment and update instanceTemplateName before running the sample.
*/
// const instanceTemplateName = 'pernament-region-template-name';
/**
* // The location of the instance template.
* TODO(developer): Uncomment the `location` variable depending on which template you want to use.
*/
// The location for a regional instance template: regions/{region}. Replace region with the region where the instance template is located.
// If you specify a regional instance template, then you can only reserve VMs within the same region as the template's region.
// const location = `regions/${zone.slice(0, -2)}`;
// The location for a global instance template.
// const location = 'global';
asyncfunctioncallCreateComputeReservationInstanceTemplate(){
// Create reservation for 3 VMs in zone us-central1-a by specifying a instance template.
constspecificReservation=newcompute.AllocationSpecificSKUReservation ({
count:vmsNumber,
sourceInstanceTemplate:`projects/${projectId}/${location}/instanceTemplates/${instanceTemplateName}`,
});
// Create a reservation.
constreservation=newcompute.Reservation ({
name:reservationName,
specificReservation,
// To specify that only VMs that specifically target this reservation can consume it,
// set specificReservationRequired field to true.
specificReservationRequired:true,
});
const[response]=awaitreservationsClient.insert({
project:projectId,
reservationResource:reservation,
zone,
});
letoperation=response.latestResponse;
// Wait for the create reservation operation to complete.
while(operation.status!=='DONE'){
[operation]=awaitzoneOperationsClient.wait({
operation:operation.name,
project:projectId,
zone:operation.zone.split('/').pop(),
});
}
console.log(`Reservation: ${reservationName} created.`);
}
awaitcallCreateComputeReservationInstanceTemplate();Python
from__future__import annotations
importsys
fromtypingimport Any
fromgoogle.api_core.extended_operationimport ExtendedOperation
fromgoogle.cloudimport compute_v1
defwait_for_extended_operation(
operation: ExtendedOperation, verbose_name: str = "operation", timeout: int = 300
) -> Any:
"""
Waits for the extended (long-running) operation to complete.
If the operation is successful, it will return its result.
If the operation ends with an error, an exception will be raised.
If there were any warnings during the execution of the operation
they will be printed to sys.stderr.
Args:
operation: a long-running operation you want to wait on.
verbose_name: (optional) a more verbose name of the operation,
used only during error and warning reporting.
timeout: how long (in seconds) to wait for operation to finish.
If None, wait indefinitely.
Returns:
Whatever the operation.result() returns.
Raises:
This method will raise the exception received from `operation.exception()`
or RuntimeError if there is no exception set, but there is an `error_code`
set for the `operation`.
In case of an operation taking longer than `timeout` seconds to complete,
a `concurrent.futures.TimeoutError` will be raised.
"""
result = operation.result(timeout=timeout)
if operation.error_code:
print(
f"Error during {verbose_name}: [Code: {operation.error_code}]: {operation.error_message}",
file=sys.stderr,
flush=True,
)
print(f"Operation ID: {operation.name}", file=sys.stderr, flush=True)
raise operation.exception() or RuntimeError(operation.error_message)
if operation.warnings:
print(f"Warnings during {verbose_name}:\n", file=sys.stderr, flush=True)
for warning in operation.warnings:
print(f" - {warning.code}: {warning.message}", file=sys.stderr, flush=True)
return result
defcreate_reservation_from_template(
project_id: str, reservation_name: str, template: str
) -> compute_v1.Reservation:
"""
Create a new reservation based on an existing template.
Args:
project_id: project ID or project number of the Cloud project you use.
reservation_name: the name of new reservation.
template: existing template path. Following formats are allowed:
- projects/{project_id}/global/instanceTemplates/{template_name}
- projects/{project_id}/regions/{region}/instanceTemplates/{template_name}
- https://www.googleapis.com/compute/v1/projects/{project_id}/global/instanceTemplates/instanceTemplate
- https://www.googleapis.com/compute/v1/projects/{project_id}/regions/{region}/instanceTemplates/instanceTemplate
Returns:
Reservation object that represents the new reservation.
"""
reservations_client = compute_v1 .ReservationsClient ()
request = compute_v1 .InsertReservationRequest ()
request.project = project_id
request.zone = "us-central1-a"
specific_reservation = compute_v1 .AllocationSpecificSKUReservation ()
specific_reservation.count = 1
specific_reservation.source_instance_template = template
reservation = compute_v1 .Reservation ()
reservation.name = reservation_name
reservation.specific_reservation = specific_reservation
request.reservation_resource = reservation
operation = reservations_client.insert (request)
wait_for_extended_operation(operation, "Reservation creation")
return reservations_client.get (
project=project_id, zone="us-central1-a", reservation=reservation_name
)
REST
To create a single-project reservation, make a POST request to the
reservations.insert method.
To create a single-project reservation by specifying an instance template
and without including any optional flags, make the following POST request:
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations
{
"name": "RESERVATION_NAME",
"specificReservation": {
"count": "NUMBER_OF_VMS",
"sourceInstanceTemplate": "projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME"
}
}
Replace the following:
PROJECT_ID: the ID of the project where you want to reserve resources and where the instance template exists.ZONE: the zone in which to reserve resources.RESERVATION_NAME: the name of the reservation to create.NUMBER_OF_VMS: the number of VMs to reserve.LOCATION: the location of the instance template. Specify one of the following values:For a global instance template:
global.For a regional instance template:
regions/REGION. ReplaceREGIONwith the region where the instance template is located. If you specify a regional instance template, then you can only reserve VMs within the same region as the template's region.
INSTANCE_TEMPLATE_NAME: the name of an existing instance template. If the instance template specifies an A3 Mega, A3 High, or A3 Edge machine type or a compact placement policy, then you must include thespecificReservationRequiredfield in the request body, and set the field totrue. This indicates that only VMs that specifically target this reservation can consume it. For more information, see Consume VMs from a specific reservation.
For example, to create a reservation for ten VMs in zone us-central1-a by
specifying a global instance template, make the following POST request:
POST https://compute.googleapis.com/compute/v1/projects/example-project/zones/us-central1-a/reservations
{
"name": "my-reservation",
"specificReservation": {
"count": "10",
"sourceInstanceTemplate": "projects/example-project/global/instanceTemplates/example-instance-template"
}
}
Optionally, you can do one or more of the following:
To specify that only VMs that specifically target this reservation can consume it, include the
specificReservationRequiredfield in the request body, and set the field totrue.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "sourceInstanceTemplate": "projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME" }, "specificReservationRequired": true }To allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, include the
serviceShareTypefield and set it toALLOW_ALL.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "reservationSharingPolicy": { "serviceShareType": "ALLOW_ALL" }, "specificReservation": { "count": "NUMBER_OF_VMS", "sourceInstanceTemplate": "projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME" } }To enable Compute Engine to automatically delete the reservation, select one of the following methods:
To delete the reservation at a specific date and time, make a
POSTrequest to thebeta.reservations.insertmethod. In the request body, include thedeleteAtTimefield.POST https://compute.googleapis.com/compute/beta/projects/PROJECT_ID/zones/ZONE/reservations { "deleteAtTime": "DELETE_AT_TIME", "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "sourceInstanceTemplate": "projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME" } }Replace
DELETE_AT_TIMEwith a date and time formatted as an RFC 3339 timestamp, which must be as follows:YYYY-MM-DDTHH:MM:SSOFFSETReplace the following:
YYYY-MM-DD: a date formatted as a 4-digit year, 2-digit month, and a 2-digit day of the month, separated by hyphens (-).HH:MM:SS: a time formatted as a 2-digit hour using a 24-hour time, 2-digit minutes, and 2-digit seconds, separated by colons (:).OFFSET: the time zone formatted as an offset of Coordinated Universal Time (UTC). For example, to use the Pacific Standard Time (PST), specify-08:00. Alternatively, to use no offset, specifyZ.
To delete the reservation after a specific duration, make a
POSTrequest to thebeta.reservations.insertmethod. In the request body, include thedeleteAfterDurationfield.POST https://compute.googleapis.com/compute/beta/projects/PROJECT_ID/zones/ZONE/reservations { "deleteAfterDuration": { "seconds": "DELETE_AFTER_DURATION" }, "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "sourceInstanceTemplate": "projects/PROJECT_ID/LOCATION/instanceTemplates/INSTANCE_TEMPLATE_NAME" } }Replace
DELETE_AFTER_DURATIONwith a duration in seconds. For example, specify86400for 86,400 seconds (1 day).
Specify an existing VM
You can only create a reservation based on an existing VM in the same zone as the VM.
After creating the reservation, you can consume it by creating VMs with properties that match the reference VM. You can do this by doing one of the following:
Create and use an instance template as follows:
Create an instance template based on the reference VM without overriding the reference VM's properties.
Create VMs using the newly-created template by doing one or both of the following:
Create a VM based on the reference VM while ensuring that the properties of the new VM and the reference VM exactly match.
To create a single-project reservation that uses the properties of an existing VM, do the following:
In the Google Cloud console, go to the Reservations page.
Click Create reservation.
The Create a reservation page opens.
For Name, enter a name for your reservation.
For Region and Zone, select where you want to reserve resources.
In the Share type section, click Local if it isn't already selected.
In the Use with VM instance section, select one of the following options:
To allow matching VMs to automatically consume this reservation, select Use reservation automatically if it isn't already selected.
To consume this reservation's resources only when creating matching VMs that specifically target this reservation by name, select Select specific reservation.
For Number of VM instances, enter the number of VMs that you want to reserve.
In the Machine configuration section, do the following:
Select Use existing VM.
For Existing VM, select the VM which properties you want to use to create the reservation.
Optional: To specify a compact placement policy for a reservation that meets the requirements, in the Group placement policy section, click the Select or create a group placement policy list, and then do one of the following:
To create a compact placement policy to specify in this reservation, do the following:
Click Create group placement policy.
The Create a group placement policy pane appears.
For Policy name, enter a name for your policy.
Click Create.
Creating the compact placement policy might take a few seconds to complete.
Otherwise, select an existing compact placement policy.
In the Auto-delete section, you can enable the auto-delete option to let Compute Engine automatically delete the reservation at a specific date and time. Automatically deleting reservations can be useful to avoid unnecessary charges when you stop consuming the reservation.
To create the reservation, click Create.
The Reservations page opens. Creating the reservation might take up to a minute to complete.
Specify properties directly
To create a single-project reservation by specifying properties directly, select one of the following options:
Console
In the Google Cloud console, go to the Reservations page.
On the On-demand reservations tab (default), click Create reservation.
The Create a reservation page opens.
For Name, enter a name for your reservation.
For Region and Zone, select where you want to reserve resources.
In the Share type section, click Local if it isn't already selected.
Optional: To allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, in the Google Cloud services section, select Share reservation.
In the Use with VM instance section, select one of the following options:
To allow matching VMs to automatically consume this reservation, select Use reservation automatically if it isn't already selected.
To consume this reservation's resources only when creating matching VMs that specifically target this reservation by name, select Select specific reservation.
For Number of VM instances, enter the number of VMs that you want to reserve.
In the Machine configuration section, select Specify machine type, and then specify the following:
For Machine family, Series, and Machine type, select a machine family, series, and machine type.
Optional: To specify a minimum CPU platform or attach GPUs to N1 VMs, do the following:
To expand the CPU Platform and GPU section, click the expander arrow.
Optional: To specify a minimum CPU platform, for CPU Platform, select an option.
Optional: To attach GPUs to N1 VMs, click Add GPU. Then, for GPU type and Number of GPUs, select the type and number of GPUs to attach to each N1 VM.
Optional: To attach Local SSD disks to a machine type that doesn't include Local SSD disks by default, do the following:
For Number of disks, select the number of Local SSD disks to attach.
For Interface type, select the disk interface for the Local SSD disks.
Optional: To specify a compact placement policy for a reservation that meets the requirements, in the Group placement policy section, click the Select or create a group placement policy list, and then do one of the following:
To create a compact placement policy to specify in this reservation, do the following:
Click Create group placement policy.
The Create a group placement policy pane appears.
For Policy name, enter a name for your policy.
Click Create.
Creating the compact placement policy might take a few seconds to complete.
Otherwise, select an existing compact placement policy.
In the Auto-delete section, you can enable the auto-delete option to let Compute Engine automatically delete the reservation at a specific date and time. Automatically deleting reservations can be useful to avoid unnecessary charges when you stop consuming the reservation.
To create the reservation, click Create.
The Reservations page opens. Creating the single-project reservation might take up to a minute to complete.
gcloud
To create a single-project reservation, use the
gcloud compute reservations create command.
To create a single-project reservation by specifying properties directly and without including any optional flags, run the following command:
gcloud compute reservations create RESERVATION_NAME \
--machine-type=MACHINE_TYPE \
--vm-count=NUMBER_OF_VMS \
--zone=ZONE
Replace the following:
RESERVATION_NAME: the name of the reservation to create.MACHINE_TYPE: a machine type to use for each VM. If you specify an A3 Mega, A3 High, or A3 Edge machine type, then you must include the--require-specific-reservationflag. This indicates that only VMs that specifically target the reservation can consume it. For more information, see Consume VMs from a specific reservation.NUMBER_OF_VMS: the number of VMs to reserve.ZONE: the zone in which to reserve resources.
For example, to create a reservation in zone us-central1-a for ten VMs
that each use an N2 predefined machine type with 4 vCPUs, run the
following command:
gcloud compute reservations create my-reservation \
--machine-type=n2-standard-4 \
--vm-count=10 \
--zone=us-central1-a
Optionally, you can do one or more of the following:
To attach GPUs to your reserved N1 VMs, include the
--acceleratorflag.gcloud compute reservations create RESERVATION_NAME \ --accelerator=count=NUMBER_OF_ACCELERATORS,type=ACCELERATOR_TYPE \ --machine-type=MACHINE_TYPE \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace the following:
NUMBER_OF_ACCELERATORS: the number of GPUs to add per reserved VM.ACCELERATOR_TYPE: a supported GPU model for N1 VMs. Ensure that your chosen GPU model is available in the zone where you want to reserve resources. Otherwise, creating the reservation fails.
To attach Local SSD disks to a machine type that doesn't include Local SSD disks by default, include the
--local-ssdflag. You can only attach Local SSD disks of 375 GB.gcloud compute reservations create RESERVATION_NAME \ --local-ssd=count=NUMBER_OF_LOCAL_SSD_DISKS,size=375,interface=INTERFACE_TYPE \ --machine-type=MACHINE_TYPE \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace the following:
NUMBER_OF_LOCAL_SSD_DISKS: the number of Local SSD disks to attach.INTERFACE_TYPE: the type of disk interface that you want each Local SSD disk to use and that your specified machine type supports. Specify one of the following values:For the NVME disk interface:
nvmeFor the SCSI disk interface:
scsi
To have the reserved VMs use a specific minimum CPU platform instead of the zone's default CPU platform, include the
--min-cpu-platformflag.gcloud compute reservations create RESERVATION_NAME \ --machine-type=MACHINE_TYPE \ --min-cpu-platform="MIN_CPU_PLATFORM" \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace
MIN_CPU_PLATFORMwith a minimum CPU platform. To make sure that a CPU platform is available in the zone where you're reserving resources, view the available CPU platforms by zone.To specify that only VMs that specifically target this reservation can consume it, include the
--require-specific-reservationflag.gcloud compute reservations create RESERVATION_NAME \ --machine-type=MACHINE_TYPE \ --require-specific-reservation \ --vm-count=NUMBER_OF_VMS \ --zone=ZONETo specify a compact placement policy for lower network latency among VMs, include the
--resource-policies=policyflag.gcloud compute reservations create RESERVATION_NAME \ --machine-type=MACHINE_TYPE \ --resource-policies=policy=COMPACT_PLACEMENT_POLICY_NAME \ --require-specific-reservation \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace COMPACT_PLACEMENT_POLICY_NAME with the name of an existing compact placement policy. Additionally, to avoid errors when creating a single-project reservation that specifies a compact placement policy, make sure to specify the following:
A supported machine type and maximum number of VMs for compact placement policies.
A zone within the region where the compact placement policy is located.
The
--require-specific-reservationflag. This indicates that only VMs that specifically target the reservation can consume it.
To allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, include the
--reservation-sharing-policyflag set toALLOW_ALL.gcloud compute reservations create RESERVATION_NAME \ --machine-type=MACHINE_TYPE \ --reservation-sharing-policy=ALLOW_ALL \ --vm-count=NUMBER_OF_VMS \ --zone=ZONETo enable Compute Engine to automatically delete the reservation, select one of the following methods:
To delete the reservation at a specific date and time, use the
gcloud beta compute reservations createcommand with the--delete-at-timeflag.gcloud beta compute reservations create RESERVATION_NAME \ --delete-at-time=DELETE_AT_TIME \ --machine-type=MACHINE_TYPE \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace
DELETE_AT_TIMEwith a date and time formatted as an RFC 3339 timestamp, which must be as follows:YYYY-MM-DDTHH:MM:SSOFFSETReplace the following:
YYYY-MM-DD: a date formatted as a 4-digit year, 2-digit month, and a 2-digit day of the month, separated by hyphens (-).HH:MM:SS: a time formatted as a 2-digit hour using a 24-hour time, 2-digit minutes, and 2-digit seconds, separated by colons (:).OFFSET: the time zone formatted as an offset of Coordinated Universal Time (UTC). For example, to use the Pacific Standard Time (PST), specify-08:00. Alternatively, to use no offset, specifyZ.
To delete the reservation after a specific duration, use the
gcloud beta compute reservations createcommand with the--delete-after-durationflag.gcloud beta compute reservations create RESERVATION_NAME \ --delete-after-duration=DELETE_AFTER_DURATION \ --machine-type=MACHINE_TYPE \ --vm-count=NUMBER_OF_VMS \ --zone=ZONEReplace
DELETE_AFTER_DURATIONwith a duration in days, hours, minutes, or seconds. For example, specify30mfor 30 minutes, or1d2h3m4sfor 1 day, 2 hours, 3 minutes, and 4 seconds.
Go
import(
"context"
"fmt"
"io"
compute"cloud.google.com/go/compute/apiv1"
computepb"cloud.google.com/go/compute/apiv1/computepb"
"google.golang.org/protobuf/proto"
)
// Creates the reservation with accelerated image
funccreateBaseReservation(wio.Writer,projectID,zone,reservationNamestring)error{
// projectID := "your_project_id"
// zone := "us-west3-a"
// reservationName := "your_reservation_name"
ctx:=context.Background()
reservationsClient,err:=compute.NewReservationsRESTClient (ctx)
iferr!=nil{
returnerr
}
deferreservationsClient.Close()
// Creating reservation based on direct properties
req:=&computepb.InsertReservationRequest{
Project:projectID,
ReservationResource:&computepb.Reservation{
Name:proto.String(reservationName),
Zone:proto.String(zone),
SpecificReservation:&computepb.AllocationSpecificSKUReservation{
Count:proto.Int64(2),
// Properties, which allows customising instances
InstanceProperties:&computepb.AllocationSpecificSKUAllocationReservedInstanceProperties{
// Attaching GPUs to the reserved VMs
// Read more: https://cloud.google.com/compute/docs/gpus#n1-gpus
GuestAccelerators:[]*computepb.AcceleratorConfig{
{
AcceleratorCount:proto.Int32(1),
AcceleratorType:proto.String("nvidia-tesla-t4"),
},
},
// Including local SSD disks
LocalSsds:[]*computepb.AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk{
{
DiskSizeGb:proto.Int64(375),
Interface:proto.String("NVME"),
},
},
MachineType:proto.String("n1-standard-2"),
// Specifying minimum CPU platform
// Read more: https://cloud.google.com/compute/docs/instances/specify-min-cpu-platform
MinCpuPlatform:proto.String("Intel Skylake"),
},
},
},
Zone:zone,
}
op,err:=reservationsClient.Insert(ctx,req)
iferr!=nil{
returnfmt.Errorf("unable to create reservation: %w",err)
}
iferr=op.Wait(ctx);err!=nil{
returnfmt.Errorf("unable to wait for the operation: %w",err)
}
fmt.Fprintf(w,"Reservation created\n")
returnnil
}
Java
importcom.google.cloud.compute.v1.AcceleratorConfig ;
importcom.google.cloud.compute.v1.AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk ;
importcom.google.cloud.compute.v1.AllocationSpecificSKUAllocationReservedInstanceProperties ;
importcom.google.cloud.compute.v1.AllocationSpecificSKUReservation ;
importcom.google.cloud.compute.v1.Operation ;
importcom.google.cloud.compute.v1.Reservation ;
importcom.google.cloud.compute.v1.ReservationsClient ;
importjava.io.IOException;
importjava.util.concurrent.ExecutionException;
importjava.util.concurrent.TimeUnit;
importjava.util.concurrent.TimeoutException;
publicclass CreateReservation{
publicstaticvoidmain(String[]args)
throwsIOException,ExecutionException,InterruptedException,TimeoutException{
// TODO(developer): Replace these variables before running the sample.
// Project ID or project number of the Cloud project you want to use.
StringprojectId="YOUR_PROJECT_ID";
// Name of the zone in which you want to create the disk.
Stringzone="us-central1-a";
// Name of the reservation you want to create.
StringreservationName="YOUR_RESERVATION_NAME";
// Number of instances in the reservation.
intnumberOfVms=3;
createReservation(projectId,reservationName,numberOfVms,zone);
}
// Creates reservation with optional flags
publicstaticReservation createReservation(
StringprojectId,StringreservationName,intnumberOfVms,Stringzone)
throwsIOException,ExecutionException,InterruptedException,TimeoutException{
// Create the reservation with optional properties:
// Machine type of the instances in the reservation.
StringmachineType="n1-standard-2";
// Number of accelerators to be attached to the instances in the reservation.
intnumberOfAccelerators=1;
// Accelerator type to be attached to the instances in the reservation.
StringacceleratorType="nvidia-tesla-t4";
// Minimum CPU platform to be attached to the instances in the reservation.
StringminCpuPlatform="Intel Skylake";
// Local SSD size in GB to be attached to the instances in the reservation.
intlocalSsdSize=375;
// Local SSD interfaces to be attached to the instances in the reservation.
StringlocalSsdInterface1="NVME";
StringlocalSsdInterface2="SCSI";
booleanspecificReservationRequired=true;
// Initialize client that will be used to send requests. This client only needs to be created
// once, and can be reused for multiple requests.
try(ReservationsClient reservationsClient=ReservationsClient .create()){
Reservation reservation=
Reservation .newBuilder()
.setName(reservationName)
.setZone(zone)
.setSpecificReservationRequired (specificReservationRequired)
.setSpecificReservation (
AllocationSpecificSKUReservation .newBuilder()
// Set the number of instances
.setCount(numberOfVms)
// Set instance properties
.setInstanceProperties(
AllocationSpecificSKUAllocationReservedInstanceProperties .newBuilder()
.setMachineType(machineType)
.setMinCpuPlatform(minCpuPlatform)
.addGuestAccelerators(
AcceleratorConfig .newBuilder()
.setAcceleratorCount(numberOfAccelerators)
.setAcceleratorType(acceleratorType)
.build())
.addLocalSsds (
AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk
.newBuilder()
.setDiskSizeGb(localSsdSize)
.setInterface(localSsdInterface1)
.build())
.addLocalSsds (
AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk
.newBuilder()
.setDiskSizeGb(localSsdSize)
.setInterface(localSsdInterface2)
.build())
.build())
.build())
.build();
Operation response=
reservationsClient.insertAsync(projectId,zone,reservation).get(7,TimeUnit.MINUTES);
if(response.hasError ()){
returnnull;
}
returnreservationsClient.get(projectId,zone,reservationName);
}
}
}Node.js
// Import the Compute library
constcomputeLib=require('@google-cloud/compute');
constcompute=computeLib.protos.google.cloud.compute.v1;
// Instantiate a reservationsClient
constreservationsClient=newcomputeLib.ReservationsClient ();
// Instantiate a zoneOperationsClient
constzoneOperationsClient=newcomputeLib.ZoneOperationsClient ();
/**
* TODO(developer): Update/uncomment these variables before running the sample.
*/
// The ID of the project where you want to reserve resources.
constprojectId=awaitreservationsClient.getProjectId();
// The zone in which to reserve resources.
constzone='us-central1-a';
// The name of the reservation to create.
// reservationName = 'reservation-01';
// The number of VMs to reserve.
constvmsNumber=3;
// Machine type to use for each VM.
constmachineType='n1-standard-4';
asyncfunctioncallCreateComputeReservationFromProperties(){
// Create specific reservation for 3 VMs that each use an N1 predefined machine type with 4 vCPUs.
constspecificReservation=newcompute.AllocationSpecificSKUReservation ({
count:vmsNumber,
instanceProperties:{
machineType,
// To have the reserved VMs use a specific minimum CPU platform instead of the zone's default CPU platform.
minCpuPlatform:'Intel Skylake',
// If you want to attach GPUs to your reserved N1 VMs, update and uncomment guestAccelerators if needed.
// guestAccelerators: [
// {
// // The number of GPUs to add per reserved VM.
// acceleratorCount: 1,
// // Supported GPU model for N1 VMs. Ensure that your chosen GPU model is available in the zone,
// // where you want to reserve resources.
// acceleratorType: 'nvidia-tesla-t4',
// },
// ],
// If you want to add local SSD disks to each reserved VM, update and uncomment localSsds if needed.
// You can specify up to 24 Local SSD disks. Each Local SSD disk is 375 GB.
// localSsds: [
// {
// diskSizeGb: 375,
// // The type of interface you want each Local SSD disk to use. Specify one of the following values: NVME or SCSI.
// // Make sure that the machine type you specify for the reserved VMs supports the chosen disk interfaces.
// interface: 'NVME',
// },
// ],
},
});
// Create a reservation.
constreservation=newcompute.Reservation ({
name:reservationName,
zone,
specificReservation,
// To specify that only VMs that specifically target this reservation can consume it,
// set specificReservationRequired field to true.
specificReservationRequired:true,
});
const[response]=awaitreservationsClient.insert({
project:projectId,
reservationResource:reservation,
zone,
});
letoperation=response.latestResponse;
// Wait for the create reservation operation to complete.
while(operation.status!=='DONE'){
[operation]=awaitzoneOperationsClient.wait({
operation:operation.name,
project:projectId,
zone:operation.zone.split('/').pop(),
});
}
console.log(`Reservation: ${reservationName} created.`);
}
awaitcallCreateComputeReservationFromProperties();Python
from__future__import annotations
importsys
fromtypingimport Any
fromgoogle.api_core.extended_operationimport ExtendedOperation
fromgoogle.cloudimport compute_v1
defwait_for_extended_operation(
operation: ExtendedOperation, verbose_name: str = "operation", timeout: int = 300
) -> Any:
"""
Waits for the extended (long-running) operation to complete.
If the operation is successful, it will return its result.
If the operation ends with an error, an exception will be raised.
If there were any warnings during the execution of the operation
they will be printed to sys.stderr.
Args:
operation: a long-running operation you want to wait on.
verbose_name: (optional) a more verbose name of the operation,
used only during error and warning reporting.
timeout: how long (in seconds) to wait for operation to finish.
If None, wait indefinitely.
Returns:
Whatever the operation.result() returns.
Raises:
This method will raise the exception received from `operation.exception()`
or RuntimeError if there is no exception set, but there is an `error_code`
set for the `operation`.
In case of an operation taking longer than `timeout` seconds to complete,
a `concurrent.futures.TimeoutError` will be raised.
"""
result = operation.result(timeout=timeout)
if operation.error_code:
print(
f"Error during {verbose_name}: [Code: {operation.error_code}]: {operation.error_message}",
file=sys.stderr,
flush=True,
)
print(f"Operation ID: {operation.name}", file=sys.stderr, flush=True)
raise operation.exception() or RuntimeError(operation.error_message)
if operation.warnings:
print(f"Warnings during {verbose_name}:\n", file=sys.stderr, flush=True)
for warning in operation.warnings:
print(f" - {warning.code}: {warning.message}", file=sys.stderr, flush=True)
return result
defcreate_compute_reservation(
project_id: str,
zone: str = "us-central1-a",
reservation_name="your-reservation-name",
) -> compute_v1.Reservation:
"""Creates a compute reservation in GCP.
Args:
project_id (str): The ID of the Google Cloud project.
zone (str): The zone to create the reservation.
reservation_name (str): The name of the reservation to create.
Returns:
Reservation object that represents the new reservation.
"""
instance_properties = compute_v1 .AllocationSpecificSKUAllocationReservedInstanceProperties (
machine_type="n1-standard-1",
# Optional. Specifies the minimum CPU platform for the VM instance.
min_cpu_platform="Intel Ivy Bridge",
# Optional. Specifies amount of local ssd to reserve with each instance.
local_ssds=[
compute_v1 .AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk (
disk_size_gb=375, interface="NVME"
),
compute_v1 .AllocationSpecificSKUAllocationAllocatedInstancePropertiesReservedDisk (
disk_size_gb=375, interface="SCSI"
),
],
# Optional. Specifies the GPUs allocated to each instance.
# guest_accelerators=[
# compute_v1.AcceleratorConfig(
# accelerator_count=1, accelerator_type="nvidia-tesla-t4"
# )
# ],
)
reservation = compute_v1 .Reservation (
name=reservation_name,
specific_reservation=compute_v1 .AllocationSpecificSKUReservation (
count=3, # Number of resources that are allocated.
# If you use source_instance_template, you must exclude the instance_properties field.
# It can be a full or partial URL.
# source_instance_template="projects/[PROJECT_ID]/global/instanceTemplates/my-instance-template",
instance_properties=instance_properties,
),
)
# Create a client
client = compute_v1 .ReservationsClient ()
operation = client.insert (
project=project_id,
zone=zone,
reservation_resource=reservation,
)
wait_for_extended_operation(operation, "Reservation creation")
reservation = client.get (
project=project_id, zone=zone, reservation=reservation_name
)
print("Name: ", reservation.name)
print("STATUS: ", reservation.status)
print(reservation.specific_reservation)
# Example response:
# Name: your-reservation-name
# STATUS: READY
# count: 3
# instance_properties {
# machine_type: "n1-standard-1"
# local_ssds {
# disk_size_gb: 375
# interface: "NVME"
# }
# ...
return reservation
Terraform
To create a single-project reservation, use the
google_compute_reservation Terraform resource.
For example, to create a single-project reservation for one N2 predefined machine type with 2 vCPUs, use the following resource:
resource "google_compute_reservation" "default" {
name = "gce-reservation-local"
zone = "us-central1-a"
/**
* To specify a single-project reservation, omit the share_settings block
* (default) or set the share_type field to LOCAL.
*/
share_settings {
share_type = "LOCAL"
}
specific_reservation {
count = 1
instance_properties {
machine_type = "n2-standard-2"
}
}
/**
* To let VMs with affinity for any reservation consume this reservation, omit
* the specific_reservation_required field (default) or set it to false.
*/
specific_reservation_required = false
}
For more information about how to use Terraform, see Using Terraform with Google Cloud.
REST
To create a single-project reservation, make a POST request to the
reservations.insert method.
To create a single-project reservation by specifying properties directly and
without any including any optional fields, make the following POST
request:
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations
{
"name": "RESERVATION_NAME",
"specificReservation": {
"count": "NUMBER_OF_VMS",
"instanceProperties": {
"machineType": "MACHINE_TYPE"
}
}
}
Replace the following:
PROJECT_ID: the ID of the project where you want to reserve resources.ZONE: the zone in which to reserve resources.RESERVATION_NAME: the name of the reservation to create.NUMBER_OF_VMS: the number of VMs to reserve.MACHINE_TYPE: a machine type to use for each VM. If you specify an A3 Mega, A3 High, or A3 Edge machine type, then you must include thespecificReservationRequiredfield in the request body, and set the field totrue. This indicates that only VMs that specifically target the reservation can consume it. For more information, see Consume VMs from a specific reservation.
For example, to create a reservation in zone us-central1-a for ten VMs
that each use an N2 predefined machine type with 4 vCPUs, make the
following POST request:
POST https://compute.googleapis.com/compute/v1/projects/example-project/zones/us-central1-a/reservations
{
"name": "my-reservation",
"specificReservation": {
"count": "10",
"instanceProperties": {
"machineType": "n2-standard-4",
}
}
}
Optionally, you can do one or more of the following:
To attach GPUs to your reserved N1 VMs, include the
guestAcceleratorsfield in the request body.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "guestAccelerators": [ { "acceleratorCount": NUMBER_OF_ACCELERATORS, "acceleratorType": "ACCELERATOR_TYPE" } ], "machineType": "MACHINE_TYPE" } } }Replace the following:
NUMBER_OF_ACCELERATORS: the number of GPUs to add per reserved VM.ACCELERATOR_TYPE: a supported GPU model for N1 VMs. Ensure that your chosen GPU model is available in the zone where you want to reserve resources. Otherwise, creating the reservation fails.
To attach Local SSD disks to a machine type that doesn't include Local SSD disks by default, include the
localSsdsfield in the request body. You can only attach Local SSD disks of 375 GB.For example, to attach two Local SSD disks to the machine type that you want to reserve, make a request as follows:
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "localSsds": [ { "diskSizeGb": "375", "interface": "INTERFACE_TYPE" }, { "diskSizeGb": "375", "interface": "INTERFACE_TYPE" } ], "machineType": "MACHINE_TYPE" } } }Replace
INTERFACE_TYPEwith the type of disk interface that you want each Local SSD disk to use and that your specified machine type supports. Specify one of the following values:For the NVME disk interface:
NVMEFor the SCSI disk interface:
SCSI
To have the reserved VMs use a specific minimum CPU platform instead of the zone's default CPU platform, include the
minCpuPlatformfield in the request body.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE", "minCpuPlatform": "MIN_CPU_PLATFORM" } } }Replace
MIN_CPU_PLATFORMwith a minimum CPU platform. To make sure that a CPU platform is available in the zone where you're reserving resources, view the available CPU platforms by zone.To specify a compact placement policy to reduce network latency among your reserved VMs, include the
resourcePoliciesfield in the request body.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "resourcePolicies": { "policy" : "projects/example-project/regions/REGION/resourcePolicies/COMPACT_PLACEMENT_POLICY_NAME" }, "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE" } }, "specificReservationRequired": true }Replace the following:
REGION: the region where the compact placement policy is located. You can only create the reservation within the region of the placement policy.COMPACT_PLACEMENT_POLICY_NAME: the name of an existing compact placement policy.
Additionally, to avoid errors when creating a single-project reservation that specifies a compact placement policy, make sure to specify the following:
A supported machine type for compact placement policies.
The
specificReservationRequiredfield set totrue. This indicates that only VMs that specifically target this reservation can consume it.
To specify that only VMs that specifically target this reservation can consume it, include the
specificReservationRequiredfield in the request body and set the field totrue.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE" } }, "specificReservationRequired": true }To allow a reservation of GPU VMs to be consumed by custom training jobs or prediction jobs in Vertex AI, include the
serviceShareTypefield and set it toALLOW_ALL.POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/reservations { "name": "RESERVATION_NAME", "reservationSharingPolicy": { "serviceShareType": "ALLOW_ALL" }, "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE" } } }To enable Compute Engine to automatically delete the reservation, select one of the following methods:
To delete the reservation at a specific date and time, make a
POSTrequest to thebeta.reservations.insertmethod. In the request body, include thedeleteAtTimefield.POST https://compute.googleapis.com/compute/beta/projects/PROJECT_ID/zones/ZONE/reservations { "deleteAtTime": "DELETE_AT_TIME", "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE" } } }Replace
DELETE_AT_TIMEwith a date and time formatted as an RFC 3339 timestamp, which must be as follows:YYYY-MM-DDTHH:MM:SSOFFSETReplace the following:
YYYY-MM-DD: a date formatted as a 4-digit year, 2-digit month, and a 2-digit day of the month, separated by hyphens (-).HH:MM:SS: a time formatted as a 2-digit hour using a 24-hour time, 2-digit minutes, and 2-digit seconds, separated by colons (:).OFFSET: the time zone formatted as an offset of Coordinated Universal Time (UTC). For example, to use the Pacific Standard Time (PST), specify-08:00. Alternatively, to use no offset, specifyZ.
To delete the reservation after a specific duration, make a
POSTrequest to thebeta.reservations.insertmethod. In the request body, include thedeleteAfterDurationfield.POST https://compute.googleapis.com/compute/beta/projects/PROJECT_ID/zones/ZONE/reservations { "deleteAfterDuration": { "seconds": "DELETE_AFTER_DURATION" }, "name": "RESERVATION_NAME", "specificReservation": { "count": "NUMBER_OF_VMS", "instanceProperties": { "machineType": "MACHINE_TYPE" } } }Replace
DELETE_AFTER_DURATIONwith a duration in seconds. For example, specify86400for 86,400 seconds (1 day).
Troubleshooting
Learn how to troubleshoot reservation creation.
What's next
Learn how to view your reservations.
Learn how to attach reservations to commitments.
Learn how to consume reservations.
Learn how to monitor reservations consumption.