Google Cloud Gke Recommender V1 Client - Class StorageConfig (0.2.0)

Reference documentation and code samples for the Google Cloud Gke Recommender V1 Client class StorageConfig.

Storage configuration for a model deployment.

Generated from protobuf message google.cloud.gkerecommender.v1.StorageConfig

Namespace

Google \ Cloud \ GkeRecommender \ V1

Methods

__construct

Constructor.

Parameters
Name	Description
`data`	`array` Optional. Data for populating the Message object.
`↳ model_bucket_uri`	`string` Optional. The Google Cloud Storage bucket URI to load the model from. This URI must point to the directory containing the model's config file (`config.json`) and model weights. A tuned GCSFuse setup can improve LLM Pod startup time by more than 7x. Expected format: `gs://<bucket-name>/<path-to-model>`.
`↳ xla_cache_bucket_uri`	`string` Optional. The URI for the GCS bucket containing the XLA compilation cache. If using TPUs, the XLA cache will be written to the same path as `model_bucket_uri`. This can speed up vLLM model preparation for repeated deployments.
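
As a minimal sketch of the constructor parameters above, the following populates both fields through the `data` array and reads one back. It assumes Composer's autoloader is available at `vendor/autoload.php` and that the package shipping this class is installed. The bucket names and paths are hypothetical placeholders.

```php
<?php
// Assumption: Composer autoloader path; adjust to your project layout.
require 'vendor/autoload.php';

use Google\Cloud\GkeRecommender\V1\StorageConfig;

// Keys match the field names listed in the parameter table.
// The gs:// URIs below are made-up placeholders, not real buckets.
$storageConfig = new StorageConfig([
    'model_bucket_uri'     => 'gs://example-bucket/models/example-model',
    'xla_cache_bucket_uri' => 'gs://example-bucket/xla-cache',
]);

echo $storageConfig->getModelBucketUri(), PHP_EOL;
```

Both keys are optional, so either can be left out of the array and set later through the corresponding setter.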

getModelBucketUri

Optional. The Google Cloud Storage bucket URI to load the model from. This URI must point to the directory containing the model's config file (`config.json`) and model weights. A tuned GCSFuse setup can improve LLM Pod startup time by more than 7x. Expected format: `gs://<bucket-name>/<path-to-model>`.

Returns
Type	Description
`string`
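
The expected `gs://<bucket-name>/<path-to-model>` format can be checked on the caller's side before a value is passed in. The helper below is a hypothetical sketch and is not part of the client library. It only checks the overall shape (scheme, bucket segment, non-empty path) and does not enforce Cloud Storage bucket naming rules or confirm that the bucket exists.

```php
<?php
/**
 * Hypothetical helper (not provided by the library): loosely checks that a
 * string has the shape gs://<bucket-name>/<path-to-model>.
 */
function looksLikeModelBucketUri(string $uri): bool
{
    // "gs://", then a bucket segment with no slash or whitespace,
    // then "/" and a non-empty path with no whitespace.
    return preg_match('#^gs://[^/\s]+/\S+\z#', $uri) === 1;
}

var_dump(looksLikeModelBucketUri('gs://example-bucket/models/example-model')); // bool(true)
var_dump(looksLikeModelBucketUri('example-bucket/models/example-model'));      // bool(false)
var_dump(looksLikeModelBucketUri('gs://example-bucket/'));                     // bool(false)
```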

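setModelBucketUri

Optional. The Google Cloud Storage bucket URI to load the model from. This URI must point to the directory containing the model's config file (`config.json`) and model weights. A tuned GCSFuse setup can improve LLM Pod startup time by more than 7x. Expected format: `gs://<bucket-name>/<path-to-model>`.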

Parameter
Name	Description
`var`	`string`

Returns
Type	Description
`$this`

getXlaCacheBucketUri

Optional. The URI for the GCS bucket containing the XLA compilation cache.

If using TPUs, the XLA cache will be written to the same path as `model_bucket_uri`. This can speed up vLLM model preparation for repeated deployments.

Returns
Type	Description
`string`

setXlaCacheBucketUri

Optional. The URI for the GCS bucket containing the XLA compilation cache.

If using TPUs, the XLA cache will be written to the same path as `model_bucket_uri`. This can speed up vLLM model preparation for repeated deployments.

Parameter
Name	Description
`var`	`string`

Returns
Type	Description
`$this`
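
Because both setters return `$this`, calls can be chained. The sketch below assumes Composer's autoloader is available at `vendor/autoload.php` and that the package shipping this class is installed. The URIs are hypothetical placeholders.

```php
<?php
// Assumption: Composer autoloader path; adjust to your project layout.
require 'vendor/autoload.php';

use Google\Cloud\GkeRecommender\V1\StorageConfig;

// Start from an empty message and fill it in with the fluent setters.
// The gs:// URIs below are made-up placeholders, not real buckets.
$storageConfig = (new StorageConfig())
    ->setModelBucketUri('gs://example-bucket/models/example-model')
    ->setXlaCacheBucketUri('gs://example-bucket/xla-cache');

printf("model: %s\n", $storageConfig->getModelBucketUri());
printf("xla cache: %s\n", $storageConfig->getXlaCacheBucketUri());
```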