This document describes how to automatically set default Cloud Storage FUSE values used for high-performance Compute Engine machine types, which are designed to optimize performance for demanding, high-throughput workloads. Values that are manually set at the time of mount will override these defaults.
Machine types
Configuration values are automated for the following high-performance Compute Engine machine types:
Series type | Machine type |
---|---|
A2 machine series | |
a2-megagpu-16g |
|
a2-ultragpu-8g |
|
A3 machine series | |
a3-edgegpu-8g |
|
a3-highgpu-8g |
|
a3-megagpu-8g |
|
a3-ultragpu-8g |
|
A4 machine series | |
4-highgpu-8g-lowmem |
|
TPU v5e | |
ct5l-hightpu-8t |
|
ct5lp-hightpu-8t |
|
TPU v5p | |
ct5p-hightpu-4t |
|
ct5p-hightpu-4t-tpu |
|
TPU v6e (Trillium) | |
ct6e-standard-4t |
|
ct6e-standard-4t-tpu |
|
ct6e-standard-8t |
|
ct6e-standard-8t-tpu |
Automated configuration values
When a supported machine type is detected, Cloud Storage FUSE automatically applies the following configuration values:
Cloud Storage FUSE configuration file field | Cloud Storage FUSE CLI option | Automated configuration value |
---|---|---|
metadata-cache.negative-ttl-secs |
--metadata-cache-negative-ttl-secs |
0 |
metadata-cache.ttl-secs 1 |
--metadata-cache-ttl-secs 1 |
|
metadata-cache.stat-cache-max-size-mb |
--stat-cache-max-size-mb |
1024 |
metadata-cache.type-cache-max-size-mb |
--type-cache-max-size-mb |
128 |
implicit-dirs |
--implicit-dirs |
true |
file-system.rename-dir-limit |
--rename-dir-limit |
200000 |
1Setting this configuration to -1
significantly boosts
performance by always serving files from the cache. Be aware that this
configuration bypasses consistency checks, which can lead to serving
outdated data. For details on managing data consistency, refer to
Overview of caching in Cloud Storage FUSE.
Further performance tuning
When you use a high-performance Cloud de Confiance by S3NS machine type, the configuration values detailed on this page are automatically applied. However, you can further fine-tune your machine for optimal performance using the following methods:
Use the Performance tuning best practices guide to improve Cloud Storage FUSE using key Cloud Storage FUSE features and configurations to achieve maximum throughput and optimal performance.
If you're running training, serving, or checkpointing and Just in Time (JIT) cache workloads on Google Kubernetes Engine clusters that use Cloud GPUs or Cloud TPU to access large datasets in Cloud Storage, you can streamline your setup by utilizing pre-configured YAML files to mount your Cloud Storage buckets directly into your pods more efficiently. For more information and instructions on how to use pre-configured GKE YAML files, see Use pre-configured GKE YAML files to optimize Cloud Storage FUSE performance.
If you're running training, serving, or checkpointing workloads using Cloud Storage FUSE, you can use the
profile
field or--profile
command option to automatically adjust specific Cloud Storage FUSE configurations for optimal performance based on the specific workload type. For more information, see Profile-based configurations for AI/ML workloads.
What's next
Learn how to tune Cloud Storage FUSE for optimal performance.
Use a pre-configured GKE YAML file to configure tuning best practices.