Message representing a set of files in a Cloud Storage bucket.
Regular expressions are used to allow fine-grained control over
which files in the bucket to include.
Included files are those that match at least one item in
include_regex and do not match any items in exclude_regex.
Note that a file that matches items from both lists will not be
included. For a match to occur, the entire file path (i.e.,
everything in the url after the bucket name) must match the regular
expression.
For example, given the input
{bucket_name: "mybucket", include_regex: ["directory1/.*"], exclude_regex: ["directory1/excluded.*"]}:
gs://mybucket/directory1/myfile will be included
gs://mybucket/directory1/directory2/myfile will be included
(.* matches across /)
gs://mybucket/directory0/directory1/myfile will not be
included (the full path doesn't match any items in
include_regex)
gs://mybucket/directory1/excludedfile will not be included
(the path matches an item in exclude_regex)
If include_regex is left empty, it will match all files by
default (this is equivalent to setting include_regex: [".*"]).
Some other common use cases:
{bucket_name: "mybucket", exclude_regex: [".*\.pdf"]} will
include all files in mybucket except for .pdf files
{bucket_name: "mybucket", include_regex: ["directory/[^/]+"]}
will include all files directly under
gs://mybucket/directory/, without matching across /
Attributes
Name
Description
bucket_name
str
The name of a Cloud Storage bucket. Required.
include_regex
MutableSequence[str]
A list of regular expressions matching file paths to
include. All files in the bucket that match at least one of
these regular expressions will be included in the set of
files, except for those that also match an item in
exclude_regex. Leaving this field empty will match all
files by default (this is equivalent to including .* in
the list).
Regular expressions use RE2
syntax __; a
guide can be found under the google/re2 repository on
GitHub.
exclude_regex
MutableSequence[str]
A list of regular expressions matching file paths to
exclude. All files in the bucket that match at least one of
these regular expressions will be excluded from the scan.
Regular expressions use RE2
syntax __; a
guide can be found under the google/re2 repository on
GitHub.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[],[],null,["# Class CloudStorageRegexFileSet (3.31.0)\n\nVersion latestkeyboard_arrow_down\n\n- [3.31.0 (latest)](/python/docs/reference/dlp/latest/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.30.0](/python/docs/reference/dlp/3.30.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.29.0](/python/docs/reference/dlp/3.29.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.28.0](/python/docs/reference/dlp/3.28.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.27.0](/python/docs/reference/dlp/3.27.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.26.0](/python/docs/reference/dlp/3.26.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.25.1](/python/docs/reference/dlp/3.25.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.24.0](/python/docs/reference/dlp/3.24.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.23.0](/python/docs/reference/dlp/3.23.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.22.0](/python/docs/reference/dlp/3.22.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.20.0](/python/docs/reference/dlp/3.20.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.19.0](/python/docs/reference/dlp/3.19.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.18.1](/python/docs/reference/dlp/3.18.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.17.0](/python/docs/reference/dlp/3.17.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.16.0](/python/docs/reference/dlp/3.16.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.15.3](/python/docs/reference/dlp/3.15.3/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.14.0](/python/docs/reference/dlp/3.14.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.13.0](/python/docs/reference/dlp/3.13.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.12.3](/python/docs/reference/dlp/3.12.3/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.11.1](/python/docs/reference/dlp/3.11.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.10.1](/python/docs/reference/dlp/3.10.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.9.2](/python/docs/reference/dlp/3.9.2/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.8.1](/python/docs/reference/dlp/3.8.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.7.1](/python/docs/reference/dlp/3.7.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.6.2](/python/docs/reference/dlp/3.6.2/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.5.0](/python/docs/reference/dlp/3.5.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.4.0](/python/docs/reference/dlp/3.4.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.3.1](/python/docs/reference/dlp/3.3.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.2.4](/python/docs/reference/dlp/3.2.4/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.1.1](/python/docs/reference/dlp/3.1.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [3.0.1](/python/docs/reference/dlp/3.0.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [2.0.2](/python/docs/reference/dlp/2.0.2/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [1.0.2](/python/docs/reference/dlp/1.0.2/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [0.15.2](/python/docs/reference/dlp/0.15.2/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [0.14.0](/python/docs/reference/dlp/0.14.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [0.13.0](/python/docs/reference/dlp/0.13.0/google.cloud.dlp_v2.types.CloudStorageRegexFileSet)\n- [0.12.1](/python/docs/reference/dlp/0.12.1/google.cloud.dlp_v2.types.CloudStorageRegexFileSet) \n\n CloudStorageRegexFileSet(mapping=None, *, ignore_unknown_fields=False, **kwargs)\n\nMessage representing a set of files in a Cloud Storage bucket.\nRegular expressions are used to allow fine-grained control over\nwhich files in the bucket to include.\n\nIncluded files are those that match at least one item in\n`include_regex` and do not match any items in `exclude_regex`.\nNote that a file that matches items from both lists will *not* be\nincluded. For a match to occur, the entire file path (i.e.,\neverything in the url after the bucket name) must match the regular\nexpression.\n\nFor example, given the input\n`{bucket_name: \"mybucket\", include_regex: [\"directory1/.*\"], exclude_regex: [\"directory1/excluded.*\"]}`:\n\n- `gs://mybucket/directory1/myfile` will be included\n- `gs://mybucket/directory1/directory2/myfile` will be included (`.*` matches across `/`)\n- `gs://mybucket/directory0/directory1/myfile` will *not* be included (the full path doesn't match any items in `include_regex`)\n- `gs://mybucket/directory1/excludedfile` will *not* be included (the path matches an item in `exclude_regex`)\n\nIf `include_regex` is left empty, it will match all files by\ndefault (this is equivalent to setting `include_regex: [\".*\"]`).\n\nSome other common use cases:\n\n- `{bucket_name: \"mybucket\", exclude_regex: [\".*\\.pdf\"]}` will include all files in `mybucket` except for .pdf files\n- `{bucket_name: \"mybucket\", include_regex: [\"directory/[^/]+\"]}` will include all files directly under `gs://mybucket/directory/`, without matching across `/`"]]