The Anaconda python distribution. The Anaconda component is not supported
in the Dataproc
<a
href="/dataproc/docs/concepts/versioning/dataproc-release-2.0">2.0
image</a>. The 2.0 image is pre-installed with Miniconda.
Docker
Docker
Druid
The Druid query engine. (alpha)
Flink
Flink
Hbase
HBase. (beta)
HiveWebhcat
The Hive Web HCatalog (the REST service for accessing HCatalog).
Jupyter
The Jupyter Notebook.
Presto
The Presto query engine.
Ranger
The Ranger service.
Solr
The Solr service.
Unspecified
Unspecified component. Specifying this will cause Cluster creation to fail.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["\u003cp\u003eThe latest version of the Google Cloud Dataproc v1 API component is 5.17.0, with previous versions ranging down to 3.1.0 available for reference.\u003c/p\u003e\n"],["\u003cp\u003eThe documentation covers the \u003ccode\u003eComponent\u003c/code\u003e enum, which lists cluster components that can be activated, such as Anaconda, Docker, Druid, Flink, HBase, and others.\u003c/p\u003e\n"],["\u003cp\u003eEach component field, like \u003ccode\u003eAnaconda\u003c/code\u003e, \u003ccode\u003eDocker\u003c/code\u003e, \u003ccode\u003eDruid\u003c/code\u003e, etc, has a description explaining its role within the Google Cloud Dataproc v1 environment.\u003c/p\u003e\n"],["\u003cp\u003eThe documentation mentions that the Anaconda component is not supported in Dataproc 2.0 images, as they are pre-installed with Miniconda instead.\u003c/p\u003e\n"],["\u003cp\u003eThe component \u003ccode\u003eUnspecified\u003c/code\u003e will cause the cluster creation to fail.\u003c/p\u003e\n"]]],[],null,[]]