[BindServiceMethod(typeof(PredictionService), "BindService")]
public abstract class PredictionService.PredictionServiceBaseReference documentation and code samples for the Vertex AI v1beta1 API class PredictionService.PredictionServiceBase.
Base class for server-side implementations of PredictionService
Namespace
Google.Cloud.AIPlatform.V1Beta1Assembly
Google.Cloud.AIPlatform.V1Beta1.dll
Methods
ChatCompletions(ChatCompletionsRequest, IServerStreamWriter<HttpBody>, ServerCallContext)
public virtual Task ChatCompletions(ChatCompletionsRequest request, IServerStreamWriter<HttpBody> responseStream, ServerCallContext context)Exposes an OpenAI-compatible endpoint for chat completions.
| Parameters | |
|---|---|
| Name | Description | 
| request | ChatCompletionsRequestThe request received from the client. | 
| responseStream | IServerStreamWriterHttpBodyUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
CountTokens(CountTokensRequest, ServerCallContext)
public virtual Task<CountTokensResponse> CountTokens(CountTokensRequest request, ServerCallContext context)Perform a token counting.
| Parameters | |
|---|---|
| Name | Description | 
| request | CountTokensRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskCountTokensResponse | The response to send back to the client (wrapped by a task). | 
DirectPredict(DirectPredictRequest, ServerCallContext)
public virtual Task<DirectPredictResponse> DirectPredict(DirectPredictRequest request, ServerCallContext context)Perform an unary online prediction request to a gRPC model server for Vertex first-party products and frameworks.
| Parameters | |
|---|---|
| Name | Description | 
| request | DirectPredictRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskDirectPredictResponse | The response to send back to the client (wrapped by a task). | 
DirectRawPredict(DirectRawPredictRequest, ServerCallContext)
public virtual Task<DirectRawPredictResponse> DirectRawPredict(DirectRawPredictRequest request, ServerCallContext context)Perform an unary online prediction request to a gRPC model server for custom containers.
| Parameters | |
|---|---|
| Name | Description | 
| request | DirectRawPredictRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskDirectRawPredictResponse | The response to send back to the client (wrapped by a task). | 
EmbedContent(EmbedContentRequest, ServerCallContext)
public virtual Task<EmbedContentResponse> EmbedContent(EmbedContentRequest request, ServerCallContext context)Embed content with multimodal inputs.
| Parameters | |
|---|---|
| Name | Description | 
| request | EmbedContentRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskEmbedContentResponse | The response to send back to the client (wrapped by a task). | 
Explain(ExplainRequest, ServerCallContext)
public virtual Task<ExplainResponse> Explain(ExplainRequest request, ServerCallContext context)Perform an online explanation.
If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is specified, the corresponding DeployModel must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated. If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is not specified, all DeployedModels must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated.
| Parameters | |
|---|---|
| Name | Description | 
| request | ExplainRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskExplainResponse | The response to send back to the client (wrapped by a task). | 
GenerateContent(GenerateContentRequest, ServerCallContext)
public virtual Task<GenerateContentResponse> GenerateContent(GenerateContentRequest request, ServerCallContext context)Generate content with multimodal inputs.
| Parameters | |
|---|---|
| Name | Description | 
| request | GenerateContentRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskGenerateContentResponse | The response to send back to the client (wrapped by a task). | 
Predict(PredictRequest, ServerCallContext)
public virtual Task<PredictResponse> Predict(PredictRequest request, ServerCallContext context)Perform an online prediction.
| Parameters | |
|---|---|
| Name | Description | 
| request | PredictRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskPredictResponse | The response to send back to the client (wrapped by a task). | 
RawPredict(RawPredictRequest, ServerCallContext)
public virtual Task<HttpBody> RawPredict(RawPredictRequest request, ServerCallContext context)Perform an online prediction with an arbitrary HTTP payload.
The response includes the following HTTP headers:
- X-Vertex-AI-Endpoint-Id: ID of the [Endpoint][google.cloud.aiplatform.v1beta1.Endpoint] that served this prediction.
- X-Vertex-AI-Deployed-Model-Id: ID of the Endpoint's [DeployedModel][google.cloud.aiplatform.v1beta1.DeployedModel] that served this prediction.
| Parameters | |
|---|---|
| Name | Description | 
| request | RawPredictRequestThe request received from the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| TaskHttpBody | The response to send back to the client (wrapped by a task). | 
ServerStreamingPredict(StreamingPredictRequest, IServerStreamWriter<StreamingPredictResponse>, ServerCallContext)
public virtual Task ServerStreamingPredict(StreamingPredictRequest request, IServerStreamWriter<StreamingPredictResponse> responseStream, ServerCallContext context)Perform a server-side streaming online prediction request for Vertex LLM streaming.
| Parameters | |
|---|---|
| Name | Description | 
| request | StreamingPredictRequestThe request received from the client. | 
| responseStream | IServerStreamWriterStreamingPredictResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamDirectPredict(IAsyncStreamReader<StreamDirectPredictRequest>, IServerStreamWriter<StreamDirectPredictResponse>, ServerCallContext)
public virtual Task StreamDirectPredict(IAsyncStreamReader<StreamDirectPredictRequest> requestStream, IServerStreamWriter<StreamDirectPredictResponse> responseStream, ServerCallContext context)Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
| Parameters | |
|---|---|
| Name | Description | 
| requestStream | IAsyncStreamReaderStreamDirectPredictRequestUsed for reading requests from the client. | 
| responseStream | IServerStreamWriterStreamDirectPredictResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamDirectRawPredict(IAsyncStreamReader<StreamDirectRawPredictRequest>, IServerStreamWriter<StreamDirectRawPredictResponse>, ServerCallContext)
public virtual Task StreamDirectRawPredict(IAsyncStreamReader<StreamDirectRawPredictRequest> requestStream, IServerStreamWriter<StreamDirectRawPredictResponse> responseStream, ServerCallContext context)Perform a streaming online prediction request to a gRPC model server for custom containers.
| Parameters | |
|---|---|
| Name | Description | 
| requestStream | IAsyncStreamReaderStreamDirectRawPredictRequestUsed for reading requests from the client. | 
| responseStream | IServerStreamWriterStreamDirectRawPredictResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamGenerateContent(GenerateContentRequest, IServerStreamWriter<GenerateContentResponse>, ServerCallContext)
public virtual Task StreamGenerateContent(GenerateContentRequest request, IServerStreamWriter<GenerateContentResponse> responseStream, ServerCallContext context)Generate content with multimodal inputs with streaming support.
| Parameters | |
|---|---|
| Name | Description | 
| request | GenerateContentRequestThe request received from the client. | 
| responseStream | IServerStreamWriterGenerateContentResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamRawPredict(StreamRawPredictRequest, IServerStreamWriter<HttpBody>, ServerCallContext)
public virtual Task StreamRawPredict(StreamRawPredictRequest request, IServerStreamWriter<HttpBody> responseStream, ServerCallContext context)Perform a streaming online prediction with an arbitrary HTTP payload.
| Parameters | |
|---|---|
| Name | Description | 
| request | StreamRawPredictRequestThe request received from the client. | 
| responseStream | IServerStreamWriterHttpBodyUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamingPredict(IAsyncStreamReader<StreamingPredictRequest>, IServerStreamWriter<StreamingPredictResponse>, ServerCallContext)
public virtual Task StreamingPredict(IAsyncStreamReader<StreamingPredictRequest> requestStream, IServerStreamWriter<StreamingPredictResponse> responseStream, ServerCallContext context)Perform a streaming online prediction request for Vertex first-party products and frameworks.
| Parameters | |
|---|---|
| Name | Description | 
| requestStream | IAsyncStreamReaderStreamingPredictRequestUsed for reading requests from the client. | 
| responseStream | IServerStreamWriterStreamingPredictResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. | 
StreamingRawPredict(IAsyncStreamReader<StreamingRawPredictRequest>, IServerStreamWriter<StreamingRawPredictResponse>, ServerCallContext)
public virtual Task StreamingRawPredict(IAsyncStreamReader<StreamingRawPredictRequest> requestStream, IServerStreamWriter<StreamingRawPredictResponse> responseStream, ServerCallContext context)Perform a streaming online prediction request through gRPC.
| Parameters | |
|---|---|
| Name | Description | 
| requestStream | IAsyncStreamReaderStreamingRawPredictRequestUsed for reading requests from the client. | 
| responseStream | IServerStreamWriterStreamingRawPredictResponseUsed for sending responses back to the client. | 
| context | ServerCallContextThe context of the server-side call handler being invoked. | 
| Returns | |
|---|---|
| Type | Description | 
| Task | A task indicating completion of the handler. |