Transcribe speech to text to create a written document, such as a
text-message, email or report.
Discussion
Multiple people in a conversation or discussion. For example in a
meeting with two or more people actively participating. Typically
all the primary people speaking would be in the same room (if not,
see PHONE_CALL)
PhoneCall
A phone-call or video-conference in which two or more people, who are
not in the same room, are actively participating.
Presentation
One or more persons lecturing or presenting to others, mostly
uninterrupted.
ProfessionallyProduced
Professionally produced audio (eg. TV Show, Podcast).
Unspecified
Use case is either unknown or is something other than one of the other
values below.
VoiceCommand
Transcribe voice commands, such as for controlling a device.
Voicemail
A recorded message intended for another person to listen to.
VoiceSearch
Transcribe spoken questions and queries into text.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["\u003cp\u003eThe content details various versions of the \u003ccode\u003eGoogle.Cloud.Speech.V1\u003c/code\u003e library, with version 3.8.0 being the latest and 2.2.0 the oldest listed.\u003c/p\u003e\n"],["\u003cp\u003eThis webpage focuses on the \u003ccode\u003eInteractionType\u003c/code\u003e enum, which is part of the \u003ccode\u003eGoogle.Cloud.Speech.V1.RecognitionMetadata.Types\u003c/code\u003e namespace within the library.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eInteractionType\u003c/code\u003e enum provides a set of categories for use cases that describe the context of an audio recognition request.\u003c/p\u003e\n"],["\u003cp\u003eThere are nine defined interaction types, including \u003ccode\u003eDictation\u003c/code\u003e, \u003ccode\u003eDiscussion\u003c/code\u003e, \u003ccode\u003ePhoneCall\u003c/code\u003e, \u003ccode\u003ePresentation\u003c/code\u003e, \u003ccode\u003eProfessionallyProduced\u003c/code\u003e, \u003ccode\u003eUnspecified\u003c/code\u003e, \u003ccode\u003eVoiceCommand\u003c/code\u003e, \u003ccode\u003eVoicemail\u003c/code\u003e, and \u003ccode\u003eVoiceSearch\u003c/code\u003e.\u003c/p\u003e\n"],["\u003cp\u003eEach \u003ccode\u003eInteractionType\u003c/code\u003e field is documented with its purpose, like \u003ccode\u003eDictation\u003c/code\u003e for creating documents from speech or \u003ccode\u003eVoiceCommand\u003c/code\u003e for controlling devices.\u003c/p\u003e\n"]]],[],null,[]]