Transcribe speech to text to create a written document, such as a
text-message, email or report.
Discussion
Multiple people in a conversation or discussion. For example in a
meeting with two or more people actively participating. Typically
all the primary people speaking would be in the same room (if not,
see PHONE_CALL)
PhoneCall
A phone-call or video-conference in which two or more people, who are
not in the same room, are actively participating.
Presentation
One or more persons lecturing or presenting to others, mostly
uninterrupted.
ProfessionallyProduced
Professionally produced audio (eg. TV Show, Podcast).
Unspecified
Use case is either unknown or is something other than one of the other
values below.
VoiceCommand
Transcribe voice commands, such as for controlling a device.
Voicemail
A recorded message intended for another person to listen to.
VoiceSearch
Transcribe spoken questions and queries into text.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["\u003cp\u003eThis page provides documentation for the \u003ccode\u003eRecognitionMetadata.Types.InteractionType\u003c/code\u003e enum within the Google Cloud Speech v1 API, detailing various audio recognition use case categories.\u003c/p\u003e\n"],["\u003cp\u003eThe available versions of the Google Cloud Speech v1 API documentation for \u003ccode\u003eRecognitionMetadata.Types.InteractionType\u003c/code\u003e range from version 2.2.0 up to the latest, 3.8.0, and are listed.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eRecognitionMetadata.Types.InteractionType\u003c/code\u003e enum includes fields such as \u003ccode\u003eDictation\u003c/code\u003e, \u003ccode\u003eDiscussion\u003c/code\u003e, \u003ccode\u003ePhoneCall\u003c/code\u003e, \u003ccode\u003ePresentation\u003c/code\u003e, \u003ccode\u003eProfessionallyProduced\u003c/code\u003e, \u003ccode\u003eUnspecified\u003c/code\u003e, \u003ccode\u003eVoiceCommand\u003c/code\u003e, \u003ccode\u003eVoicemail\u003c/code\u003e, and \u003ccode\u003eVoiceSearch\u003c/code\u003e, each corresponding to a different use case.\u003c/p\u003e\n"],["\u003cp\u003eThe enum of \u003ccode\u003eRecognitionMetadata.Types.InteractionType\u003c/code\u003e categorizes audio input by its use case, to make it easier to transcribe, and it is associated with the namespace \u003ccode\u003eGoogle.Cloud.Speech.V1\u003c/code\u003e and assembly \u003ccode\u003eGoogle.Cloud.Speech.V1.dll\u003c/code\u003e.\u003c/p\u003e\n"]]],[],null,[]]