Upload Audio

The Upload Audio control allows a user to upload an audio file when running a workflow.

The Upload Audio control is an input component that allows a user to upload an audio file (e.g., MP3, WAV, M4A) when they run a workflow. The uploaded audio is then accessible via a temporary URL that can be used by downstream nodes for processing.

When to Use

Use the Upload Audio control for any workflow that needs to process a user-provided audio file. Common use cases include:

Transcribing speech to text.
Summarizing the content of a meeting or call.
Translating speech from one language to another.
Identifying different speakers in a recording.

How to Add

From your workflow's Build page, click on + Add Input.
Select Upload Audio from the list of available controls.

Configuration

Title: The main label for the file upload field.
Description: Optional helper text to provide more context.
Variable Name: The name used to access the URL of the uploaded audio file.
Optional/Required: A toggle to specify if the user must upload a file.

Accessing the Value

After the user uploads a file, this control outputs a temporary, secure URL pointing to that audio file. This URL is accessible via the variable name you assigned.

For example, if the variable name is podcast_episode, the value of {{podcast_episode}} will be a URL (e.g., https://cdn.agenticflow.ai/.../episode.mp3).

Note: This URL is temporary and will expire. It is meant for immediate processing within the workflow run.

Example

Goal: Allow a user to upload a recording of a customer support call and generate a summary and a list of action items.

Add an Upload Audio Control:
- Title: "Upload Support Call Recording"
- Variable Name: call_recording_url
Add an OpenAI MCP Node (Whisper):
- Action: Create Transcription
- File: {{call_recording_url}}
- Purpose: To convert the spoken audio into text.

Add another OpenAI MCP Node (GPT):

Action: Chat
Prompt:

Read the following transcript of a customer support call. Provide a one-paragraph summary of the issue and a bulleted list of action items for the support agent.

Transcript:
{{openai_mcp_1.text}}

Purpose: To analyze the transcribed text and extract key information.

This workflow takes a raw audio file and turns it into a structured, actionable summary, automating the post-call analysis process.

PreviousUpload Video NextMultiple Media

Last updated 3 months ago

hashtagWhen to Use

hashtagHow to Add

hashtagConfiguration

hashtagAccessing the Value

hashtagExample

When to Use

How to Add

Configuration

Accessing the Value

Example