Convert text to natural-sounding speech audio using advanced text-to-speech technology. This node supports multiple voice providers including Azure, OpenAI, Google, and AWS Polly, offering a wide selection of voices in different languages and styles.
Provider
PixelML
Connection
| Name | Description | Required | Category |
| --- | --- | --- | --- |
| PixelML Connection | The PixelML connection used to call the PixelML API. | ✓ | pixelml |
Input Parameters
| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| voice | string (enum) | ✓ | - | Voice to use for text-to-speech. Choose from a variety of voices across multiple providers (Azure, OpenAI, Google, AWS Polly). |
This node converts text into natural-sounding speech audio using PixelML's text-to-speech API, which integrates multiple voice providers including Azure, OpenAI, Google, and AWS Polly. You provide the text content and select a voice from the available options. The node then generates an audio file and returns its URL, along with caption data and duration information for downstream processing.
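A downstream step usually needs the audio URL, the caption data, and the duration as typed values. The sketch below shows one way to validate the node's output before using it. Only `caption_url` is named in this page; the `audio_url` and `duration` key names, the assumption that duration is in seconds, and the `SpeechResult` and `parse_speech_result` helpers are hypothetical and should be checked against the actual node output.

```python
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class SpeechResult:
    audio_url: str    # assumed key name for the generated audio file URL
    caption_url: str  # key name taken from the Notes section of this page
    duration: float   # assumed key name; assumed to be seconds


def parse_speech_result(raw: Mapping[str, Any]) -> SpeechResult:
    """Check that the expected fields exist and convert them to typed values."""
    expected = ("audio_url", "caption_url", "duration")
    missing = [key for key in expected if key not in raw]
    if missing:
        raise KeyError(f"node output is missing fields: {', '.join(missing)}")
    return SpeechResult(
        audio_url=str(raw["audio_url"]),
        caption_url=str(raw["caption_url"]),
        duration=float(raw["duration"]),
    )
```

Failing early on a missing field keeps later steps, such as video synchronization, from running with incomplete data.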
Usage Examples
Example 1: Convert Article to Audio with Female Voice
Input:
voice: "Joanna"
text: "Welcome to our podcast. Today we're discussing the future of artificial intelligence and its impact on everyday life."
Example 3: Create News-Style Audio with Professional Voice
Input:
voice: "en_US_News_K"
text: "Breaking news: Scientists have made a groundbreaking discovery in renewable energy technology that could revolutionize power generation worldwide."
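If the inputs are assembled programmatically, a small local check can catch the "Empty Text" case before the node runs. This is a minimal sketch: `build_tts_inputs` is a hypothetical helper, not part of PixelML, and rejecting whitespace-only text is an assumption that is stricter than the documented "at least one character" rule.

```python
from typing import Dict


def build_tts_inputs(voice: str, text: str) -> Dict[str, str]:
    """Return the node's two inputs after basic local checks."""
    if not voice or not voice.strip():
        raise ValueError("voice must be a non-empty voice name")
    # Assumption: whitespace-only text is treated as empty.
    if not text or not text.strip():
        raise ValueError("text must contain at least one character")
    return {"voice": voice.strip(), "text": text}


# Inputs from Example 1 above.
example_1 = build_tts_inputs(
    "Joanna",
    "Welcome to our podcast. Today we're discussing the future of "
    "artificial intelligence and its impact on everyday life.",
)
```

The helper does not check whether the voice name is valid, because the full list of supported voices is not given here.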
Common Use Cases
Podcast Generation: Convert written content, blog posts, or articles into audio format for podcast distribution
Accessibility: Create audio versions of written content to make it accessible to visually impaired users
E-Learning: Generate narration for educational videos, courses, and training materials
Voice Notifications: Create custom voice alerts and notifications for applications and systems
Audiobook Production: Convert written books or documents into audiobook format
Video Voiceovers: Generate professional voiceovers for marketing videos, explainer videos, and presentations
Multilingual Content: Produce audio content in multiple languages using native-sounding voices
Error Handling
| Error Type | Cause | Solution |
| --- | --- | --- |
| Invalid Connection | PixelML connection is not configured or has expired | Verify and update your PixelML connection credentials in the connection settings |
| Text Too Long | Input text exceeds the maximum character limit | Split the text into smaller chunks, process them separately, then concatenate the audio files |
| Invalid Voice | Selected voice is not available or is unsupported | Choose a valid voice from the available options in the voice parameter |
| API Rate Limit | Too many requests sent in a short time period | Implement delays between requests or upgrade your PixelML plan for higher limits |
| Empty Text | No text provided or text is empty | Ensure the text parameter contains at least one character |
| Network Timeout | Request timed out due to network issues | Retry the request or check network connectivity |
| Quota Exceeded | PixelML account quota has been exceeded | Upgrade your PixelML plan or wait for the quota to reset |
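For the "API Rate Limit" and "Network Timeout" cases, retrying with increasing delays is a common approach. The sketch below wraps any callable in an exponential backoff loop. `TransientTTSError` and `call_with_backoff` are hypothetical names, and how a rate-limit or timeout error actually surfaces from the node is an assumption; map the real error type into `retry_on` as needed.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


class TransientTTSError(Exception):
    """Hypothetical marker for retryable failures such as rate limits."""


def call_with_backoff(
    call: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...] = (TransientTTSError, TimeoutError),
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run `call`, retrying on `retry_on` with delays of 1x, 2x, 4x... base_delay."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

Errors such as "Invalid Voice" or "Quota Exceeded" are left out of `retry_on` on purpose, since retrying does not fix them.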
Notes
Voice Selection: Different voices are optimized for different use cases. News voices work well for formal content, Neural voices for natural conversation, and Wavenet for high-quality output.
Text Length: Longer texts will result in longer processing times. Consider splitting very long texts into manageable chunks.
Language Support: Ensure you select a voice that matches the language of your text for best pronunciation and natural sound.
Audio Format: The generated audio is typically in MP3 format, which is compatible with most platforms and devices.
Provider Differences: Each provider (Azure, OpenAI, Google, AWS) has distinct voice characteristics. Test multiple voices to find the best fit for your content.
Duration Accuracy: The returned duration value is accurate and useful for synchronizing audio with video or other media.
Caption Files: The caption_url provides a text transcript file that can be used for subtitles or accessibility purposes.
Caching: Audio files are temporarily stored and accessible via the returned URLs. Download and store files if long-term access is needed.
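The Text Length note suggests splitting very long texts into chunks. One way to do that is to break at sentence boundaries so each chunk stays under a character limit. This is a minimal sketch: `split_text` is a hypothetical helper, and `max_chars` must be supplied by the caller because the actual limit is not stated here.

```python
import re
from typing import List


def split_text(text: str, max_chars: int) -> List[str]:
    """Split text into chunks of at most max_chars, preferring sentence breaks."""
    if max_chars < 1:
        raise ValueError("max_chars must be at least 1")
    # Split after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut hard,
        # which may break in the middle of a word.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent through the node separately, and the resulting audio files concatenated in order.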