Text to video
Action ID: text_to_video
Description
Text to video
Connection
PixelML Connection
The PixelML connection to call PixelML API.
True
pixelml
Input Schema
{
"description": "Text to video node input.",
"properties": {
"model": {
"default": "Pixel S",
"description": "Model",
"enum": [
"Pixel S",
"Pixel M",
"Pixel L"
],
"title": "Model",
"type": "string"
},
"prompt": {
"description": "Prompt",
"title": "Prompt",
"type": "string"
},
"negative_prompt": {
"anyOf": [
{
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"description": "Negative prompt",
"title": "Negative prompt"
},
"steps": {
"default": 30,
"description": "Steps",
"maximum": 50,
"minimum": 1,
"title": "Steps",
"type": "integer"
},
"guidance_scale": {
"default": 7,
"description": "Guidance scale",
"maximum": 10,
"minimum": 2,
"title": "Guidance scale",
"type": "integer"
},
"size": {
"default": "Landscape HD (16:9)",
"description": "Size (Only for Pixel M)",
"enum": [
"Square HD (1:1)",
"Square FullHD (1:1)",
"Portrait HD (3:4)",
"Portrait FullHD (3:4)",
"Portrait HD (9:16)",
"Portrait FullHD (9:16)",
"Landscape HD (4:3)",
"Landscape FullHD (4:3)",
"Landscape HD (16:9)",
"Landscape FullHD (16:9)"
],
"title": "Size (Only for Pixel M)",
"type": "string"
},
"fps": {
"default": 30,
"description": "FPS (Only for Pixel M)",
"title": "FPS (Only for Pixel M)",
"type": "integer"
},
"duration": {
"default": 2,
"description": "Duration in seconds (Only for Pixel L)",
"title": "Duration in seconds (Only for Pixel L)",
"type": "integer"
}
},
"required": [
"prompt"
],
"title": "TextToVideoNodeInput",
"type": "object"
}Output Schema
{
"description": "Text to video node output.",
"properties": {
"video_url": {
"description": "Video URL",
"title": "Video URL",
"type": "string"
}
},
"required": [
"video_url"
],
"title": "TextToVideoNodeOutput",
"type": "object"
}How It Works
This node generates AI-powered videos from text descriptions using PixelML's text-to-video models. You select a model size (S, M, or L) based on your quality and duration needs, then provide a descriptive prompt. The AI interprets your text, creates a sequence of frames that match your description, and renders them into a smooth video with motion. Quality is controlled through steps and guidance scale parameters, while negative prompts help exclude unwanted elements. Different model sizes offer different capabilities—Pixel S for quick previews, Pixel M for customizable size and FPS, and Pixel L for longer durations.
Usage Examples
Example 1: Quick Social Media Clip (Pixel S)
Input:
model: "Pixel S"
prompt: "A coffee cup steaming on a wooden table, morning sunlight streaming through window, peaceful and calming atmosphere"
negative_prompt: "people, text, logos, blurry, distorted"
steps: 30
guidance_scale: 7Output:
video_url: "https://pixelml.com/output/coffee_morning_scene.mp4"Example 2: Product Demo Video (Pixel M)
Input:
model: "Pixel M"
prompt: "Modern smartphone rotating slowly, sleek black finish, studio lighting with reflections, professional product showcase"
negative_prompt: "scratches, fingerprints, text, watermarks"
steps: 40
guidance_scale: 8
size: "Landscape HD (16:9)"
fps: 60Output:
video_url: "https://pixelml.com/output/smartphone_showcase_hd.mp4"Example 3: Extended Cinematic Scene (Pixel L)
Input:
model: "Pixel L"
prompt: "Majestic mountain landscape at sunset, clouds moving slowly across orange sky, camera slowly panning across peaks, cinematic and epic"
negative_prompt: "people, buildings, cars, modern objects, text"
steps: 50
guidance_scale: 9
duration: 5Output:
video_url: "https://pixelml.com/output/mountain_sunset_cinematic.mp4"Common Use Cases
Social Media Content: Create engaging video clips for Instagram Stories, TikTok, and other social platforms
Product Demonstrations: Generate product showcase videos without filming physical products
Marketing Materials: Produce b-roll footage and background videos for marketing campaigns
Concept Visualization: Visualize ideas and concepts for presentations and pitches
Video Backgrounds: Create looping background videos for websites, streams, and virtual events
Educational Content: Generate illustrative videos to explain concepts and processes
Prototyping: Quickly create video mockups to test ideas before investing in professional production
Error Handling
Empty Prompt
Prompt field is empty or null
Provide a descriptive text prompt for the video you want to generate
Invalid Model
Model selection not in allowed values (S, M, L)
Choose one of the supported models: Pixel S, Pixel M, or Pixel L
Invalid Steps
Steps value outside 1-50 range
Set steps between 1 and 50 (recommended: 30-40 for quality)
Invalid Guidance Scale
Guidance scale outside 2-10 range
Set guidance scale between 2 and 10 (recommended: 7-9)
Invalid Size
Size format not recognized (Pixel M only)
Choose from predefined size options like "Landscape HD (16:9)"
Duration Too Long
Duration exceeds model limits (Pixel L only)
Reduce duration or split into multiple shorter videos
PixelML Connection Error
Invalid or missing PixelML API credentials
Verify your PixelML connection is properly configured with valid API key
Notes
Model Selection: Pixel S is fastest (quick previews), Pixel M offers more control (custom size/FPS), Pixel L creates longer videos (up to several seconds)
Prompt Quality: Detailed prompts with camera movement, lighting, and mood descriptions produce better results
Negative Prompts: Use to exclude unwanted elements like text, people, or artifacts from generated videos
Steps Parameter: More steps (40-50) produce smoother motion but take longer to process
Guidance Scale: Higher values (8-10) stick closer to your prompt, lower values (3-6) allow more creative interpretation
FPS Consideration: Higher FPS (60) creates smoother motion but increases file size (Pixel M only)
Processing Time: Video generation takes significantly longer than image generation—expect 2-10 minutes depending on model and settings
Size vs Quality: Higher resolutions (FullHD) produce better quality but require more processing time and credits
Last updated
Was this helpful?