Text to video

Action ID: text_to_video

Description

Text to video

Connection

Name

Description

Required

Input Schema

{
  "description": "Text to video node input.",
  "properties": {
    "model": {
      "default": "Pixel S",
      "description": "Model",
      "enum": [
        "Pixel S",
        "Pixel M",
        "Pixel L"
      ],
      "title": "Model",
      "type": "string"
    },
    "prompt": {
      "description": "Prompt",
      "title": "Prompt",
      "type": "string"
    },
    "negative_prompt": {
      "anyOf": [
        {
          "type": "string"
        },
        {
          "type": "null"
        }
      ],
      "default": null,
      "description": "Negative prompt",
      "title": "Negative prompt"
    },
    "steps": {
      "default": 30,
      "description": "Steps",
      "maximum": 50,
      "minimum": 1,
      "title": "Steps",
      "type": "integer"
    },
    "guidance_scale": {
      "default": 7,
      "description": "Guidance scale",
      "maximum": 10,
      "minimum": 2,
      "title": "Guidance scale",
      "type": "integer"
    },
    "size": {
      "default": "Landscape HD (16:9)",
      "description": "Size (Only for Pixel M)",
      "enum": [
        "Square HD (1:1)",
        "Square FullHD (1:1)",
        "Portrait HD (3:4)",
        "Portrait FullHD (3:4)",
        "Portrait HD (9:16)",
        "Portrait FullHD (9:16)",
        "Landscape HD (4:3)",
        "Landscape FullHD (4:3)",
        "Landscape HD (16:9)",
        "Landscape FullHD (16:9)"
      ],
      "title": "Size (Only for Pixel M)",
      "type": "string"
    },
    "fps": {
      "default": 30,
      "description": "FPS (Only for Pixel M)",
      "title": "FPS (Only for Pixel M)",
      "type": "integer"
    },
    "duration": {
      "default": 2,
      "description": "Duration in seconds (Only for Pixel L)",
      "title": "Duration in seconds (Only for Pixel L)",
      "type": "integer"
    }
  },
  "required": [
    "prompt"
  ],
  "title": "TextToVideoNodeInput",
  "type": "object"
}

Output Schema

{
  "description": "Text to video node output.",
  "properties": {
    "video_url": {
      "description": "Video URL",
      "title": "Video URL",
      "type": "string"
    }
  },
  "required": [
    "video_url"
  ],
  "title": "TextToVideoNodeOutput",
  "type": "object"
}

How It Works

This node generates AI-powered videos from text descriptions using PixelML's text-to-video models. You select a model size (S, M, or L) based on your quality and duration needs, then provide a descriptive prompt. The AI interprets your text, creates a sequence of frames that match your description, and renders them into a smooth video with motion. Quality is controlled through steps and guidance scale parameters, while negative prompts help exclude unwanted elements. Different model sizes offer different capabilities—Pixel S for quick previews, Pixel M for customizable size and FPS, and Pixel L for longer durations.

Usage Examples

Input:

model: "Pixel S"
prompt: "A coffee cup steaming on a wooden table, morning sunlight streaming through window, peaceful and calming atmosphere"
negative_prompt: "people, text, logos, blurry, distorted"
steps: 30
guidance_scale: 7

Output:

video_url: "https://pixelml.com/output/coffee_morning_scene.mp4"

Example 2: Product Demo Video (Pixel M)

Input:

model: "Pixel M"
prompt: "Modern smartphone rotating slowly, sleek black finish, studio lighting with reflections, professional product showcase"
negative_prompt: "scratches, fingerprints, text, watermarks"
steps: 40
guidance_scale: 8
size: "Landscape HD (16:9)"
fps: 60

Output:

video_url: "https://pixelml.com/output/smartphone_showcase_hd.mp4"

Example 3: Extended Cinematic Scene (Pixel L)

Input:

model: "Pixel L"
prompt: "Majestic mountain landscape at sunset, clouds moving slowly across orange sky, camera slowly panning across peaks, cinematic and epic"
negative_prompt: "people, buildings, cars, modern objects, text"
steps: 50
guidance_scale: 9
duration: 5

Output:

video_url: "https://pixelml.com/output/mountain_sunset_cinematic.mp4"

Common Use Cases

Social Media Content: Create engaging video clips for Instagram Stories, TikTok, and other social platforms
Product Demonstrations: Generate product showcase videos without filming physical products
Marketing Materials: Produce b-roll footage and background videos for marketing campaigns
Concept Visualization: Visualize ideas and concepts for presentations and pitches
Video Backgrounds: Create looping background videos for websites, streams, and virtual events
Educational Content: Generate illustrative videos to explain concepts and processes
Prototyping: Quickly create video mockups to test ideas before investing in professional production

Error Handling

Error Type

Cause

Solution

Empty Prompt

Prompt field is empty or null

Provide a descriptive text prompt for the video you want to generate

Invalid Model

Model selection not in allowed values (S, M, L)

Choose one of the supported models: Pixel S, Pixel M, or Pixel L

Invalid Steps

Steps value outside 1-50 range

Set steps between 1 and 50 (recommended: 30-40 for quality)

Invalid Guidance Scale

Guidance scale outside 2-10 range

Set guidance scale between 2 and 10 (recommended: 7-9)

Invalid Size

Size format not recognized (Pixel M only)

Choose from predefined size options like "Landscape HD (16:9)"

Duration Too Long

Duration exceeds model limits (Pixel L only)

Reduce duration or split into multiple shorter videos

PixelML Connection Error

Invalid or missing PixelML API credentials

Verify your PixelML connection is properly configured with valid API key

Notes

Model Selection: Pixel S is fastest (quick previews), Pixel M offers more control (custom size/FPS), Pixel L creates longer videos (up to several seconds)
Prompt Quality: Detailed prompts with camera movement, lighting, and mood descriptions produce better results
Negative Prompts: Use to exclude unwanted elements like text, people, or artifacts from generated videos
Steps Parameter: More steps (40-50) produce smoother motion but take longer to process
Guidance Scale: Higher values (8-10) stick closer to your prompt, lower values (3-6) allow more creative interpretation
FPS Consideration: Higher FPS (60) creates smoother motion but increases file size (Pixel M only)
Processing Time: Video generation takes significantly longer than image generation—expect 2-10 minutes depending on model and settings
Size vs Quality: Higher resolutions (FullHD) produce better quality but require more processing time and credits

PreviousText to speech with voice clone NextUpload Video to TikTok

Last updated 3 months ago

hashtagDescription

hashtagConnection

hashtagInput Schema

hashtagOutput Schema

hashtagHow It Works

hashtagUsage Examples

hashtagExample 1: Quick Social Media Clip (Pixel S)

hashtagExample 2: Product Demo Video (Pixel M)

hashtagExample 3: Extended Cinematic Scene (Pixel L)

hashtagCommon Use Cases

hashtagError Handling

hashtagNotes

Description

Connection

Input Schema

Output Schema

How It Works

Usage Examples

Example 1: Quick Social Media Clip (Pixel S)

Example 2: Product Demo Video (Pixel M)

Example 3: Extended Cinematic Scene (Pixel L)

Common Use Cases

Error Handling

Notes