Craft Lip Sync
Professional lip sync video generation guide
Craft Lip Sync
Overview
Craft Lip Sync (formerly Video Craft Pro) is a professional lip-sync video generation feature that creates videos where actors' lips are perfectly synchronized with your custom audio. This feature is ideal for creating talking head videos, presentations, tutorials, and any content where precise lip synchronization is important.
When to Use Craft Lip Sync
- Talking Head Videos: Create videos with actors speaking your script
- Tutorials: Generate tutorial videos with synchronized speech
- Presentations: Create presentation videos with custom voice
- Marketing Content: Produce marketing videos with professional lip sync
- Custom Voice Content: Use your own audio files for complete control
Step-by-Step Guide
Step 1: Select Actor (Required)
-
Navigate to Custom Mode:
- Click on "Craft Lip Sync" card from the dashboard (or Video Craft Pro)
-
Choose an Actor:
- In the "Step 1: Select Actor" section, you'll see several tabs:
- Actors Tab (default): Browse pre-generated actor images
- Upload Tab: Upload your own actor image
- Generate Tab: Generate a new actor image using AI
- History Tab: Select from your previous generations
- In the "Step 1: Select Actor" section, you'll see several tabs:
-
Select Your Actor:
- Click on any actor image from the Actors tab, or
- Upload an image from the Upload tab, or
- Generate a new actor from the Generate tab, or
- Choose a previous generation from the History tab
-
Preview: The selected actor will appear in the preview canvas
-
Zoom Controls: Use zoom controls to inspect the actor image
Note: The selected actor image is required before you can proceed.
Step 2: Voice & Audio (Required)
The Voice & Audio section is collapsible - click the header to expand/collapse it.
Upload Audio File
-
Voice Selector Component:
- The Voice Selector allows you to upload your own audio file
- Supported formats: MP3, WAV, M4A, and other common audio formats
- Required: An audio file is required for lip sync generation
-
Upload Process:
- Click in the upload area or drag and drop an audio file
- File will upload to the platform
- Audio duration determines video duration automatically
-
Audio Requirements:
- Format: Common audio formats (MP3, WAV, M4A, etc.)
- Quality: Higher quality audio produces better lip sync
- Duration: Video duration matches audio duration automatically
- Content: Audio should contain clear speech for best results
Tips for Audio:
- Use clear, high-quality audio recordings
- Ensure speech is clear and not too fast
- Remove background noise if possible
- Test audio playback before uploading
Step 3: Behavior - Action Prompt (Required)
-
Action Prompt Field:
- In the "Behavior" section, find the "Action Prompt" field
- Required: This field must be filled
- Purpose: Describe how the actor should behave and move while speaking
-
Writing Action Prompts:
- Be Specific: Describe body language, gestures, and expressions
- Natural Movements: Use phrases like "natural hand gestures", "maintaining eye contact"
- Emotional Cues: Include emotions like "warm smile", "confident demeanor"
- Body Language: Describe how the actor should move
Example Action Prompts:
- "Speaking confidently with a warm smile, making natural hand gestures, maintaining eye contact with the camera"
- "Professional presentation style, subtle hand movements, confident posture, maintaining eye contact"
- "Enthusiastic and engaging, animated gestures, expressive facial expressions, friendly demeanor"
Tips:
- Match the action to the content of your audio
- Be specific about gestures and expressions
- Consider the tone of your audio when writing actions
Negative Prompt (Optional)
-
Negative Prompt Field:
- Below the Action Prompt, you'll find the "Negative Prompt" field
- Optional: This field is not required
- Purpose: Describe what you want to avoid in the video
-
Writing Negative Prompts:
- Common Issues: Mention things like "unnatural movements", "blurry scenes", "distorted faces"
- Quality Issues: Include terms like "poor lighting", "awkward gestures", "jerky movements"
- Help Refine: Helps improve the overall quality of the output
Example Negative Prompts:
- "unnatural movements, blurry scenes, distorted faces, poor lighting"
- "awkward gestures, jerky movements, unnatural expressions, distorted audio sync"
Note: Negative prompts help refine output but are optional.
Content Safety
- NSFW Filter: The system includes safety filters to prevent inappropriate content
- Action Prompt: Filters check the action prompt for inappropriate content
- Negative Prompt: Filters also check the negative prompt
- Validation: If inappropriate content is detected, generation will be blocked with an error message
- Error Message: "Your input contains content that violates our community guidelines. Please revise your prompt to comply with our content policy."
Step 4: Settings
Resolution
- Options: Various resolution options available (HD, FHD, etc.)
- Selection: Use the dropdown to select your preferred resolution
- Cost Impact: Higher resolutions may cost more credits
- Recommendation: Choose based on your target platform and quality needs
Resolution Options:
- HD: Standard HD resolution (good for most use cases)
- FHD: Full HD resolution (higher quality)
- Higher Resolutions: Additional options may be available
Duration
- Auto-Determined: Duration is automatically determined by your audio file length
- Cannot Set Manually: You cannot manually set duration - it matches your audio
- Display: Video duration will match the audio file duration exactly
Step 5: Organize Asset (Optional)
- Tags: Add tags to help organize this video in your history
- Tag Selector: Click on the tag dropdown to select existing tags or create new ones
Step 6: Generate Video
-
Check Credits:
- Credit cost varies based on resolution and duration
- Base cost: 20 credits (1 minute, HD)
- Longer videos or higher resolutions cost more
- Check the cost displayed on the Generate button
-
Validation:
- Actor is selected (required)
- Audio file is uploaded (required)
- Action prompt is filled (required)
- No inappropriate content detected
-
Click Generate:
- Click the "Generate Video" button
- Credits are deducted immediately
- The preview canvas will show a loading animation with progress
-
Wait for Generation:
- Generation time varies based on video duration and resolution
- Progress indicator shows generation progress
- You can wait on the page or navigate away
- The page will automatically update when complete
- You'll receive an email notification if enabled
Step 7: View Your Video
-
Video Display:
- Once complete, the video automatically appears in the preview canvas
- Video will show the actor with synchronized lip movements
- Video controls are available (play, pause, volume)
-
Download:
- Use the download button to save the video to your computer
- Video is in MP4 format with perfect lip synchronization
Credit Costs
Craft Lip Sync uses variable pricing:
- Base Cost: 20 credits (1 minute, HD resolution)
- Duration: Longer videos cost more (30 seconds ≈ 15 credits, 2 minutes ≈ 40 credits)
- Resolution: Higher resolutions may cost more
- Calculation: Cost is calculated based on duration and resolution
Example Costs:
- 30 seconds, HD: ~15 credits
- 1 minute, HD: 20 credits
- 2 minutes, HD: ~40 credits
- Higher resolutions: Additional cost
Tips for Best Results
Audio Quality
- Clear Speech: Use clear, well-recorded audio
- Good Quality: Higher quality audio produces better lip sync
- Appropriate Length: Match audio length to your needs (video duration = audio duration)
- Remove Noise: Clean up background noise if possible
Action Prompts
- Match Audio Tone: Write actions that match the tone of your audio
- Be Specific: Describe gestures and expressions clearly
- Natural Movements: Focus on natural, believable movements
- Professional Style: For professional content, use professional action descriptions
Actor Selection
- Clear Face: Choose actors with clearly visible faces
- Appropriate Style: Match actor style to your content (professional, casual, etc.)
- High Quality: Use high-quality actor images for best results
Resolution Selection
- HD: Good for most use cases, faster generation
- FHD: Best for professional content, higher quality
- Platform Consideration: Choose resolution based on your target platform
What to Expect
Generation Time
- Varies by Duration: Longer videos take longer to generate
- HD, 1 minute: Approximately 5-10 minutes
- HD, 2 minutes: Approximately 10-15 minutes
- Progress Indicator: Shows generation progress percentage
- May Vary: Based on server load and resolution
Video Quality
- Perfect Lip Sync: Precise synchronization between audio and lip movements
- Professional Output: High-quality videos suitable for professional use
- Natural Movements: Natural-looking facial expressions and movements
- Smooth Playback: Professional video quality with smooth playback
File Format
- Format: MP4 (standard video format)
- Compatible: Works with all major video players and platforms
- Ready to Use: Can be directly uploaded to any platform
Troubleshooting
Generation Failed
- Check Credits: Ensure you have enough credits for your video duration and resolution
- Check Requirements: Ensure actor, audio, and action prompt are all provided
- Content Safety: If blocked, check for inappropriate content in prompts
- Try Again: Sometimes temporary issues occur
- Refund: Credits are automatically refunded if generation fails
Content Safety Error
- Error Message: "Your input contains content that violates our community guidelines"
- Solution: Revise your action prompt or negative prompt to remove inappropriate content
- Check Both Fields: Both action prompt and negative prompt are checked
- Guidelines: Ensure content complies with community guidelines
Audio Upload Issues
- Format: Ensure audio is in a supported format (MP3, WAV, M4A, etc.)
- File Size: Check that file size is within limits
- Upload: Try uploading again if upload fails
- Quality: Use good quality audio files
Lip Sync Quality Issues
- Audio Quality: Ensure audio is clear and high quality
- Speech Clarity: Clear, well-enunciated speech works best
- Actor Selection: Choose actors with clearly visible faces
- Resolution: Try higher resolution for better quality
Credit Issues
- Check Balance: Verify your credit balance
- Review Costs: Understand that Craft Lip Sync costs more than Quick Video
- Duration Impact: Longer videos cost more credits
- Purchase Credits: Buy more credits if needed
Examples
Example 1: Tutorial Video (1 minute, HD)
Actor: Professional presenter from Actors tab Audio: 1-minute tutorial script recording Action Prompt: "Speaking clearly with a friendly demeanor, making explanatory hand gestures, maintaining eye contact, using a teaching style presentation" Negative Prompt: "unnatural movements, blurry scenes, awkward gestures" Resolution: HD Duration: 1 minute (auto-determined from audio) Result: Professional tutorial video with perfect lip synchronization
Example 2: Marketing Video (30 seconds, FHD)
Actor: Business professional from Actors tab Audio: 30-second marketing script recording Action Prompt: "Confident and enthusiastic presentation style, natural hand gestures, warm smile, maintaining eye contact, professional demeanor" Resolution: FHD Duration: 30 seconds (auto-determined from audio) Result: High-quality marketing video with professional lip sync
Next Steps
After creating your Craft Lip Sync video:
- Download: Save your video
- Review: Watch to verify lip sync quality
- Edit: Use Video Editor to make additional edits if needed
- Share: Upload to your preferred platform
- Create More: Experiment with different actors and audio files
Related Topics
- Quick Video: Fast video generation with text-to-speech
- Premium Video: Advanced video generation with more control
- Video Editor: Edit and enhance your generated videos
- Credit Costs: Understand credit costs and pricing
- History: View and manage your video generations