Transform this raw talking-head video into a premium, viral-quality social media reel while preserving the original performance completely.

WhatsApp Group Join Now
Telegram Group Join Now

IMPORTANT PRESERVATION RULES:

  • Keep the original speaker exactly as shown in the uploaded video.
  • Preserve the original face, body proportions, hairstyle, skin tone, clothing, shirt color, and all visual characteristics without modification.
  • Keep the original audio completely unchanged.
  • Maintain perfect lip-sync with the original spoken audio.
  • Preserve every original hand gesture, hand movement, finger movement, body posture, head movement, facial expression, eye movement, and natural speaking behavior exactly as captured in the source footage.
  • Do not generate new gestures, new poses, or alternative body movements.
  • Do not replace, modify, or reinterpret any physical action performed by the speaker.
  • All editing must be applied as enhancement layers on top of the existing footage.

The final edit should feel like a professionally edited creator reel while remaining fully faithful to the raw recording.

The speaker’s natural delivery, timing, expressions, gestures, lip movements, and body language must remain identical to the original video.

WhatsApp Group Join Now
Telegram Group Join Now

Only enhance the video through editing techniques such as:

  • Motion graphics
  • Cinematic typography
  • Dynamic captions
  • Camera punch-ins
  • Reframing
  • Speed ramps
  • B-roll overlays
  • Visual callouts
  • UI animations
  • Sound design
  • Transitions
  • Color-consistent graphic elements

The original performance must remain the foundation of the entire video, with all visual enhancements supporting — not replacing — the speaker’s authentic presentation.

Prompts

Direct prompt editing - Transform this raw talking-head video into a premium, viral-quality social media reel with high-retention editing, cinematic motion graphics, and modern creator-style visuals. Keep the original speaker, original audio, and original shirt color completely unchanged — do not alter, recolor, or modify the subject's appearance, clothing, or skin tone in any way. Preserve the exact video footage as-is and only add editing layers on top.

Generate accurate subtitles directly from the spoken audio. The speaker is speaking in Hinglish, so subtitles must match exactly what is being said in the video — word for word, in the same Hinglish as spoken. Written entirely in English characters (ABC format only). Never convert subtitles into Hindi script or Devanagari. Never paraphrase or rewrite the spoken words.

Caption design should be a major visual element,

Typography Style: Large editorial fonts, luxury cinematic typography, mixed font weights, layered text compositions, kinetic typography, motion-tracked text, depth and parallax effects, premium 3D text treatments.

Color Style: Extract colors directly from the video footage and build the entire typography and graphic palette around the scene. Use the red tones visible on the shirt logo as the primary accent color. Build red-based gradients blending with orange, crimson, burgundy, dark cherry, and warm highlights. All text colors must feel naturally integrated into the video's existing color palette. Important keywords should have unique gradient treatments and premium glow effects that complement — not clash with — the actual footage colors.

Whenever the speaker makes a strong point, create huge cinematic text moments, let key words dominate the screen, use layered typography behind the speaker, add scale animations, depth, shadows, lighting effects, and subtle motion.

Add dynamic zoom-ins, punch-ins, reframing, speed ramps, motion blur transitions, and seamless camera movement. Add relevant B-roll, graphics, UI animations, visual metaphors, icons, callouts, motion graphics, and overlay effects to support the spoken content visually.


Add professional sound design including whooshes, impacts, swipes, clicks, risers, transition sounds, and subtle cinematic audio enhancement — all synced to the original spoken audio.

Maintain fast pacing with meaningful visual changes every few seconds to maximize retention.

The final output should feel like a premium reel edited by a top-tier content agency, with typography and captions acting as a central visual storytelling element. I am also sharing a font style reference — match the typography aesthetic to that reference.


make sure to follow the same hand motion and gesture as uploaded video, audio and lipsync also remain similar to raw video

Output

READ ALSO  How to Create 100% Realistic Human Images Using AI (Without Plastic Skin Texture)

Leave a Comment