Short guide. Read the first three sections: Welcome, Models, Prompting, and you'll cover 90% of what matters.
Welcome
bopGPT generates photos and short videos that look like a person you train it on. You upload reference photos to create a model, write a short prompt, and pick a couple of settings. We do the rest.
Two things drive output quality more than anything else: the photos you used to build the model, and how you prompt. Both are covered in the next two sections. Read those if you skip the rest.
Models
Unlimited
Per shoot
1–4 images
Video length
3–30s
Building a Model
Output is only as good as the source photos. A model trained on 8 sharp, varied images will out-generate one trained on 30 mediocre ones. Every time.
4 photos, max. The recommended mix is 2 Face, 1 Body, 1 Full. That covers what the model needs without diluting the signal.
When you upload a photo, you tag it. Each tag does a different job. Stick to the recommended counts and you're set.
Close-up. Sharp eyes, neutral expression, no obstructions. Good lighting (window light works). The most important set. Pick your best two.
Torso and shoulders visible. Outfit clearly readable.
Head to toe. Pick a clean, well-lit standing pose.
Lifestyle / context shot. Only use if you have a slot left over and the photo adds something Face/Body/Full doesn't already cover.
Photos like these. Crisp, well-lit, framed correctly for each tag.



Open any model, add or remove photos, save. The next generation uses the updated set. No re-training step on your end.
Prompting
Your prompt gets rewritten and enriched on our side before it reaches the model. We add framing, lighting, lens, and style cues based on the camera and angle settings you picked. When you stack your own jargon on top, it usually fights our enhancer instead of helping it.
Write a short, plain sentence. Subject + setting. That's it.
Subject + setting + (optional) action.
That's the formula. “at the gym”, “walking on the beach”, “in a kitchen making coffee”.
Don't describe the model's appearance.
We already know what they look like. Telling us “long brown hair, blue eyes” will only confuse it.
One scene per prompt.
Don't chain “and then she walks out and…”. Generate it as a separate prompt.
Image Settings
Mode
Camera
Angle
Images
Two different AI engines, not newer vs older. Each has its own look. Try both on the same prompt; one will land closer to the model on any given shot. Switching engines costs the same as switching camera or angle: nothing extra.
Image to Video
Upload a start image and we'll animate it for 5–15 seconds. Optionally provide an end image to control where the clip lands.
Describe the motion, not the appearance.
Duration
Longer clips cost more credits and take longer. Pick the shortest that captures the motion.
Generate Audio
Leave on unless you're adding your own track. The model picks ambient sound that fits.
Engine A vs B
Same idea as image generation. Different engines, not versions. Try both.
Motion Control
Two inputs: a character image (your model) and a reference video (any clip: yours, a TikTok download, anything). The character image's appearance is preserved; the reference video's motion is applied.
Reference video
Character image
Upload your reference video first.
Drop in any 3–30s clip with the motion you want copied. Look at the very first frame of the video before moving on.
Check the start frame's orientation.
Is the person facing the camera? Side profile? Three-quarter turn? Looking down? Whatever the first frame shows is what your character image needs to match.
Pick a character image in the same orientation.
Browse your model's gallery and pick the photo whose pose and angle line up with the video's first frame. Or hit Auto-select and we'll pick for you.
Add a prompt only if you need to.
Leave it blank and you'll usually get a solid result. Use it for wardrobe or scene tweaks.
Generate.
Pick Engine A or B and submit. If the first result doesn't land, swap engines and try again before changing the inputs.
The first frame of the reference video on the left, the character image on the right. Same body orientation, same general framing.


Match the orientation of your character image to the first frame of the reference video. The model uses your photo as the “before” pose and the video's motion as the path. If they don't line up, the first second of the clip will warp while the model snaps into the right pose.
Matches
Lines up cleanly. Motion tracks from frame one.
Doesn't match
Model has to rotate before the motion starts. Warps the first ~1s.
Unlike image generation, the prompt is a tiebreaker here, not the main signal. Leave it blank and you'll usually get a solid result. Use it to nudge style or wardrobe (“in a black dress”) or set the scene (“in a kitchen”).
Both take the same inputs. Try both. They're tuned differently. On any given clip, one will track the reference motion more closely than the other.
Presets
Open the bookmark icon next to the prompt input. Save your current prompt and settings as a preset. Apply it next time with one tap.
Image, Image-to-Video, and Motion Control each have their own preset library. Your image presets won't show up inside the video modal, and vice versa. The prompts don't translate between contexts anyway.
Image presets
Video presets
(both kinds)
Credits & Support
Every generation costs credits. The exact cost is shown next to the send button before you submit, so there are no surprises.
Subscription credits
Reset at the start of each billing cycle. Use them or lose them.
Bonus credits
One-time top-ups never expire and are spent first, before subscription credits.
Out of credits mid-cycle?
Top up from the Billing page.
Bad output you'd like refunded, missing credits, or anything broken? Open a ticket from Support. Include the prompt and a link to the Gallery item if there is one. Faster context = faster fix.
Still stuck?
Reach out. We read every ticket.