Zappletree
Stable Diffusion with IP-Adapter. ComfyUI guide: https://youtu.be/C94pTaKoLbU?si=ODDJRDm3W5enXoHK
LarsDu88
Stable Diffusion Dreambooth.

The basic idea is that you pick a rare, unused token (say, f$#sdafad) and then finetune your image generation model on a specific set of images (say, 20 images of your red cap at various angles) while telling it that f$#sdafad refers to your red cap.

Then you can start prompting "f$#sdafad resting on the head of a monkey" and your cap will appear on a monkey's head.
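
To make the token idea concrete, here is a rough sketch of just the text side: the caption you would attach to your training photos and the prompt you would use afterwards. The helper names are made up for illustration, and pairing the token with a class noun ("cap") is my assumption based on how Dreambooth is usually described. Nothing here trains a model.

```python
# Hypothetical helpers: they only build prompt strings, no finetuning happens here.
RARE_TOKEN = "f$#sdafad"  # the rare token from the example above
CLASS_NOUN = "cap"        # assumed coarse class of the object


def instance_prompt(token: str, class_noun: str) -> str:
    """Caption paired with each training photo during finetuning."""
    return f"a photo of {token} {class_noun}"


def generation_prompt(token: str, class_noun: str, scene: str) -> str:
    """Prompt used after finetuning to put the learned object in a new scene."""
    return f"{token} {class_noun} {scene}"


train_caption = instance_prompt(RARE_TOKEN, CLASS_NOUN)
prompt = generation_prompt(RARE_TOKEN, CLASS_NOUN, "resting on the head of a monkey")
print(train_caption)
print(prompt)
```

The same caption goes with every one of the ~20 photos; only the generation prompt changes from image to image.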

The problem with this technique is the finetuning part. Finetuning can take minutes to hours depending on how many GPUs you have, and it needs to be done separately for every new "token" you want to map to a specific individual or object.

Another strategy is autocropping plus generative infill. You can take a segmentation model like Meta's "Segment Anything" and use it to segment out the item of interest manually (perhaps a UI could be built to make this a one-step process). Then take the mask and do a generative infill using an image generation model like Stable Diffusion.
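
A toy sketch of the mask handling, assuming the goal is to keep the item's pixels and let the generator fill in everything else. Images are tiny nested lists so it runs without any libraries; the function names are mine, and `generated` is a stand-in for whatever the infill model would return.

```python
# Pure-Python sketch. item_mask is 1 where the segmenter marked the item, 0 elsewhere.

def invert_mask(item_mask):
    """Turn 'where the item is' into 'where the generator may paint'."""
    return [[1 - m for m in row] for row in item_mask]


def composite(original, generated, item_mask):
    """Keep original pixels inside the item mask, take generated pixels elsewhere."""
    return [
        [o if m else g for o, g, m in zip(orow, grow, mrow)]
        for orow, grow, mrow in zip(original, generated, item_mask)
    ]


original = [[10, 10, 10], [10, 99, 10]]  # 99 stands for the item's pixel
item_mask = [[0, 0, 0], [0, 1, 0]]       # what the segmentation step would give you
generated = [[1, 2, 3], [4, 5, 6]]       # placeholder for the infill model's output

infill_mask = invert_mask(item_mask)     # this is what you would hand to the infill model
result = composite(original, generated, item_mask)
print(infill_mask)
print(result)
```

Compositing the original pixels back on top at the end is a cheap way to make sure the item itself is not altered by the generator.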

lolinder
Somewhat tangential, but this reminds me of these delightful product photos on Amazon:

https://www.amazon.com/dp/B0C4YP8MKY

The third photo is my favorite.

muzani
Would DreamStudio do what you want? https://beta.dreamstudio.ai
65
Photoshop generative fill, perhaps?

Or... just Photoshop a person wearing the hat.

Or... just take a picture of someone wearing the hat.

lucb1e
> lets say we have a cap and I want to generate a human image wearing that same cap

Do you have an image of the cap in the right orientation, as it would also appear as it is sitting on someone's head?

If not, any algorithm will necessarily have to invent what the cap looks like from another angle, making up details on any previously hidden side and guessing at the depth of different parts of the still image in order to rotate it into the right orientation.

If yes, crop it out and paste it onto the target head.
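
For the "yes" case, the crop-and-paste step is roughly the following. This is a minimal sketch on nested lists, with a hypothetical `paste` helper and made-up pixel values; a real image library would do the same thing with a mask and an offset.

```python
def paste(target, patch, patch_mask, top, left):
    """Return a copy of target with the masked patch pixels written at (top, left)."""
    out = [row[:] for row in target]
    for r, (prow, mrow) in enumerate(zip(patch, patch_mask)):
        for c, (p, m) in enumerate(zip(prow, mrow)):
            y, x = top + r, left + c
            # Only copy pixels that belong to the cap and land inside the target.
            if m and 0 <= y < len(out) and 0 <= x < len(out[0]):
                out[y][x] = p
    return out


target = [[0] * 4 for _ in range(3)]  # stand-in for the photo of the person
cap = [[7, 7], [7, 0]]                # cropped cap pixels
cap_mask = [[1, 1], [1, 0]]           # 1 = cap, 0 = background of the crop

pasted = paste(target, cap, cap_mask, top=0, left=1)
print(pasted)
```

Picking `top` and `left` (and any scaling) so the cap actually sits on the head is the manual part.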

Gooblebrai
You are looking for platforms or implementations using Dreambooth.
swatcoder
Why do you specifically want to use AI or even care how it's done?

If you have to do this for only a few things, you can do it for a pittance through a service like Fiverr.

If you have higher or recurring volume, you can employ an offshore company to do this for a very modest monthly retainer.

Just let the provider on the other end figure out the tool that makes it cost effective for them -- AI or otherwise.

unleaded
A camera?
mg
I think the description of what you want to achieve is not clear.

Can you link to an example image which you would upload and then describe based on that example what the generated image should look like?

Curzel
Nothing free that I know of; get Adobe Firefly.
jaggs
Stable Diffusion with a suitably trained LoRA? The trick is in finding a tech which gives rapid LoRA fine-tuning. I'm sure one exists, but I can't come up with one right now.

Once you get that part right, using something like Krita with AI Diffusion should give you a nice, fast process flow.

joenot443
AFAIK this specific tech doesn't exist yet.