The first attempt turned out pretty good, but the model may be underbaked. It works pretty well with a strength of 1.3 to 1.5 but still gets confused if the prompt mentions features like faces, teeth, hoods, etc.
Create & Edit Images Instantly with Grok Imagine
Try Grok Imagine