🎉 New Year Sale: Get 20% OFF on all plans — Use code NEWYEAR2026.

Upgrade now
LEOSAM's HelloWorld XL - HelloWorld XL 5.0 GPT4V thumbnail

ModelsLab/LEOSAM'S HelloWorld XL - HelloWorld XL 5.0 GPT4V

leosamshelloworldxl-helloworldxl50gpt4v
Text to Image Community ModelFree for Premium UsersLLMs.txt

HelloWorld 5.0 is the most substantial update in the history of the HelloWorld series, tagged with GPT-4v, and has undergone significant fine-tuning in fields such as science fiction, animals, architecture, and illustration.

Comparative tests show improvements in this version include:

1. More varied and dynamic character poses and image compositions, creating visually engaging pictures;

2. The film dataset has been extensively trained. While the film texture was weak from versions 2.0 to 4.0, many fans missed the leogirl style of version 1.0. Therefore, this update has specifically strengthened the film texture without compromising other photographic qualities. The film texture can be triggered by phrases such as film grain texture and analog photography aesthetic;

3. Enhanced expressiveness in themes like science fiction, thriller, and animals, with mechas and other subjects having a more designed feel. Animals like the Pallas's cat, snow leopard, red panda, giant panda, tiger, and domestic cats and dogs are more lifelike;

4. Thanks to GPT tagging, prompt adherence and conceptual accuracy have been further improved.

However, the drawbacks of this version include:

1. As this is a substantial fine-tuning update, the error rate for limbs and such may slightly increase, a normal phenomenon when moving out of a comfort zone into new areas of relative optimization. Previous versions underwent extensive limb testing for improvements, while the new version had limited time for such enhancements. Nevertheless, the accuracy of limbs in this version is at least higher than in version 1.0, and I will continue to make improvements in future updates.

2. Due to the reinforced film texture, even though GPT tagging is as accurate as possible, there can be an unavoidable default warm tone in images. However, you can use prompts like studio light or sharp focus to produce high-definition studio-quality images, and with proper use of prompts, the output can have better skin tones and visual appeal than previous versions.

3. This version includes more full-body character images to enhance the full-body effect, so the model may produce wider scenes than before if no specific character composition is directed. Currently, the facial details in 1024 resolution full-body shots might be less sharp compared to half-body or close-up shots. However, this can be improved by using prompts like adetailer and a 1.5x Hires. fix at 0.3 intensity, or by specifying composition to avoid generating full-body images.

4. Since a small number of high-quality illustration datasets have been added, there is a chance that prompts related to animated styles will produce animated images. If this concerns you, please adjust your prompts accordingly.

These are the main updates for this version. Training the SDXL large model is challenging, and when the training set approaches ten thousand images, the cost for tagging and training for each model exceeds 300 USD. I welcome everyone to use the model and appreciate any feedback you can provide! If you find this model satisfactory, I would be immensely grateful if you could help spread the word about it.

API PlaygroundAPI Documentation

API Endpoint URL

Base URL for all API requests to this endpoint.

https://modelslab.com/api/v6/images/text2img

API Authentication

Authentication requires a valid API key included in the request. Generate and manage your API keys from your developer dashboard. Include the key in the key parameter for all API requests.

Integration Examples

Production-ready code samples for API integration

{
"prompt": "R3alisticF, hauntingly beautiful oriental necromancer, long flowing brown hair, bangs, darkly tanned skin, earrings, bone necklaces, dark eyeshadow, red lips, vibrant, front-laced transparent, filmy silk blouse, cleavage, holding skull, in a sandstone room lit by candles, High Detail, Perfect Composition, high contrast, silhouetted, chiascuro",
"model_id": "leosamshelloworldxl-helloworldxl50gpt4v",
"lora_model": [],
"width": "1024",
"height": "1024",
"negative_prompt": "(worst quality:2), (low quality:2), (normal quality:2), (jpeg artifacts), (blurry), (duplicate), (morbid), (mutilated), (out of frame), (extra limbs), (bad anatomy), (disfigured), (deformed), (cross-eye), (glitch), (oversaturated), (overexposed), (underexposed), (bad proportions), (bad hands), (bad feet), (cloned face), (long neck), (missing arms), (missing legs), (extra fingers), (fused fingers), (poorly drawn hands), (poorly drawn face), (mutation), (deformed eyes), watermark, text, logo, signature, grainy, tiling, censored, nsfw, ugly, blurry eyes, noisy image, bad lighting, unnatural skin, asymmetry",
"num_inference_steps": "31",
"scheduler": "DPMSolverMultistepScheduler",
"guidance_scale": "7.5",
"enhance_prompt": false,
"key": "YOUR_API_KEY"
}

SDKs

Official SDKs

Production-ready SDKs and client libraries for all major programming languages

API Parameters

Technical specifications for API request parameters.

Field NameParameterTechnical Description
promptpromptprompt help in image generation
Modelmodel_idEnter model_id that can help in image generation
lora Modellora_modelNo description available
Widthwidthwidth of the image
Heightheightheight of the image
Negative Promptnegative_promptNegative prompt help in avoid things that you do not want in image
Stepsnum_inference_stepsNumber of inference steps
SchedulerschedulerSampling scheduler
Guidance Scaleguidance_scaleHow closely to follow the prompt (1-10)
enhance_promptenhance_promptAutomatically enhance the prompt