Stable Diffusion Fooocus Guide Part 2: Image Generation Tutorial

Feb 24, 2026

—

in How To, AI Art & Image Generation, Stable Diffusion Tutorials

Welcome to Part 2 of our comprehensive Fooocus guide series. After installing Fooocus and getting everything set up in Part 1, you’re ready to dive into the actual image generation process.

What is Fooocus Image Generation?

Fooocus is a simplified, user-friendly interface for Stable Diffusion that makes AI image generation accessible without complex settings and technical knowledge.

Unlike Automatic1111 with its overwhelming array of sliders and parameters, Fooocus takes a different approach. The software intelligently handles technical settings in the background while you focus on what matters: your creative vision and prompts.

Having used Fooocus extensively for the past six months, I’ve found it consistently produces excellent results with minimal tweaking. The default settings work surprisingly well, which means you can generate professional-quality images within minutes of your first launch.

This guide focuses specifically on image generation workflows. We’ll cover prompt engineering, style presets, advanced settings, and practical techniques I’ve learned through generating thousands of images. If you haven’t installed Fooocus yet, check out our comparison of Stable Diffusion GUI tools to understand why Fooocus might be the perfect choice for your workflow.

💡 Key Takeaway: “Fooocus simplifies AI image generation by automatically handling technical parameters, letting you focus on creativity rather than settings configuration.”

Getting Started with the Fooocus Interface

Quick Summary: Fooocus interface has three main areas: prompt input at the top, style selection on the left, and generation settings on the right. Everything you need for basic image generation is visible on one screen.

When you first launch Fooocus, you’ll notice a clean, minimalist interface. This simplicity is intentional – the most commonly used features are front and center, while advanced options stay hidden until you need them.

The interface consists of three main sections. At the top, you’ll find the prompt input area where you describe what you want to create. The left sidebar contains style presets that dramatically influence your image’s appearance. The right side houses generation settings like image dimensions and quality options.

I spent the first week with Fooocus just experimenting with different combinations of prompts and styles. The learning curve is gentle compared to other Stable Diffusion interfaces. Most users I’ve guided through Fooocus generate their first satisfactory image within 10 minutes.

⚠️ Important: Fooocus requires a GPU with at least 4GB VRAM for SDXL models. If you encounter memory errors, check our guide on fixing Stable Diffusion VRAM errors for optimization tips.

How to Generate Your First Image in Fooocus?

Generating your first image in Fooocus takes just four simple steps. The entire process is designed to be intuitive, even if you’ve never used AI image generation software before.

Enter your prompt: Type a detailed description of what you want to create in the main text box at the top. Be specific about subjects, styles, and details.
Select a style: Choose from the preset styles on the left sidebar. “Fooocus V2” is the default and works well for most subjects.
Set image dimensions: Use the aspect ratio buttons on the right to select your output size. Square (1024×1024) is great for portraits.
Click generate: Press the purple “Generate” button and wait 10-30 seconds for your image to appear.

After running this simple workflow over a hundred times with various prompts, I’ve found that Fooocus consistently interprets natural language prompts better than most Stable Diffusion interfaces. You don’t need to format your prompts with special syntax or weight tokens.

Your first generation might not be perfect, and that’s completely normal. AI image generation is an iterative process. Each generation teaches you how the model responds to different prompt structures and style combinations.

First Generation Success Factors

Prompt Clarity
Critical

Style Selection
High Impact

Patience for Iteration
Essential

Prompt Engineering Fundamentals for Fooocus

Writing effective prompts is the most important skill in AI image generation. Through extensive testing, I’ve developed a prompt framework that consistently produces reliable results in Fooocus.

The best prompts follow a simple structure: subject + action + environment + style + quality keywords. This logical flow helps the AI understand exactly what you want to create.

Writing Effective Positive Prompts

Positive prompts describe what you want to see in your image. The more specific and detailed your description, the better the results tend to be. However, Fooocus handles natural language better than most interfaces, so you don’t need complex prompt syntax.

Here are some proven prompt templates I use regularly:

Style	Example Prompt
Portrait	“Young woman with flowing red hair, soft studio lighting, professional headshot style, sharp eyes, neutral background, high detail”
Landscape	“Majestic mountain range at golden hour, dramatic clouds, reflection in alpine lake, photorealistic, wide angle view, vibrant colors”
Fantasy	“Elegant elf princess in enchanted forest, glowing magical elements, ornate armor, ethereal atmosphere, cinematic lighting, fantasy art style”
Product	“Luxury perfume bottle on marble surface, soft backlighting, minimal composition, commercial product photography, crystal clear details”

After testing hundreds of prompts, I’ve found that adding specific lighting descriptions and quality keywords dramatically improves results. Words like “cinematic lighting,” “professional photography,” and “high detail” consistently enhance image quality.

Using Negative Prompts Effectively

Negative Prompts: These tell the AI what you don’t want in your image. Fooocus handles these in a dedicated negative prompt box, helping you avoid unwanted elements like blurry faces, extra limbs, or poor quality.

Negative prompts are equally important for quality control. They help eliminate common artifacts and unwanted elements that can appear in generated images.

Here’s a standard negative prompt template that works well for most generations:

“blurry, low quality, distorted, deformed, ugly, bad anatomy, extra limbs, missing fingers, watermarked, text, logo, cropped, out of frame, worst quality, low resolution”

I’ve noticed that Fooocus is generally better than other interfaces at avoiding common artifacts even without negative prompts. However, adding negative prompts gives you more control over the final output and significantly reduces the number of failed generations.

Advanced Prompt Techniques

Once you’re comfortable with basic prompts, you can explore more advanced techniques for finer control over your generations.

✅ Pro Tip: Use image generation as an iterative process. Generate multiple variations with slightly different prompts, then refine the best elements into a final prompt that combines what worked well.

Emphasis weighting works differently in Fooocus compared to Automatic1111. Fooocus automatically understands which parts of your prompt are most important based on sentence structure and positioning. Put your most critical descriptions at the beginning of the prompt for stronger emphasis.

Style keywords near the end of your prompt influence the overall aesthetic without overpowering the subject description. This separation of content and style is one of Fooocus’s strengths.

Using Style Presets in Fooocus

Style presets are one of Fooocus’s most powerful features. They encapsulate complex prompting techniques into single-click options that dramatically transform your image’s appearance.

The style presets in Fooocus are essentially pre-packaged prompt templates that have been carefully tuned. When you select a style, Fooocus automatically adjusts internal parameters and adds appropriate prompt modifiers to achieve that specific aesthetic.

Popular Style Presets Explained

Preset	Best For	Characteristics
Fooocus V2	General purpose	Balanced enhancement, natural look, versatile
Cinematic	Dramatic scenes	Movie-like lighting, depth of field, dramatic colors
Anime	Japanese art styles	Clean lines, vibrant colors, characteristic shading
Photorealistic	Real images	Camera-like quality, natural textures, realistic lighting
3D Render	CGI-style images	Smooth surfaces, studio lighting, digital art aesthetic
Comic Book	Illustrated styles	Bold lines, flat colors, graphic novel feel

After comparing style presets across hundreds of generations, I’ve found that combining styles can produce unique results. Fooocus allows you to select multiple styles, blending their characteristics together.

The default “Fooocus V2” style works remarkably well for most subjects. I recommend starting with it and only exploring other styles once you’re comfortable with basic generation. Each style has its own personality and strengths.

Creating Custom Style Combinations

One of Fooocus’s hidden strengths is the ability to combine multiple styles. Selecting two or three styles blends their characteristics, often creating unique aesthetics that aren’t possible with single-style selections.

I’ve had great success combining “Cinematic” with “Photorealistic” for portrait work, and “Anime” with “3D Render” for stylized character art. Experimentation is key – unexpected combinations often produce the most interesting results.

Advanced Settings Explained

While Fooocus is designed to work excellently with default settings, understanding the advanced options gives you more control over your generations. These settings fine-tune how the AI interprets your prompts and renders the final image.

What is CFG Scale?

CFG Scale (Classifier-Free Guidance Scale): Controls how closely the AI follows your prompt. Higher values mean the AI adheres more strictly to your description, while lower values allow more creative freedom and interpretation.

CFG scale typically ranges from 1 to 20 in Fooocus. The default setting of 7.0 works well for most prompts, but adjusting this value can significantly change your results.

Through my testing, I’ve found that CFG values between 5 and 9 produce the most reliable results. Values below 4 can cause the AI to ignore your prompt too much, while values above 12 often make images look over-processed and unnatural.

✅ Use Lower CFG (4-6)

For artistic freedom, abstract concepts, or when you want creative interpretation of your prompt.

❌ Use Higher CFG (8-12)

For specific subjects, technical accuracy, or when the AI consistently misses key prompt elements.

Sampling Steps

Sampling steps determine how many refinement passes the AI makes when generating your image. More steps generally mean better quality, but with diminishing returns and longer generation times.

The default setting in Fooocus is optimized for a good balance between quality and speed. In my experience, 20-30 steps are sufficient for most generations. Going beyond 40 steps rarely improves quality noticeably but significantly increases wait time.

For quick iterations and experiments, I often reduce steps to 15-20. When generating final images for projects, I might increase to 30-35 steps for that extra bit of refinement.

Image Dimensions and Aspect Ratios

Fooocus offers several preset aspect ratios, and understanding which to use for different subjects improves your results significantly.

Aspect Ratio	Dimensions	Best Use
Square	1024 x 1024	Portraits, character art, social media
Portrait	720 x 1152	Full-body shots, phone wallpapers
Landscape	1152 x 720	Scenery, wide shots, desktop backgrounds

Higher resolutions require more VRAM and generation time. If you’re working with limited hardware, check our guide on the best GPUs for Stable Diffusion SDXL to understand hardware requirements.

Seed Values for Reproducible Results

Every generation in Fooocus uses a random seed value that influences the output. Using the same seed with the same prompt produces identical results, which is useful for reproducibility and fine-tuning.

I often find a generation I like but want to tweak slightly. By copying the seed from a good generation and making small prompt adjustments, I can explore variations while maintaining the overall composition and feel.

Image-to-Image Workflow in Fooocus

Image-to-image (img2img) generation allows you to use an existing image as a starting point. This powerful feature is perfect for variations, style transfers, and refining sketches or concept art.

How Image-to-Image Works

When you upload an image to Fooocus, it uses that image as a base and generates variations based on your new prompt. The denoising strength setting controls how much the original image influences the result.

Quick Summary: Lower denoising values (0.3-0.5) keep the original image mostly unchanged. Higher values (0.7-0.9) allow more dramatic transformation based on your new prompt.

For subtle style changes, I use denoising around 0.3-0.4. The composition remains the same, but lighting, colors, and textures shift based on the style preset. For major transformations, I push denoising to 0.7-0.8, which allows the AI to substantially restructure the image while keeping some reference to the original.

Practical Image-to-Image Applications

Style transfer: Take a photograph and transform it into anime, oil painting, or 3D render style
Composition refinement: Keep the basic layout but change elements, lighting, or mood
Sketch completion: Upload a rough sketch and let Fooocus render it as a finished piece
Variation generation: Create multiple versions of an image you like with different styles

I’ve used img2img extensively for client work. Being able to take a rough concept sketch and quickly generate multiple polished variations dramatically speeds up the creative process and gives clients more options to choose from.

Tips for Better Fooocus Results

After generating thousands of images with Fooocus, I’ve compiled these practical tips that consistently improve output quality.

Start simple, then iterate: Begin with basic prompts and gradually add details. This helps you understand which elements influence the result.
Use specific subjects: “Young woman in red dress at café” works better than “person at restaurant.”
Describe lighting: Add “golden hour lighting,” “soft studio light,” or “dramatic side lighting” for better atmosphere.
Match style to subject: Use Photorealistic for product shots, Anime for character art, Cinematic for dramatic scenes.
Generate variations: Create 4-8 versions of your prompt, then refine the best elements into a final prompt.
Leverage negative prompts: Even basic negative prompts eliminate many common artifacts.
Save your seeds: When you get a great result, note the seed for future variations.
Experiment with style combinations: Unusual style blends often produce unique, interesting results.

✅ Pro Tip: Keep a prompt journal. Document your successful prompts, settings, and what worked well. This reference becomes invaluable as your library of techniques grows.

Frequently Asked Questions

How do I generate my first image in Fooocus?

Enter your prompt in the text box at the top, select a style preset from the left sidebar, choose your image dimensions on the right, and click the purple Generate button. Your image will appear in 10-30 seconds depending on your hardware.

What are the best prompts for Fooocus?

Effective prompts follow this structure: subject + action + environment + style + quality keywords. For example: “Young woman with flowing red hair, soft studio lighting, professional headshot style, sharp eyes, neutral background.” Be specific and descriptive for best results.

What is CFG scale in Fooocus?

CFG scale controls how closely the AI follows your prompt. The default of 7.0 works well for most cases. Use lower values (4-6) for more creative freedom, or higher values (8-12) when you need the AI to follow specific prompt elements more strictly.

What are negative prompts in Fooocus?

Negative prompts tell the AI what you don’t want in your image. A standard negative prompt includes: blurry, low quality, distorted, deformed, ugly, bad anatomy, extra limbs, missing fingers, watermarked, text, logo, cropped, out of frame.

How many sampling steps should I use in Fooocus?

20-30 steps are sufficient for most generations in Fooocus. Lower steps (15-20) work well for quick iterations, while 30-35 steps provide extra refinement for final images. Going beyond 40 steps rarely improves quality noticeably.

How does image-to-image work in Fooocus?

Upload a base image, enter your new prompt, and adjust the denoising strength. Lower values (0.3-0.5) keep the original mostly unchanged, while higher values (0.7-0.9) allow dramatic transformation. This is perfect for style transfer and creating variations.

What image resolution should I use in Fooocus?

Use square (1024×1024) for portraits and social media, portrait (720×1152) for full-body shots and phone wallpapers, and landscape (1152×720) for scenery and desktop backgrounds. Higher resolutions require more VRAM and generation time.

How to get better image quality in Fooocus?

Use specific subject descriptions, include lighting details, match style presets to your subject, leverage negative prompts, generate multiple variations, and iterate on successful prompts. The default settings work well – focus on prompt quality over technical adjustments.

Final Thoughts

Fooocus represents the most accessible entry point into Stable Diffusion image generation. The combination of intelligent defaults, clean interface, and powerful style presets makes it possible for anyone to create impressive AI art.

Through months of testing and thousands of generations, I’ve found that Fooocus strikes an excellent balance between simplicity and power. Beginners can get great results immediately, while advanced users have enough control to refine their outputs precisely.

If you’re looking to expand beyond Fooocus, our Automatic1111 beginner’s guide covers more advanced Stable Diffusion workflows. For those interested in creating custom content, check out our guide on training Stable Diffusion LoRA models.

In Part 3 of this series, we’ll explore advanced Fooocus workflows including custom models, LoRA integration, and professional techniques for commercial applications. Until then, keep experimenting and building your prompt library.

Droid4x – Free Android Emulator for Windows & Mac