Meet Nano Banana (aka Gemini 2.5 Flash Image): our latest, fastest, and most efficient model. Its native multimodal architecture processes text and images in a single step, unlocking powerful capabilities like conversational editing, multi-image composition, and logical reasoning.
. Here are the key things you can do:
Text-to-image: Generate high-quality images from simple or complex text descriptions.
Image + text-to-image (editing): Provide an image and use text prompts to add, remove, or modify elements, change the style, or adjust colors.
Multi-image to image (composition & style transfer): Use multiple input images to compose a new scene or transfer the style from one image to another.
Iterative refinement: Have a conversation to progressively refine your image over multiple turns, making small adjustments until it's perfect.
Text rendering: Generate images that contain clear and well-placed text, ideal for logos, diagrams, and posters.
This guide will teach you how to write prompts and provide instructions that get the best results from Gemini 2.5 Flash. It all starts with one fundamental principle:
Describe the scene, don't just list keywords. The model's core strength is its deep language understanding. A narrative, descriptive paragraph will almost always produce a better, more coherent image than a simple list of disconnected words.
The most common way to generate an image is by describing what you want to see.
For realistic images, think like a photographer. Mentioning camera angles, lens types, lighting, and fine details will guide the model toward a photorealistic result.
Template: A photorealistic [shot type] of [subject], [action or expression], set in [environment]. The scene is illuminated by [lighting description], creating a [mood] atmosphere. Captured with a [camera/lens details], emphasizing [key textures and details]. The image should be in a [aspect ratio] format.
Example Prompt: A photorealistic close-up portrait of an elderly Japanese ceramicist with deep, sun-etched wrinkles and a warm, knowing smile. He is carefully inspecting a freshly glazed tea bowl. The setting is his rustic, sun-drenched workshop. The scene is illuminated by soft, golden hour light streaming through a window, highlighting the fine texture of the clay. Captured with an 85mm portrait lens, resulting in a soft, blurred background (bokeh). The overall mood is serene and masterful. Vertical portrait orientation.
A photorealistic close-up portrait of an elderly Japanese ceramicist...
To create stickers, icons, or assets for your projects, be explicit about the style and remember to request a white background if you need one.
Template: A [style] sticker of a [subject], featuring [key characteristics] and a [color palette]. The design should have [line style] and [shading style]. The background must be white.
Example Prompt: A kawaii-style sticker of a happy red panda wearing a tiny bamboo hat. It's munching on a green bamboo leaf. The design features bold, clean outlines, simple cel-shading, and a vibrant color palette. The background must be white.
A kawaii-style sticker of a happy red panda...
Gemini excels at rendering text. Be clear about the text, the font style (descriptively), and the overall design.
Template: Create a [image type] for [brand/concept] with the text "[text to render]" in a [font style]. The design should be [style description], with a [color scheme].
Prompt: Create a modern, minimalist logo for a coffee shop called 'The Daily Grind'. The text should be in a clean, bold, sans-serif font. The design should feature a simple, stylized icon of a coffee bean seamlessly integrated with the text. The color scheme is black and white.
Create a modern, minimalist logo for a coffee shop called 'The Daily Grind'...
Create clean, professional product shots for e-commerce, advertising, or branding.
Template: A high-resolution, studio-lit product photograph of a [product description] on a [background surface/description]. The lighting is a [lighting setup, e.g., three-point softbox setup] to [lighting purpose]. The camera angle is a [angle type] to showcase [specific feature]. Ultra-realistic, with sharp focus on [key detail]. [Aspect ratio].
Example Prompt: A high-resolution, studio-lit product photograph of a minimalist ceramic coffee mug in matte black, presented on a polished concrete surface. The lighting is a three-point softbox setup designed to create soft, diffused highlights and eliminate harsh shadows. The camera angle is a slightly elevated 45-degree shot to showcase its clean lines. Ultra-realistic, with sharp focus on the steam rising from the coffee. Square image.
A high-resolution, studio-lit product photograph of a minimalist ceramic coffee mug...
Excellent for creating backgrounds for websites, presentations, or marketing materials where text will be overlaid.
Template: A minimalist composition featuring a single [subject] positioned in the [bottom-right/top-left/etc.] of the frame. The background is a vast, empty [color] canvas, creating significant negative space. Soft, subtle lighting. [Aspect ratio].
Example Prompt: A minimalist composition featuring a single, delicate red maple leaf positioned in the bottom-right of the frame. The background is a vast, empty off-white canvas, creating significant negative space for text. Soft, diffused lighting from the top left. Square image.
A minimalist composition featuring a single, delicate red maple leaf...
Create compelling visual narratives, panel by panel, ideal for developing storyboards, comic strips, or any form of sequential art by focusing on clear scene descriptions.
Template: A single comic book panel in a [art style] style. In the foreground, [character description and action]. In the background, [setting details]. The panel has a [dialogue/caption box] with the text "[Text]". The lighting creates a [mood] mood. [Aspect ratio].
Example Prompt: A single comic book panel in a gritty, noir art style with high-contrast black and white inks. In the foreground, a detective in a trench coat stands under a flickering streetlamp, rain soaking his shoulders. In the background, the neon sign of a desolate bar reflects in a puddle. A caption box at the top reads "The city was a tough place to keep secrets." The lighting is harsh, creating a dramatic, somber mood. Landscape.