Gemini 2.0 Flash’s New Multimodal Image Gen Is My New Image Editor

Google has released an impressive update with Gemini 2.0 Flash Experimental . you could not only bring forth look-alike now but also blue-pencil them consistently without losing modification using simple text - based prompts .

There are mint of AI epitome editors out there with the like ofDall - E 3 and Imagen 3fighting for your meter and money . While they are unspoilt at generating picture , editing with them was sadly out of reach . These AI mannikin were trained to bring forth images only . alternatively of make changes , they usually ended up creating unexampled single from lettuce .

Gemini is currently the only multimodal AI chatbot that can do by both text and paradigm natively . Meaning , that when you ask Gemini to cut a sire picture , it does so natively instead of routing the request to a specialised image diffusion model like Imagen 3 .

Gemini ’s multimodal capabilities aid it understand both schoolbook and image natively help it accomplish some telling feats . allow ’s break it down with some instance .

What’s New With Gemini 2.0 Flash Native Image Generation and Editing

Until now , when you expect an AI example to edit an image , alternatively of edit the generated image , it would regenerate a new image entirely creating two distinct image rather .

For example , here ’s ChatGPT ’s reply when I ask it to alter the gondola ’s people of color from fatal to red . or else of changing the color , it generates a new red railway car instead with new road , different setting , and even different car role model .

Now when I take Gemini to change the gondola ’s color from black to red , it keeps the picture consistent and only makes the required modification . It only changes the color but keeps the gondola mannequin , route , and background knowledge , all reproducible .

Gemini uses native multimodal capability to keep images reproducible even when generate whole step - by - step instructions . For example , when you ask for a alimentary paste formula , Gemini will generate images for each cooking step , keeping the details like the bowl or pan out consistent . you could even download these picture for personal use .

This is still a beta feature and is currently not uncommitted directly inside Gemini . However , everyone can access it for loose inside the AI Studio app , Google ’s AI beta examination app . Just hop ontoGoogle ’s AI Studio website , select the Gemini 2.0 Flash observational example , and test it .

Examples of Gemini 2.0 Flash Image Generation

We tested the feature of speech in several different ways and every time , it came out on top delivering reproducible issue .

First , I asked the model to return an look-alike of vanilla ice ointment . afterwards , I ask it to add cocoa syrup , and it did precisely that without vary anything — even the scoop was exactly the same as in the first image .

likewise , I asked Gemini to change the television camera slant and it did that perfectly . For example , I first generated an epitome of a classic cerise auto . When I asked for a dissimilar camera angle , it generated an trope with the front scene or else of the side survey .

As I enquire Gemini to add up more edits , the model made changes like adding / removing items , changing placements , adjusting camera angles , and more as requested .

Not just for generated images , you may also upload your own picture and then edit them . In the example below , I asked the model to commute the image into a sunset with vivid colors , and it did that utterly .

Want to make your pitch-dark - and - white picture coloured ? you’re able to ask Gemini to do it .

you may also try and upload an art expressive style and ask it to generate something in that exceptional mode , and the model can double it exactly .

Since Gemini is proficient with both text edition and image , you’re able to now require it to bring text to images . in the first place , Gemini , like most AI models , struggled with sum up and cut text inside an picture .

Here ’s Gemini generating a Happy Birthday card with a bunch of text exactly as call for .

As mentioned , Gemini use its multimodal capability to generate consistent images in various way . For example , here ’s an intact taradiddle produce by Gemini , yield images for each step of the storey . Notice how the character are coherent .

you could also request recipes with images for each step , and the modeling will observe consistency throughout .

However , the good example is not whole perfect . If you observe , when make a formula , the model first baked the cookies and then commit them on a tray . While this does n’t ordinarily happen , we observed some casual issues during our examination . Additionally , one time when I require to alter the colouring of the car , it changed the entire railcar rather than just the color . However , when I tried again , it right changed just the car ’s people of colour .

Gemini 2.0 Flash’s New Multimodal Image Gen Is My New Image Editor

What’s New With Gemini 2.0 Flash Native Image Generation and Editing

Examples of Gemini 2.0 Flash Image Generation

All Secret And Hidden Commands for Alexa (May 2025)

How to Turn ChatGPT AI Action Figure Into a Video…

7 AI Tools That Research for You: Stop Drowning in…

How to Create ChatGPT Action Figures and Turn Yourself Into…

4 Ways to Use ChatGPT to Generate YouTube Thumbnails –…

ChatGPT Not Generating Ghibli Images? Here’s What to Do

We Put ChatGPT and Gemini’s Image Generators Head-to-Head – Here’s…

How to Turn Ghibli Images Into Videos Using AI for…

We Tested Tens of AI YouTube Thumbnail Makers to Discover…

I Created My Own Apps Using AI — Here’s What…

What’s New With Gemini 2.0 Flash Native Image Generation and Editing#

Examples of Gemini 2.0 Flash Image Generation#

All Secret And Hidden Commands for Alexa (May 2025)#

How to Turn ChatGPT AI Action Figure Into a Video…#

7 AI Tools That Research for You: Stop Drowning in…#

How to Create ChatGPT Action Figures and Turn Yourself Into…#

4 Ways to Use ChatGPT to Generate YouTube Thumbnails –…#

ChatGPT Not Generating Ghibli Images? Here’s What to Do#

We Put ChatGPT and Gemini’s Image Generators Head-to-Head – Here’s…#

How to Turn Ghibli Images Into Videos Using AI for…#

We Tested Tens of AI YouTube Thumbnail Makers to Discover…#

I Created My Own Apps Using AI — Here’s What…#

What’s New With Gemini 2.0 Flash Native Image Generation and Editing

Examples of Gemini 2.0 Flash Image Generation

All Secret And Hidden Commands for Alexa (May 2025)

How to Turn ChatGPT AI Action Figure Into a Video…

7 AI Tools That Research for You: Stop Drowning in…

How to Create ChatGPT Action Figures and Turn Yourself Into…

4 Ways to Use ChatGPT to Generate YouTube Thumbnails –…

ChatGPT Not Generating Ghibli Images? Here’s What to Do

We Put ChatGPT and Gemini’s Image Generators Head-to-Head – Here’s…

How to Turn Ghibli Images Into Videos Using AI for…

We Tested Tens of AI YouTube Thumbnail Makers to Discover…

I Created My Own Apps Using AI — Here’s What…