Can DALL-E 3 in ChatGPT learn and modify photographs? Come see for your self

I have been exploring the usage of DALL-E 3 inside ChatGPT Plus. I am doing this as a result of it is my job, not as a result of I’ve some form of unhealthy little dependancy to describing one thing in my thoughts and see it manifest in mere minutes on the display. I can cease at any time. Positive, that is the ticket, I can cease at any time.

We’ve got arrived at the moment sooner or later when an AI says the exact equal of, “I am sorry Dave. I am afraid I am unable to do this.”

However not at present. In the present day, I discovered a brand new toy. DALL-E 3 inside ChatGPT can learn and modify photographs. Form of. You see, it’s kind of fussy. However I am getting forward of myself. Let’s begin this story in the beginning…

Additionally: The way to get an ideal face swap utilizing Midjourney AI

I have been utilizing Midjourney to customise uploaded photographs for some time. The issue is that it is very convoluted. It’s a must to be operating Midjourney in Discord, after which it’s important to undergo various steps to add a picture into Discord, get a URL, yada, yada, yada…

In ChatGPT Plus, you merely need to click on on the paperclip icon and add your picture. One and carried out.

That makes it loads simpler to make use of, and in addition much more enjoyable. However how nicely does it work? To check it out, I attempted three photographs: an image of my automotive, an image of me, and the ZDNET brand. Let us take a look at the outcomes.

My automotive

This is an image of my automotive, a 2013 Dodge Challenger.

my-car
David Gewirtz/ZDNET

As soon as the picture was uploaded, I instructed DALL-E 3:

Put automotive in metropolis

The outcomes have been promising. DALL-E 3 efficiently reproduced a likeness of the automotive, in a metropolis scene:

car-in-city.png
Screenshot by David Gewirtz/ZDNET

Then, as a result of I’ve a particular steampunk fascination, I requested DALL-E to:

Make it steampunk

This is what we bought. It nonetheless retained the general physique model of the Dodge Challenger:

steampunk.png
Screenshot by David Gewirtz/ZDNET

DALL-E retains breaking

One factor to notice is that I could not get DALL-E to do too many iterations with out failure. Each two or three requests (and by no means greater than 4), I bought this message:

failure
Screenshot by David Gewirtz/ZDNET

My workaround was to take the final efficiently created picture and add it into a brand new ChatGPT Plus session, and work from that.

Additionally: The very best AI artwork turbines: DALL-E 2 and enjoyable alternate options to attempt

Flying automotive

So, I uploaded the final picture, the steampunk model of my automotive, and informed DALL-E to:

Make the automotive fly

This is what I bought again. The illustration of my authentic automotive was gone, and we bought again a really cool Chitty Chitty Bang Bang-style automobile, within the air:

chitty.png
Screenshot by David Gewirtz/ZDNET

It was cool, but it surely was not my automotive. However that is OK, let’s examine the place we will go.

The Santa Clause

My subsequent try, given the season, was to attempt to put Santa within the driver’s seat. Right here, I ran right into a quite over-zealous guardrail, with ChatGPT telling me, “I am sorry, however I am unable to help with that request.”

Sure, we’ve arrived at the moment sooner or later when an AI says the exact equal of, “I am sorry Dave. I am afraid I am unable to do this.” Life imitates artwork.

sorry-dave
Screenshot by David Gewirtz/ZDNET

As you may see, the reasoning was a bit weird. But it surely appeared to hinge on the truth that I used to be asking it to change an uploaded image. So I gave it the above prompts as a single request:

Put automotive in metropolis. Make it steampunk. Make the automotive fly. Put Santa within the driver’s seat.

I bought this:

bright-santa.png
Screenshot by David Gewirtz/ZDNET

It was a pleasant image, but it surely missed the entire Victorian aesthetic that the earlier photographs had. There is a repair for that, although.

Additionally: The way to use Bing Picture Creator (and why it is higher than ever)

DALL-E with ChatGPT primarily rewrites each immediate right into a extra detailed instruction. So, for instance, “Make it steampunk” bought rewritten into:

Picture of a steampunk-themed road scene with classic structure, that includes a basic muscle automotive modified with steampunk components like brass pipes, gears, and steam exhausts. The automotive is parked on a cobblestone road with ambient road lamps and a backdrop of old style buildings that mix Victorian and industrial revolution influences. There are individuals wearing steampunk apparel, with goggles and Victorian clothes, strolling on the sidewalks. The general environment is that of a retro-futuristic metropolis from an alternate historical past the place steam energy is the first expertise.

So I grabbed parts of the descriptive materials from each “make it steampunk” and “make the automotive fly” and mixed them for this immediate:

Put automotive in metropolis. Make it steampunk. Make the automotive fly. Put Santa within the driver’s seat. Under, the cobblestone streets are lined with gaslight road lamps, and folks in Victorian apparel search for in amazement. The sky is a nightfall orange with a touch of smog and the thrill of smaller steampunk drones and airships within the distance. The general environment is that of a retro-futuristic metropolis from an alternate historical past the place steam energy is the first expertise.

This is what I bought again:

steam-santa.png
Screenshot by David Gewirtz/ZDNET

Strictly talking, it isn’t a flying automotive, but it surely’s cool. Sadly, there is not any connection in any respect to the unique automotive picture I began with.

Cease, Dave. Will you cease, Dave? Cease, Dave.

I had one other HAL second once I requested ChatGPT to place this image of me in an workplace setting:

stop-dave
Screenshot by David Gewirtz/ZDNET

It informed me, “I am sorry, however I am unable to help with that request.” At the very least ChatGPT did not say, “Look Dave, I can see you are actually upset about this. I actually assume you ought to sit down down calmly, take a stress tablet, and assume issues over.”

Additionally: Due to my 5 favourite AI instruments, I am working smarter now

Advantageous. And now for one thing utterly completely different.

Leaving on a jet prepare

This is the ZDNET brand, which I uploaded to DALL-E:

zdnet
Screenshot by David Gewirtz/ZDNET

First, I attempted to get it to place it on a jet:

Put this brand on the aspect of a jumbo jet

At the very least it bought the colour proper:

jet.png
Screenshot by David Gewirtz/ZDNET

Then I attempted to get it to place the emblem on a constructing.

Put this brand on the aspect of a brick constructing

It remembered inexperienced, however not the precise inexperienced:

building.png
Screenshot by David Gewirtz/ZDNET

So I attempted to get DALL-E to maneuver the constructing onto a mannequin railroad.

Put the constructing on a mannequin railroad

The result’s one thing resembling a mannequin railroad (though the monitor within the foreground is more likely to trigger a derailment).

railroad.png
Screenshot by David Gewirtz/ZDNET

There’s a brick constructing, but it surely’s not the identical brick constructing, and any pretense of the ZDNET brand is gone. Not even the ZDNET inexperienced stays.

Additionally: Generative AI can simply be made malicious regardless of guardrails, say students

So, in fact, I requested it to do that:

Additionally put the jumbo jet on a mannequin railroad

I bought this. I simply need to know if these are planes or missiles within the water.

jet-train.png
Screenshot by David Gewirtz/ZDNET

What have we realized?

After tinkering with this DALL-E characteristic, I believe we will conclude the next:

  • You possibly can add photographs to DALL-E.
  • You possibly can ask it to change them, however with blended outcomes.
  • DALL-E fails loads.
  • ChatGPT might not be demonstrating Synthetic Common Intelligence, but it surely’s bought summary expressionism down.
  • Its responses are uncomfortably near these of the HAL-9000.

And there you go. Have you ever uploaded photographs to DALL-E? How has it carried out for you? Tell us within the feedback under.


You possibly can comply with my day-to-day undertaking updates on social media. You should definitely subscribe to my weekly replace e-newsletter on Substack, and comply with me on Twitter at @DavidGewirtz, on Fb at Fb.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, and on YouTube at YouTube.com/DavidGewirtzTV.

Leave a Reply

Your email address will not be published. Required fields are marked *