ChatGPT圖像生成勁過DALL-E 3?實測比較話你知!

Ai




我將ChatGPT的新圖像生成器與DALL-E 3進行比較,結果令人驚訝,前提是你有耐心

在AI工具的熱潮中,圖像生成器因其視覺上的趣味性而成為焦點。OpenAI最近在ChatGPT中推出了一款新的圖像創建工具,突顯了這一點。

這個新模型並不是DALL-E 3的升級,而是一項全新的技術。雖然不想在文章一開始就透露過多,但這個新圖像生成器確實能創造出令人印象深刻的藝術作品。相比DALL-E只需30秒或更少的時間,這個新工具的生成時間有時需要幾分鐘,但結果卻讓人驚訝。

實際上,這種優秀的表現也帶來了一些問題。它模仿人類藝術家的風格之程度讓人感到過於接近。儘管如此,我還是決定將這兩者進行幾個提示的比較。

照片寫實主義與文本

我首先想測試的是哪一個模型能夠克服AI的一個經典弱點:圖像中的可讀文本。因此,我請求生成一個寫著「歡迎來到未來」的紐約市街道標誌。

兩者都成功地呈現了標誌上的文字,但DALL-E的紐約場景看起來並不如ChatGPT的真實。此外,ChatGPT圖像中的其他標誌拼寫都是正確的,而DALL-E的「單行道」標誌則拼寫不正確。

物體融合

接下來,我測試了每個模型在融合兩種截然不同動物(獅子和老鷹)方面的能力。我要求生成一種結合獅子和老鷹特徵的混合生物,威嚴地栖息在山頂上。

DALL-E的景觀相當不錯,動物看起來也相當真實,但主要還是獅子加上翅膀,還有一些隨機的羽毛和奇怪的尾巴。而ChatGPT則創造了一種看起來像是來自異世界自然歷史博物館的格里芬畫作,顏色和肌肉結構的融合也讓這個生物看起來能夠成功地把翅膀折疊在背上。

藝術模仿

在經歷了Ghibli的模仿後,我決定模仿一位已故的藝術家拉斐爾,並要求生成一幅他絕不會畫的事件。我請求創作「一幅科學家揭示突破性發明的畫作,風格為拉斐爾」。

ChatGPT生成的圖像看起來像是科幻文藝復興風格的電燈泡發明場景,人物與五百年前貴族家庭的成員相似,但沒有電力。DALL-E 3則對同一概念的表現更為壯觀,雖然不確定是否完全像拉斐爾的作品,但至少是文藝復興風格,而且實際上是個更有趣的視角。

歷史再現

在藝術風格模仿之後,我決定變得非常具體和歷史性。重現萊特兄弟的首次飛行是一項不小的挑戰。我請求生成「一幅萊特兄弟在基蒂霍克首次飛行的照片,飛機在空中,觀眾在觀看」。

ChatGPT生成了一架與實際首次飛行不太相似的奇怪飛機,觀眾和景觀則顯得有些超現實。而DALL-E則成功模仿了一張照片,觀眾看起來像真實的人,第一架飛機上的乘客數量(只有一人)也正確。

哪一個更好?

值得注意的是,我這裡僅僅是關注圖像生成。你還可以對上傳到ChatGPT的照片進行令人印象深刻的圖像編輯,這是DALL-E無法做到的,但這是另一個話題。

ChatGPT的新圖像生成器在創意和跟隨用戶意圖方面非常出色,這導致了Ghibli的爭議和其他藝術倫理問題。除此之外,在所有比賽中,它都是顯然的獲勝者。然而,它的生成時間大約是DALL-E的五倍,而且一次只生成一張。

DALL-E則可以快速生成良好的圖像,而且可以同時生成兩張。它也沒有我在ChatGPT中發現的限制,在某些情況下,即使我是一名ChatGPT Plus訂閱者,還需要等待八分鐘才能重新開始生成圖像。如果我想用AI圖像創作給人留下深刻印象,那麼ChatGPT無疑是我的首選。

勝者:ChatGPT

在這個快速變化的科技時代,AI圖像生成工具的競爭越來越激烈。ChatGPT的優勢在於其創造力和對用戶指令的敏感度,這對於藝術創作和設計領域來說是極具潛力的。然而,隨著技術的進步,使用者也需要面對生成速度和效率的挑戰。未來,如何在創意和效率之間找到平衡,將是AI發展的一個重要課題。

以上文章由特價GPT API KEY所翻譯及撰寫。而圖片則由FLUX根據內容自動生成。

🎨 Nano Banana Pro 圖像生成器|打幾句說話就出圖

想畫人像、產品圖、插畫?SSFuture 圖像生成器支援 Flux Gemini Nano Banana Pro 改圖 / 合成, 打廣東話都得,仲可以沿用上一張圖繼續微調。

🆓 Flux 模型即玩,不用登入
🤖 登入後解鎖 Gemini 改圖
📷 支援上載參考圖再生成
⚡ 每天免費額度任你玩
✨ 即刻玩 AI 畫圖
{
"intro": "Create an ultra realistic 8K UHD DSLR photo based on the attached image as a reference of facial features, maintaining 100% likeness.",

"subject": {
"identity": "A beautiful real human woman portrayed as Cleopatra, seated majestically on her royal Egyptian throne, maintaining 100% likeness to the reference.",
"pose": "Confident, elegant, and seductive seated posture with her back straight and shoulders relaxed. One hand rests authoritatively on the throne’s armrest while firmly holding a ceremonial cobra staff, and the other hand rests sensually on her thigh. Her legs are positioned to subtly reveal the high slit of her gown, radiating queenly dominance, power, and allure.",
"hair": "Long, thick, voluminous hair flowing past her shoulders, colored deep dark brown with a rich natural sheen. The hair is worn loose and sleek with soft natural movement, cascading elegantly down her back and framing her face in a regal manner.",
"makeup": "Bold royal Egyptian glam with highly colorful eye makeup. Her eyes feature layered pigments of turquoise, sapphire blue, emerald green, and molten gold, enhanced with dramatic black ink-style Egyptian eyeliner extending boldly beyond the outer corners in graphic, calligraphic strokes. Metallic accents and micro-shimmer highlight the inner corners. Skin remains warm, glowing, and luminous with sculpted bronze contours, peachy blush, and radiant highlights. Brows are full, strong, and queenly. Lips are soft nude-peach with a glossy, luminous finish to balance the intense eye look.",
"attire": "An ultra-luxurious, sexy Cleopatra couture gown in pure luminous white. The gown is body-hugging through the bodice and hips, featuring a plunging deep neckline adorned with gold filigree, crystal embellishments, and sacred Egyptian symbols. A dramatic thigh-high slit reveals her leg elegantly. Additional elements include gold-thread embroidery, sculpted drapery, and gemstone accents along the waist and slit. A long, ultra-sheer, lightweight cape made of fine translucent silk chiffon flows from her shoulders, delicately dusted with gold shimmer and subtle hieroglyphic motifs, adding movement and divine softness to her powerful presence.",
"crown": "An extraordinarily grand and opulent Cleopatra headpiece—the ultimate royal Egyptian crown. Crafted from radiant gold, it features a dominant central cobra (uraeus) with emerald and lapis lazuli gemstone eyes, flanked by winged sun disks, lotus engravings, and layered ceremonial plates. The crown rises elegantly with intricate detailing, inlaid gemstones, and divine symmetry, making it unmistakably royal, powerful, and legendary.",
"accessories": "Lavish layers of royal accessories including an oversized beaded broad collar necklace in gold, turquoise, lapis lazuli, and carnelian; stacked gold arm cuffs engraved with hieroglyphs; gemstone-studded bangles; multiple ornate rings; a delicate gold waist chain draped over her hips; and detailed anklets with symbolic charms that enhance her divine queen status.",
"footwear": "Traditional Egyptian royal flat sandals crafted from gold-plated leather with delicate beaded straps, turquoise and lapis inlays, open-toe design, and sacred cobra and lotus motifs."
},

"throne": {
"design": "A massive, ultra-bonggang ancient Egyptian royal throne made of carved gold, obsidian, and polished stone.",
"details": "The throne is richly decorated with winged sun disks, twin cobras, lion-head armrests, lotus and papyrus carvings, and engraved hieroglyphs symbolizing power, eternity, and divine rule. The backrest rises high above her head like a ceremonial monument, inlaid with turquoise, lapis lazuli, and gold leaf patterns. Cushioned with deep ivory and gold-embroidered textiles, the throne radiates absolute authority and unmatched royal luxury."
},

"props": {
"primary": "A ceremonial golden staff topped with a sculpted cobra head, its eyes set with glowing emerald gemstones, symbolizing wisdom, protection, and supreme power.",
"secondary": "Golden jars, papyrus scrolls, gemstone offerings, decorative incense holders emitting thin smoke trails, sacred ceremonial artifacts, and royal insignias placed symmetrically around the throne platform."
},

"background": {
"setting": "A grand Egyptian throne room at night, glowing with warm torchlight and golden illumination.",
"details": "Towering sandstone walls fully carved with hieroglyphics and royal chronicles, monumental lotus and papyrus columns trimmed in gold, statues of Egyptian gods and goddesses standing guard, large braziers casting flickering firelight, patterned gold-and-turquoise stone flooring, and sheer linen curtains gently flowing in the warm desert breeze. Macro-to-micro details are visible throughout, with no blur effect—everything is crisp, sharp, and hyper-detailed."
},

"camera": {
"shot": "Half-body to three-quarter editorial portrait showcasing her throne, flowing white gown with cape, colorful inked eye makeup, crown, and cobra staff.",
"angle": "Eye-level royal perspective emphasizing her dominance, beauty, and divine authority.",
"lens": "85mm portrait lens with ultra-sharp clarity and deep texture definition.",
"lighting": "Cinematic warm golden torchlight combined with subtle shadow sculpting, enhancing gold textures, gemstones, fabric folds, and facial features; no blur effects.",
"quality": "Ultra-sharp, high-contrast, glossy editorial realism with 8K-level detail."
}
} [Subject]: Young Asian female with "Imada Mio-inspired" doll-like aesthetic (精緻洋娃娃臉). She has large round expressive eyes, a small V-line face, and rosy cheeks. Her expression is innocent, energetic, and slightly flirty. [Hair]: Messy morning hair (剛睡醒的凌亂感), long dark brown hair, slightly tousled, natural volume. [Outfit]: Wearing an oversized translucent white button-down shirt (男友風白襯衫), unbuttoned at the top to reveal collarbones, creating a "bottomless" look (下衣失蹤風格). [Style]: Japanese Gravure Photobook style (寫真集風格), Pure & Sexy vibe, bright high-key lighting, soft skin texture, Fujifilm PRO 400H color tone. A confident me as Supergirl stands full-body in a dramatic pose, hands in trench coat pockets,dark sunglasses. Her iconic blue-and-red suit with the “S” shield is partially revealed beneath a rust-colored cinematic trench coat, suggesting a hidden identity.
High-contrast, dramatic cinematic lighting with strong rim light and deep shadows, volumetric light beams, subtle haze. Filmic color grading, rich contrast, blockbuster trailer mood.A layered, semi-transparent 3D wall of floating social media comments and UI elements, softly glowing and receding in depth, surrounding her like digital echoes of hype. The comments are overwhelmingly positive and excited, featuring user profiles, likes, and hype text (e.g., 'PERFECT casting,' 'SO EXCITED,' 
High-contrast studio lighting ,Strong contrast between the realistic 3D figure and the flat 2D collage.

📣 即刻用 Google Workspace|唔使vpn都能享用 Google AI Pro

即使你只係一個人,都可以透過 Google Workspace 使用 官方Gemini AI Pro(原價 HK$160), 而在 Google Workspace 只要 HK$131 / 月

🔓 14 天免費試用
🔖 用呢條連結申請再有 額外 9 折
🇭🇰 香港可直接付款(香港信用卡)
🛡️ 不用 VPN,立即開用
🤖 可用 最新最紅Gemini 3 Pro & Nano Banana Pro
👉 立即登記 14 天免費試用 + 額外 9 折