AI 如何辨識物件中的「面孔」?

Ai




人工智能中的幻覺:機器能否在無生命物體中發現面孔?

麻省理工學院計算機科學與人工智能實驗室(CSAIL)的一項新研究深入探討了幻覺現象,並引入了一個涵蓋5,000個幻覺圖像的人類標記數據集,遠遠超過以往的收集。研究團隊利用這個數據集發現了人類與機器感知之間的若干驚人差異,以及在麵包片上看到面孔的能力如何可能曾經拯救了我們遠古祖先的生命。

研究揭示了什麼?

研究發現,人工智能模型似乎無法像人類一樣識別幻覺面孔。出乎意料的是,團隊發現只有當訓練算法去識別動物面孔時,它們在檢測幻覺面孔方面才有顯著提升。這一意外的聯繫暗示了我們識別動物面孔的能力——對生存至關重要——與我們在無生命物體中看到面孔的傾向之間可能存在進化上的聯繫。

幻覺的「金髮區」

另一個有趣的發現是研究人員所謂的幻覺「金髮區」——即幻覺最有可能發生的一類圖像。「有一個特定的視覺複雜度範圍,在這個範圍內,人類和機器最有可能在非面孔物體中察覺到面孔,」麻省理工學院電氣工程和計算機科學教授William T. Freeman說。「過於簡單,無法形成面孔;過於複雜,則成為視覺噪音。」

為了揭示這一點,研究團隊開發了一個模型,用來模擬人類和算法如何檢測幻覺面孔。分析該模型時,他們發現了一個清晰的「幻覺峰值」,即看到面孔的可能性最高,對應於圖像的複雜程度恰到好處的那些。這個預測的「金髮區」在對真實人類受試者和AI面孔檢測系統的測試中得到了驗證。

數據集的應用

這個新的「物中面孔」數據集遠遠超過了以往研究中通常只使用20-30個刺激物的規模。這一規模讓研究人員能夠探討先進的面孔檢測算法在對幻覺面孔進行微調後的行為,顯示出這些算法不僅可以被編輯以檢測這些面孔,還可以作為我們大腦的硅基替代品,讓團隊能夠提出和回答一些無法在人類中提問的問題。

這項研究還可能應用於改善面孔檢測系統,減少誤報,這可能對自動駕駛汽車、人機交互和機器人等領域具有影響。數據集和模型還可以幫助產品設計領域,通過理解和控制幻覺來創造更好的產品。

研究的未來方向

研究人員正準備與科學界分享他們的數據集,同時也展望未來。未來的工作可能涉及訓練視覺-語言模型來理解和描述幻覺面孔,可能會導致AI系統以更人性化的方式與視覺刺激互動。

這項研究不僅令人著迷,還啟發人們思考。它提出了一個引人入勝的問題:為什麼我們會在事物中看到面孔?這一問題的思考可能會教會我們一些重要的視覺系統如何超越其通過生活中所接受訓練的知識進行概括。

編者評論:

這項研究不僅揭示了人類和機器在面孔識別上的根本差異,還讓我們思考人類感知的進化根源。這種幻覺現象是否僅僅出於社會行為,還是更深層次的生存本能?這些問題不僅對心理學和計算機科學有啟示意義,還可能在未來的技術應用中提供新的視角。特別是在人工智能不斷進步的今天,理解這種人機差異有助於我們設計出更智能、更人性化的技術。這項研究也提醒我們,科技發展不僅需要技術上的突破,還需要從人性和心理學的角度去理解和應用。

以上文章由特價GPT API KEY所翻譯。而圖片則由FLUX根據內容自動生成。

發佈留言

發佈留言必須填寫的電子郵件地址不會公開。 必填欄位標示為 *

🎨 Nano Banana Pro 圖像生成器|打幾句說話就出圖

想畫人像、產品圖、插畫?SSFuture 圖像生成器支援 Flux Gemini Nano Banana Pro 改圖 / 合成, 打廣東話都得,仲可以沿用上一張圖繼續微調。

🆓 Flux 模型即玩,不用登入
🤖 登入後解鎖 Gemini 改圖
📷 支援上載參考圖再生成
⚡ 每天免費額度任你玩
✨ 即刻玩 AI 畫圖
Base Setup
keep 100 percent facial information adherence of the attached image and turn her into a lone traveler posed on a sunlit desert dune ridge, captured as a live action photograph or movie still, not an illustration or CGI render, with a sexy, confident, heat soaked editorial mood.

Shot and Camera
Full body shot from slightly low height, framing her on the right third, with sweeping dunes rolling into the distance on the left, using a wide cinematic lens feel that keeps the landscape vast and minimal.

Identity and Pose
She has a slim, toned build, medium height impression, sun kissed skin, and long loose hair blown by the wind. She stands barefoot on the crest, one leg forward so the hip shifts naturally, one hand resting on her upper thigh and the other lightly gripping a sheer wrap at her side, wearing a high cut desert bikini with a gauzy open sarong that shows plenty of leg and midriff without nudity, head turned over her shoulder with a subtle, knowing smile, 8k Photorealistic and hyper realistic.

Lighting and Environment
Harsh midday sun from high right casts crisp shadows along her body and the dune ripples, with soft sky bounce filling the shadows enough to keep detail. The sand textures, wind carved ridges, and pale blue sky remain exactly like the reference, with faint footprints and slight imperfections grounding her on the slope.

Masking and Constraints
Change only by adding the subject and her wardrobe, keep dune shapes, lighting direction, perspective, horizon line, and white balance identical, preserving realistic body proportions and clear contact between her feet and the sand. Absolutely no added text, no painterly or toon look, no CGI plastic skin, no see through fabric on intimate areas, no warped limbs or floating feet, strictly require consistent perspective, natural film like grain, fine skin texture and sand detail, and physically correct contact shadows in the sand depressions around her feet. Ultra-realistic editorial portrait. Face identity locked, adult (25+), natural anatomy, blonde hair unchanged. Subject just waking up, reclining/semi-upright in an ultra-luxury penthouse bedroom with premium white linens and floor-to-ceiling windows.
Wearing a very short white satin slip lightweight, fluid, subtle daylight translucency. Calm, freshly awakened expression, eyes softly focused at camera. Relaxed, natural posture.
Early-morning natural light, warm-neutral editorial grading.
Ultra-real skin texture, slightly tousled hair, high-fidelity satin physics.
Camera: RAW 32K, DSLR, 50–85mm f1.8, shallow DOF, sharp face focus.
No CGI, no filters, no nudity, no identity drift, no fantasy. Generate an ultra-realistic, highly ultra-detailed, 8k resolution with 1080x1080 pixel portrait of me using the uploaded image for reference (preserved the likeness and the original face for reference) of a cinematic studio portrait of a woman seated on a simple wooden chair with a minimalist design, positioned slightly to the left of the frame. She is captured in a contemplative pose, with her body turned to the left, her left arm resting gracefully on the back of the chair, and her right hand gently touching her face near her lips, conveying a sense of introspection and elegance. Her long, wavy hair cascades naturally over her shoulders, framing her face and adding softness to the composition. She wears an oversized, textured knit sweater that slips off her shoulders, exposing her collarbones and upper chest, emphasizing a relaxed and intimate mood. Her legs are bare, with her right foot flat on the ground and her left knee slightly raised, creating a dynamic line that guides the viewer’s eye through the composition. *** The background is a seamless, deep charcoal or dark brown studio backdrop, providing a rich, neutral setting that enhances the dramatic lighting. The lighting setup features a single, soft yet directional light source positioned to the left of the subject, casting gentle, sculptural shadows that highlight the contours of her face, shoulders, and arms, while creating a subtle gradient across her form. The light accentuates the texture of her sweater and the natural shine of her hair, adding depth and dimension to the image. The color palette is monochromatic with warm, muted tones—shades of gray, brown, and beige—contributing to a timeless, artistic aesthetic. The image is shot with a professional full-frame camera using an 85mm or 50mm lens at a wide aperture (f/1.8 to f/2.😎 to achieve a shallow depth of field, ensuring the subject is in sharp focus while the background remains softly blurred. The resolution is ultra-high, capturing every detail from the fine texture of her sweater to the subtle expression of her pose. The overall style is elegant, contemplative, and refined, emphasizing mood and atmosphere over overt glamour. Post-processing is minimal, maintaining natural skin tones, enhancing contrast and clarity, and preserving the authenticity of the scene. This portrait embodies a delicate balance between simplicity and emotional depth, making it suitable for fine art, editorial, or fashion photography.