Demystifying the AI "World Model": Why Do LLMs Seem to Understand the World, and How Do Values Shape Them?

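1. Origin of the term "world model"

In natural language processing, large language models (LLMs) are sometimes called "world models" mainly because they appear to extract, from vast amounts of text, knowledge that spans language, concepts about the world, physical phenomena, social norms, and common sense. Although a language model is fundamentally trained to predict the next word or token, it absorbs a broad range of knowledge and concepts along the way, which gives the impression that it understands the world.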
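To make "predicting the next token" concrete, here is a minimal sketch of a bigram counter. It is a toy stand-in and not how a real LLM is built, and the corpus and function names are assumptions made for illustration. It shows how purely statistical next-token prediction can already look like a small piece of knowledge about the world ("ice melts").

```python
from collections import Counter, defaultdict


def train_bigram(corpus):
    """Count, for each token, which tokens follow it in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.split()
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if unseen."""
    followers = counts.get(token)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical toy corpus, chosen only for illustration.
corpus = [
    "ice melts in heat",
    "ice melts in sunlight",
    "snow melts in heat",
]
model = train_bigram(corpus)
print(predict_next(model, "ice"))  # melts
print(predict_next(model, "in"))   # heat
```

The model has no notion of temperature or physics. It only records which token tends to follow which, yet its output reads like a fact about ice.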

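2. Similarities and differences between "knowledge of the world" and token processing

Similarities

An LLM represents each token as a vector (an array of numbers) and treats it as a point in a vector space. Knowledge about the world accumulates in that same vector space as statistical patterns.

In principle, then, "knowledge about the world" and "token information" share the same form of representation, with no sharp boundary between them.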
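As a rough illustration of tokens as points in a vector space, the sketch below compares hand-picked three-dimensional vectors using cosine similarity. The vectors are hypothetical values chosen for the example; real models learn embeddings with hundreds or thousands of dimensions.

```python
import math

# Hypothetical 3-dimensional embeddings, hand-picked for illustration only.
embeddings = {
    "ice": [0.9, 0.1, 0.0],
    "snow": [0.8, 0.2, 0.1],
    "fire": [-0.7, 0.3, 0.1],
}


def cosine(u, v):
    """Cosine similarity: dot product divided by the product of the norms."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


ice_snow = cosine(embeddings["ice"], embeddings["snow"])
ice_fire = cosine(embeddings["ice"], embeddings["fire"])
print(ice_snow > ice_fire)  # True
```

In this toy space the relation "ice is more like snow than like fire" is not stored as a separate fact. It falls out of where the token vectors sit, which is the sense in which token information and world knowledge share one representation.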

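Differences

Token information is local and concrete, whereas knowledge or concepts about the world are abstract, complex contextual patterns that emerge from the relationships among many tokens.

In other words, world knowledge is stored in the vector space as highly structured information that goes well beyond the level of individual words.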

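3. How human values are reflected

Building a large language model involves human decisions about hyperparameter tuning, dataset selection, and model design, so developers' values and societal biases are reflected in the model, whether directly or indirectly. Fine-tuning through reinforcement learning from human feedback (RLHF) in particular introduces explicit human values and social norms into the model.

The worldview an LLM presents is therefore not an "objective, neutral world"; it should be regarded as a "subjective, abstract world model designed by humans".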
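One place where human judgment enters RLHF numerically is the reward model, which is commonly trained on pairs of responses that human raters have ranked. The sketch below shows one widely used pairwise formulation, the negative log-sigmoid of the reward margin. The article does not specify any particular loss, so treat this as an assumed example; the function name and reward values are hypothetical.

```python
import math


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise preference loss: -log(sigmoid(r_chosen - r_rejected)).

    Uses the identity -log(sigmoid(m)) == log(1 + exp(-m)).
    """
    margin = reward_chosen - reward_rejected
    return math.log1p(math.exp(-margin))


# Hypothetical reward scores for two responses to the same prompt.
agrees_with_rater = preference_loss(2.0, 0.0)     # model ranks them as the rater did
disagrees_with_rater = preference_loss(0.0, 2.0)  # model ranks them the other way
print(agrees_with_rater < disagrees_with_rater)   # True
```

The loss is small only when the model scores the human-preferred response higher, so whatever the raters prefer, including their values and biases, is what the reward model is pushed to reproduce.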

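4. Conclusion

The expression "world model" is a metaphor that arises from people's intuitive impression of the abstract, complex knowledge a model acquires from vast data. In practice, world knowledge is stored in the vector space just as tokens are, but expressed abstractly through more complex relationships.

Developers' values inevitably influence the model through hyperparameter tuning and training methods, so an LLM's world model is not purely objective.

When using the term "world model", we therefore need to understand the context and limitations behind it. Perhaps "world model" is merely a term that AI startups have packaged to win high valuations from venture capital. Everyone should be careful in how they use words. The AI era means that the "power of words" has a wider reach, because besides humans, AI, a second kind of "species", has also begun to understand language.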

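In this piece, the author explores the concept of the "world model" and the meaning behind it, and stresses how human values are reflected in AI models. It is not only a technical analysis but also a reflection on the relationship between humans and AI. As AI technology develops, we must be more alert to the ethical and social effects it may bring. When designing and using these models, we should keep thinking critically so that we do not unintentionally amplify social biases or mistaken worldviews. This is an issue that deserves attention from society as a whole, and future AI development should proceed within a more responsible framework.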

