ElevenLabs破66億估值:聲音AI已成過去式?

Ai

ElevenLabs估值突破66億美元 執行長指語音已不再是賺錢關鍵

ElevenLabs以打造逼真AI語音聞名。這間由兩位對電影配音品質不滿的波蘭工程師創立的公司,如今已成為盈利企業,估值高達66億美元,較九個月前翻倍。公司最近宣布了一項由紅杉資本(Sequoia)和ICONIQ領投、a16z等參與的1億美元股份回購計劃,旗下技術廣泛應用於《Fortnite》遊戲角色及客服機械人,並與OpenAI競爭,力求成為AI語音的標準。

在TechCrunch的Equity播客中,ElevenLabs執行長Mati Staniszewski於今年Disrupt大會上透露一個驚人看法:他認為語音模型在未來幾年將成為商品化,利潤空間會大幅縮小。那麼ElevenLabs在競爭激烈的環境中,下一步有何策略?

完整訪談中,Staniszewski談及:

– 為何ElevenLabs正從語音模型轉型為打造對話式AI代理平台
– 公司如何利用水印技術、AI檢測及設備認證,應對深度偽造問題
– 他相信不久將出現AI生成內容超越人類創作的趨勢
– ElevenLabs如何進軍音樂生成領域,並與視頻模型合作推動音頻與影像融合

ElevenLabs的發展路線,從專注語音合成到拓展更廣泛的AI對話及內容生成,反映了AI產業快速演變的現實。語音技術曾是AI應用的熱門切入點,但隨著技術普及和競爭加劇,單純的語音合成已難以帶來持續的利潤。ElevenLabs選擇轉型,尋求差異化競爭優勢,尤其在對話式AI和多媒體融合方面布局,顯示其前瞻性的戰略眼光。

此外,公司對於深度偽造(deepfake)問題的重視,也透露出AI技術發展同時帶來的倫理和安全挑戰。透過水印和設備認證等技術手段,ElevenLabs嘗試建立信任機制,這對整個AI音視頻產業的健康發展至關重要。

從更廣的角度看,Staniszewski提及未來AI生成內容將超越人類創作,這既是技術進步的必然,也是對內容生產者及社會文化的一大挑戰。如何在AI創作與人類創意間找到平衡,保護知識產權與創作多樣性,將是業界和監管機構需要共同面對的課題。

總的來說,ElevenLabs的成長故事和轉型策略,不僅體現了AI語音技術的變遷,也揭示了整個AI產業從單一技術突破到綜合應用平台的發展趨勢。對香港及全球讀者而言,這是了解AI產業新動態、洞察未來科技趨勢的寶貴視角。

以上文章由特價GPT API KEY所翻譯及撰寫。而圖片則由FLUX根據內容自動生成。

🎨 Nano Banana Pro 圖像生成器|打幾句說話就出圖

想畫人像、產品圖、插畫?SSFuture 圖像生成器支援 Flux Gemini Nano Banana Pro 改圖 / 合成, 打廣東話都得,仲可以沿用上一張圖繼續微調。

🆓 Flux 模型即玩,不用登入
🤖 登入後解鎖 Gemini 改圖
📷 支援上載參考圖再生成
⚡ 每天免費額度任你玩
✨ 即刻玩 AI 畫圖
A close-up of a young man (as in the uploaded image) Dominates the right side of the frame, viewed from a low-angle perspective, giving him a powerful and imposing presence. He is wearing a simple white tank top (muscle shirt) and tan or light-yellow cargo shorts. He is holding a wooden baseball bat slung over his right shoulder. His expression is serious and determined.

JustinE the Rottweiler: Located on the left, chained to man. JustinE is in an aggressive stance, mouth wide open in a fierce snarl or bark, showing his teeth and tongue, with saliva visible. He is wearing a thick, heavy silver chain leash and collar, with a small circular dog tag visible that says "JustinE". The dog is large and muscular, facing the viewer.

Leash: A heavy-gauge silver metal chain connects JustinE's collar to man's hand.

Lighting: Strong, harsh daylight typical of Southern California, creating deep shadows and high contrast, enhancing the dramatic feel.

Background/Setting: A Southern San Andreas (Los Santos) street setting. The background is slightly blurred but suggests a hot, dry, urban environment with power lines/telegraph wires and a glimpse of an older vehicle (possibly a green van) in the lower-left corner. The sky is a bright, clear, light yellow-green color, indicative of a hot day.

hard shadows, and a distinct color palette typical of GTA loading screens and cover.

Highly detailed and rendered. Vibrant, cinematic. Create a photorealistic and highly detailed image featuring the attached image walking confidently down a modern city street, accompanied by Jason Statham, Dwayne “The Rock” Johnson, and Jason Momoa acting as bodyguards.

John Wick (Keanu Reeves) is walking just beside or slightly behind the subject, holding an umbrella over him to shield from light rain.

The subject should be the central figure, wearing stylish casual clothing — like a fitted jacket, dark jeans, and sunglasses — exuding calm authority and cool charisma.

Statham, The Rock, and Momoa are dressed in black tactical-style suits, maintaining alert, protective stances, scanning the surroundings like professional bodyguards. John Wick wears his signature black suit and tie, looking composed as he holds the umbrella.

The setting is a downtown urban street with wet pavement reflecting city lights, parked luxury cars, and paparazzi in the background snapping photos.

The photo should look like a real paparazzi shot — slightly off-angle, mid-step motion blur, with realistic lighting and reflections.

Lighting: natural daylight with overcast skies, reflections from wet concrete, realistic shadows, subtle raindrops on the umbrella and clothing.

Camera realism: crisp detail on facial features and clothing textures, shallow depth of field emphasizing the group, with lens flare or light bloom for authenticity.

Mood & tone: grounded, cinematic, and stylish — feels like a moment from a celebrity entourage photo or action-movie press capture, taken with an iPhone by paparazzi.

Style: ultra-realistic, documentary-style street photography with modern cinematic sharpness. [Subject]: Young Asian female with "Imada Mio-inspired" doll-like aesthetic (精緻洋娃娃臉). She has large round expressive eyes, a small V-line face, and rosy cheeks. Her expression is innocent, energetic, and slightly flirty. [Hair]: Messy morning hair (剛睡醒的凌亂感), long dark brown hair, slightly tousled, natural volume. [Outfit]: Wearing an oversized translucent white button-down shirt (男友風白襯衫), unbuttoned at the top to reveal collarbones, creating a "bottomless" look (下衣失蹤風格). [Style]: Japanese Gravure Photobook style (寫真集風格), Pure & Sexy vibe, bright high-key lighting, soft skin texture, Fujifilm PRO 400H color tone.