Free AI Music Tool YuE: Is Open Source Really This Good?!


I made music with this free AI music creation tool, and the quality is surprisingly good

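AI music generation has been one of the more stable corners of the AI revolution over the past two years. Two dominant companies, Suno and Udio, have built trusted reputations and fan bases in this niche market.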

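That comfortable status quo may be about to change dramatically, though. A new music generation platform called YuE has just launched. It is free and open source, and the quality of the music it produces is surprising.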

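YuE, which means "music" and "happiness" in Chinese, is actually a set of models that work together to deliver a complete piece of music. These models cover lyrics, instrumentation, and musical genre. As with many of the new Chinese AI models, YuE's openness has encouraged a wave of homegrown development, mostly aimed at reducing its compute requirements so that more people can use the tool.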

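The original project needs at least 24GB of video RAM (VRAM), and the official recommendation for creating full songs is still a minimum of 80GB. That is clearly out of reach for ordinary home users. It is aimed mainly at professionals, businesses, and academia.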

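The good news is that a lot of effort has gone into creating smaller packages for the general public, including work by the popular Pinokio platform, which lets anyone run open-source AI projects on Windows quickly and easily.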

Trade-offs

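The trade-off with these low-VRAM versions is that audio quality does drop and generation can be extremely slow. Even with Pinokio, the baseline requirement is still 12GB of VRAM, which is unrealistic for most computers. Recently, however, an inventive user released an ultra-low-memory version, which let me experiment on my system with just an 8GB RTX GPU.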
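The VRAM figures mentioned so far can be collected into a small lookup, as a rough sketch in Python. The tier labels and the `best_tier` helper are hypothetical names for illustration, not part of YuE or Pinokio. The 8GB entry is an assumption: it is simply the card this article's tests ran on, not a documented minimum.

```python
from typing import Optional

# VRAM figures (in GB) as quoted in this article, largest first.
# The labels are descriptive only; 8 GB is the card used here, not an official minimum.
VRAM_TIERS = [
    (80, "original project, full songs (official recommendation)"),
    (24, "original project, minimum"),
    (12, "Pinokio package"),
    (8, "community ultra-low-memory build"),
]


def best_tier(available_gb: float) -> Optional[str]:
    """Return the most capable tier that fits in the given VRAM, or None."""
    for required_gb, label in VRAM_TIERS:
        if available_gb >= required_gb:
            return label
    return None


if __name__ == "__main__":
    for gb in (4, 8, 16, 24, 80):
        print(gb, "GB ->", best_tier(gb))
```

A 16GB card, for instance, lands in the Pinokio tier because it falls short of the original project's 24GB minimum.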

Here is what I created:

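The first impression is a very polished Gradio user interface. On the left of the screen is the prompt box, below it a box for entering lyrics, and then the number of tracks you want to generate. You can also set how much RAM you want to use, which is tied to the length of the song and the number of sections.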

Press the generate button, then sit back and wait for the platform to produce the track.

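The developer claims that with a 16GB VRAM GPU, a one-minute track takes only four minutes to create. Unfortunately, generation time does not appear to scale gracefully as memory drops: on my 8GB system, two tracks of 40 and 50 seconds took 2 and 2.5 hours to generate, respectively.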
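To put those numbers side by side, here is a minimal Python sketch that converts each figure into compute seconds per second of audio. The `realtime_factor` helper is a hypothetical name for illustration. The inputs are only the developer's claim and the two timings reported above, so the result is a rough comparison rather than a benchmark.

```python
def realtime_factor(generation_seconds: float, audio_seconds: float) -> float:
    """Seconds of generation time spent per second of finished audio."""
    return generation_seconds / audio_seconds


# Developer's claim: a 60 s track in 4 minutes on a 16 GB GPU.
claimed_16gb = realtime_factor(4 * 60, 60)

# Timings from this article: 40 s in 2 h and 50 s in 2.5 h on an 8 GB GPU.
measured_8gb_first = realtime_factor(2 * 3600, 40)
measured_8gb_second = realtime_factor(2.5 * 3600, 50)

# How much slower the 8 GB runs were than the 16 GB claim.
slowdown = measured_8gb_first / claimed_16gb

if __name__ == "__main__":
    print(claimed_16gb, measured_8gb_first, measured_8gb_second, slowdown)
    # prints 4.0 180.0 180.0 45.0
```

On these figures, halving the VRAM cost about 45 times as much generation time per second of audio. Both 8GB runs work out to the same rate.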


Either way, the tracks are astonishing. They are short and the audio quality is not top-tier, but the musicality is stunning.

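The last time I tested AI music generation on my own computer, it sounded like a grubby '90s arcade machine. This time it is real music: it follows the prompt accurately, with excellent vocals and the kind of instrumentation you would expect from a commercial AI service.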

You can hear more of the results on SoundCloud here:

Final thoughts

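The project is still very rough, and the compute it demands is staggering. Even with a decent computer, you will spend a lot of time waiting for tracks to generate. But, and this is a big "but", despite these drawbacks it is an impressive first attempt at an open product in this space.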

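If open-source AI music generation can already produce this level of quality, commercial services such as Udio and Suno will soon feel real pressure from the DIY community.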

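This article made me reflect on the potential of open-source technology and its impact on the commercial market. YuE is not only an innovation in music creation but also a challenge to existing business models. As more open-source projects emerge, making music may no longer depend on expensive commercial software, which would make creation more democratic. That would help diversify who gets to make music, and it could also change the ecosystem of the music industry as a whole.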

The above article was translated and written using 特價GPT API KEY. The images were generated automatically by FLUX based on the content.
