阿里巴巴發布全新開源影片生成模型




阿里巴巴推出最新開源視頻生成模型

阿里巴巴近日推出了其最新的開源視頻生成模型Wan2.1-FLF2V-14B。該模型旨在簡化視頻創作過程,允許用戶輸入起始和結束幀,以實現視頻的自動生成。這一創新技術將為短視頻創作者提供更大的創作自由,幫助他們高效且經濟地開發自己的AI模型和應用程序。

Wan2.1-FLF2V-14B是阿里巴巴基礎模型Wan2.1系列的一部分,該系列專門設計用於從文本和圖像生成高質量的圖像和視頻。該模型現已在Hugging Face、GitHub以及阿里巴巴雲的開源社區ModelScope上開放源碼。

該模型在執行用戶指令、保持首幀和生成視頻的一致性以及實現首尾幀之間的平滑過渡方面表現出色。它能夠根據用戶的提示詞生成720p分辨率的5秒視頻,且免費使用。

Wan2.1-FLF2V-14B的核心技術是一種創新的視頻生成方法,通過增加一個控制調整機制,實現了視頻生成的精確控制。該機制利用用戶提供的序列首尾幀作為控制條件,實現了首尾幀之間的平滑過渡。

為了確保視覺穩定性,該機制有助於將首尾幀的語義特徵注入生成過程,使模型在風格、內容和結構上保持一致性,同時動態轉換幀。

作為最早開放源碼的大型AI模型的全球科技公司之一,阿里巴巴雲在2025年2月開放源碼了四個Wan2.1模型。截至目前,這些模型在Hugging Face和ModelScope上已吸引了超過220萬次下載。

今年早些時候發布的Wan2.1系列是首個支持中文和英文文本效果的視頻生成模型。它在視頻生成模型綜合基準測試VBench排行榜上排名第一。

阿里巴巴雲於2023年8月發布了其首個開放大型語言模型(LLM)Qwen-7B。Qwen的開放模型一直在Hugging Face Open LLM排行榜上名列前茅,其性能與全球領先的AI模型相匹配。

近年來,阿里巴巴雲已開放源碼超過200個生成式AI模型。目前,已有超過10萬個基於Qwen家族模型的衍生模型在Hugging Face上開發,使其成為全球最著名的AI模型家族之一。

阿里巴巴視頻生成模型

作為編輯,我認為阿里巴巴此次推出的開源視頻生成模型Wan2.1-FLF2V-14B具有里程碑式的意義。它不僅為短視頻創作者提供了更多的創作自由,還推動了AI技術在視頻生成領域的發展。同時,這也反映了阿里巴巴在推動AI技術開源和共享方面的 commitment。

然而,值得注意的是,隨著AI技術的不斷發展,視頻生成模型也面臨著諸多挑戰,如生成的視頻質量、內容審核等。因此,阿里巴巴需要繼續投入研發,不斷優化模型性能,以滿足用戶日益增長的需求。

此外,阿里巴巴的開放源碼策略也值得關注。通過開放源碼,阿里巴巴不僅推動了AI技術的發展,還促進了產業鏈的協同創新。這對於推動中國AI產業的發展具有積極意義。

總之,阿里巴巴此次推出的開源視頻生成模型Wan2.1-FLF2V-14B是AI技術發展的一個重要里程碑。它不僅為短視頻創作者提供了更多的創作自由,還推動了AI技術在視頻生成領域的發展。未來,阿里巴巴需要繼續投入研發,不斷優化模型性能,以滿足用戶日益增長的需求。

🎬 YouTube Premium 家庭 Plan成員一位 只需 HK$148/年

不用提供密碼、不用VPN、無需轉區
直接升級你的香港帳號 ➜ 即享 YouTube + YouTube Music 無廣告播放

立即升級 🔗

🎨 Nano Banana Pro 圖像生成器|打幾句說話就出圖

想畫人像、產品圖、插畫?SSFuture 圖像生成器支援 Flux Gemini Nano Banana Pro 改圖 / 合成, 打廣東話都得,仲可以沿用上一張圖繼續微調。

🆓 Flux 模型即玩,不用登入
🤖 登入後解鎖 Gemini 改圖
📷 支援上載參考圖再生成
⚡ 每天免費額度任你玩
✨ 即刻玩 AI 畫圖
一隻在香港茶餐廳喝奶茶的貓 Generate an ultra-realistic, highly ultra-detailed, 8k resolution with 1080x1080 pixel portrait of me using the uploaded image for reference (preserved the likeness and the original face for reference) of a cinematic studio portrait of a woman seated on a simple wooden chair with a minimalist design, positioned slightly to the left of the frame. She is captured in a contemplative pose, with her body turned to the left, her left arm resting gracefully on the back of the chair, and her right hand gently touching her face near her lips, conveying a sense of introspection and elegance. Her long, wavy hair cascades naturally over her shoulders, framing her face and adding softness to the composition. She wears an oversized, textured knit sweater that slips off her shoulders, exposing her collarbones and upper chest, emphasizing a relaxed and intimate mood. Her legs are bare, with her right foot flat on the ground and her left knee slightly raised, creating a dynamic line that guides the viewer’s eye through the composition. *** The background is a seamless, deep charcoal or dark brown studio backdrop, providing a rich, neutral setting that enhances the dramatic lighting. The lighting setup features a single, soft yet directional light source positioned to the left of the subject, casting gentle, sculptural shadows that highlight the contours of her face, shoulders, and arms, while creating a subtle gradient across her form. The light accentuates the texture of her sweater and the natural shine of her hair, adding depth and dimension to the image. The color palette is monochromatic with warm, muted tones—shades of gray, brown, and beige—contributing to a timeless, artistic aesthetic. The image is shot with a professional full-frame camera using an 85mm or 50mm lens at a wide aperture (f/1.8 to f/2.😎 to achieve a shallow depth of field, ensuring the subject is in sharp focus while the background remains softly blurred. The resolution is ultra-high, capturing every detail from the fine texture of her sweater to the subtle expression of her pose. The overall style is elegant, contemplative, and refined, emphasizing mood and atmosphere over overt glamour. Post-processing is minimal, maintaining natural skin tones, enhancing contrast and clarity, and preserving the authenticity of the scene. This portrait embodies a delicate balance between simplicity and emotional depth, making it suitable for fine art, editorial, or fashion photography. A dynamic, ultra-realistic action shot of a snowboarder performing a high-air jump on a snowy mountain slope. The rider wears a bright green winter jacket, black snow pants, gloves, and a dark beanie, with reflective goggles catching the cold mountain light. A cloud of visible breath escapes from the rider’s mouth in the freezing air. Snow explodes upward from the snowboard, creating sharp, frozen particles suspended mid-air. The background features a dramatic high-altitude landscape with forested slopes and distant mountains under soft, cold blue lighting. Capture cinematic contrast, DSLR realism, 85mm lens, f/2.8, crisp details, slow-motion energy, dynamic composition, atmospheric depth, high-clarity sports photography.