Janus
p/janus-2
Unified Multi-Modal AI by DeepSeek
Zac Zuo
Janus β€” Unified Multi-Modal AI by DeepSeek
Featured
10
β€’
The Janus series by DeepSeek offers powerful AI models for unified multimodal understanding and generation. It includes Janus-Pro (advanced reasoning), Janus (decoupled visual encoding), and JanusFlow (harmonized autoregression and rectified flow).
Replies
Zac Zuo
Hunter
πŸ“Œ
Hey everyone! DeepSeek is ON FIRE! πŸ”₯ They just dropped the Janus series – a new family of AI models focused on unified multimodal understanding and generation. Here's the breakdown: ✨ Janus-Pro: The top-tier model, trained with more data and a larger size for advanced multimodal reasoning and high-quality image generation. 🧩 Janus: Features a decoupled visual encoding architecture, offering flexibility and strong performance in vision-language tasks. ⚑ JanusFlow: Integrates rectified flow with an autoregressive model for enhanced generative capabilities. The Janus series stands out by unifying both understanding and generation across vision and language in a single framework. DeepSeek is also pushing the boundaries with novel architectures, including decoupled visual encoding and the integration of rectified flow. You can download the models now and explore their capabilities!
Masum Parvej
@zac_zuo ..the integration of rectified flow with an autoregressive model in JanusFlow is fascinating, how does it enhance generative capabilities compared to traditional methods? would love to learn more ....
Masum Parvej
@zac_zuo wait a minute! the @Janus series sounds nothing less than revolutionary!
Jayaram Babu
The Janus series by DeepSeek is a revolutionary leap in AI technology! 🌟 From Janus-Pro offering advanced reasoning to Janus with its cutting-edge decoupled visual encoding, and JanusFlow harmonizing autoregression β€” this suite is truly next-level. πŸš€ Whether you need powerful insights, seamless visual understanding, or intelligent flow, the Janus models are designed to elevate your AI projects like never before! πŸ’₯πŸ” I'm excited to see the endless possibilities these models unlock! πŸ™ŒπŸ”₯
Kelvin Ikhide
384x384 input isn't going to do it for me... but let's wait and see.
Chris Messina
Top Hunter
Looks like @Stable Diffusion has a new competitor... which is free and MIT licensed!
Rohan Chaubey
@stable @chrismessina Looks like input is limited to 384 x 384
Zac Zuo
Hunter
@stable @chrismessina @rohanrecommends Yes the resolution (both input and output) is the major limitation of these models currently. DeepSeek team is working on that for future versions, as mentioned in their paper.
yanshuo
Launching soon!
Really impressive job, congrats!
Max Comperatore
bruh deepseek is absolutely crushing everything. keep going guys. nvidia bankrupt lmao. DEEPSEEK ROCKS LFG
Daniel Stewart
Love how it gets both aesthetic and functional requirements. Could use better support for CAD file formats though.