All activity

Zac Zuo
left a comment
Hi everyone!
Check out Instella, a new family of 3B language models from AMD. These models are interesting because they're achieving performance comparable to larger, open-weight models, but with a smaller footprint.
Key points:
š Strong Performance: Outperforms other 3B models, and rivals some larger open-weight models like Llama-3.2-3B and Gemma-2-2B.
āØ Reasoning Focus: Second stage of...

Instella
Open 3B Small LMs from AMD

Zac Zuo
left a comment
Hi everyone!DuckDuckGo brings Duck.ai, a free and private AI chatbot! It's now out of beta and offers a really interesting alternative to other AI chat services.What makes it different:š”ļø Privacy-Focused: Chats are anonymized and not used for training the AI models. DuckDuckGo proxies the requests, so your IP address isn't shared.š¤ Multiple Models: You can choose from a selection of models,...
Duck.ai
Private AI Chat from DuckDuckGo



Aya Vision, from Cohere For AI, is the open-weights, multilingual, multimodal models (8B & 32B). Outperforms larger models on multilingual vision tasks. Available on Hugging Face and Kaggle.

Aya Vision
Multilingual, Multimodal AI from Cohere

Zac Zuo
left a comment
Hi everyone!
HunyuanVideo-I2V is a new open-source image-to-video model from Tencent, and it's a significant addition to the growing AI video space! This builds on their previous HunyuanVideo project, but focuses specifically on turning still images into videos.
Here's the detail:
š¼ļø Image-to-Video: Give it an image and a text prompt, and it generates a video.
š
High Resolution: Can generate...

HunyuanVideo-I2V
High-Res Image-to-Video with LoRA

HunyuanVideo-I2V, from Tencent, is a open-source image-to-video generation model. Up to 720p resolution, 129 frames. Supports custom LoRA training for unique effects.

HunyuanVideo-I2V
High-Res Image-to-Video with LoRA

With Hailuo Video App from MiniMax, you could easily create AI videos (text & image) with "Director Mode" for cinematic camera control ā all on your phone! iOS, Android, web.

Hailuo Video App
Direct AI Movies, From Your Phone

ToddlerBot is an open-source, low-cost humanoid robot platform from Stanford, designed for machine learning research in locomotion and manipulation. Easy to build and fully customizable.

ToddlerBot
Build Your Own Humanoid Robot

Zac Zuo
left a comment
Hi everyone!
Hailuo AI is now going to mobile! It's like having a complete AI video studio right in your pocket! This app, from MiniMax, lets you create amazing videos using just text or images.
The really cool part is the new "Director Mode", powered by their newest I2V-01-Director and T2V-01-Director models.
You could control camera movements (like pan, zoom, tilt, dolly, even Hitchcock...

Hailuo Video App
Direct AI Movies, From Your Phone

Zac Zuo
left a comment
Hi everyone!Check out QwQ-32B, a new open-source language model from the Qwen team. It's achieving something remarkable: reasoning performance comparable to DeepSeek-R1, but with a model that's 20 times smaller (32B parameters vs. 671B)!This is a big deal because:š¤Æ Size/Performance Ratio: It punches way above its weight class in reasoning, math, and coding tasks.š§ Scaled Reinforcement Learning:...

QwQ-32B
Matching R1 reasoning yet 20x smaller


Zac Zuo
left a comment
Hi everyone!AI Mode is clearly the punch back to Perplexity. Google is trying to take the search experience to the next level, using a custom version of Gemini 2.0 to tackle complex, multi-part questions that might normally require multiple searches.It's currently available as an opt-in experiment in Labs for Google One AI Premium subscribers. They're gradually rolling it out to more...

Google Search AI Mode
Go beyond search

AI Mode is a new generative AI experiment in Google Search. Get AI-powered answers to complex questions, with links to sources. Uses a custom Gemini 2.0 model for multi-step reasoning.

Google Search AI Mode
Go beyond search

Zac Zuo
left a comment
Hi everyone!Check out Aya Vision, a new set of open-weights models from Cohere For AI, and this is a significant step towards making AI truly global! Most vision-language models are heavily biased towards English. Aya Vision tackles this head-on by supporting 23 languages spoken by over half the world's population.Here's why it's important:š Multilingual by Design: Excels at understanding and...

Aya Vision
Multilingual, Multimodal AI from Cohere


Zac Zuo
left a comment
Hi everyone!Sharing Colab Agent, a new tool integrated directly into Google Colab that could seriously speed up data science workflows! It's powered by Gemini 2.0 and lets you generate entire, runnable Colab notebooks just by describing your analysis goals in natural language.Here's how it works:1. Upload Data: CSV, JSON, or .txt files (up to 1GB).2. Describe Your Goal: Tell it what you want to...

Colab Agent
Go from data to insights

Simplify data tasks with the new Data Science Agent, powered by Gemini, and generate functional notebooks, now available free in Google Colab.

Colab Agent
Go from data to insights

Zac Zuo
left a comment
Hi everyone!Sharing Sesame's Conversational Speech Model (CSM), and this is a big step beyond typical text-to-speech. The goal is to achieve what Sesame calls "voice presence": making spoken interactions feel real, understood, and valued.A PH version of this model System Card is :)š Emotional Context: It tries to understand and respond to the emotion in the conversation.ā±ļø Conversational...

Sesame
Conversational speech model that achieves voice presence

Zac Zuo
left a comment
Hi everyone!Sharing CogView4, a new open-source text-to-image model from the ChatGLM team, and it's got some seriously impressive capabilities!What stands out:š¼ļø Native 2K Resolution: Generates images at 2048x2048 natively ā no need for upscaling.š (Almost) Unlimited-Length Prompts: Supports verrrrrrrrry long and detailed prompts, in both Chinese and English.š In-Image Text Generation: Can...

CogView4
Open-Source, 2K Resolution Text-to-Image from ChatGLM