Unveiling the Latest in Generative Media Innovations

Published On Sat Feb 22 2025

Today in Generative Media - What's new in Generative Media

I’m Not Convinced Ethical Generative AI Currently Exists (Wired)

CEO of Clearview AI, a controversial facial recognition startup, has resigned (TechCrunch)

“AI is merely a mechanism”: how Superside built a human-led AI brand (Creative Bloq)

A.I. Is Changing How Silicon Valley Builds Start-Ups (New York Times)

New Google AI Leak Reveals Powerful Gemini Upgrade (Forbes)

Latest Projects and Research

A Very Good Question: and a plausible answer about why language models and video models are on different paths (Mike Gioia on Substack)

Project Starlight is a groundbreaking AI research preview by Topaz Labs that transforms low-resolution and degraded video into HD quality.

Diffusion Models for Video Generation

VMS [Video Model Studio] is a Gradio app that wraps Finetrainers to provide a simple UI for training AI video models on Hugging Face. You can deploy it to a private Space and start long-running training jobs in the background. (GitHub)

PaliGemma 2 Mix - New Instruction Vision Language Models by Google (HuggingFace)

Gemini Deep Research rolling out to Google Workspace (9to5Google)

ByteDance AI Researchers Introduce 'MagicVideo'

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation (project page)

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment (project page)

YOLOv12: Attention-Centric Real-Time Object Detectors (arXiv)

YOLOv12 drops the CNN-centric design in favor of an attention-based model and reportedly outperforms other real-time object detection models without losing speed. (X)

The Ultra-Scale Playbook: Training LLMs on GPU Clusters (HuggingFace)