Hacker News with Generative AI: Video Generation

Google's Veo 3 Is Already Deepfaking All of YouTube's Most Smooth-Brained Content (gizmodo.com)
Wake up, babe, new viral AI video generator dropped. This time, it’s not OpenAI’s Sora model in the spotlight, it’s Google’s Veo 3, which was announced on Tuesday during the company’s annual I/O keynote. Naturally, people are eager to see what chaos Veo 3 can wreak, and the results have been, well, chaotic. We’ve got disjointed Michael Bay fodder, talking muffins, self-aware AI sims, puppy-centric pharmaceutical ads—the list goes on.

Artificial Intelligence, Video Generation, Google, YouTube, Social Media

27 points by thunderbong 57 days ago | 20 comments

AniSora: Open-source anime video generation model (komiko.app)
AniSora is the most powerful open-source animated video generation model developed by Bilibili.

Open Source, Artificial Intelligence, Generative AI, Anime, Video Generation

356 points by PaulineGar 62 days ago | 218 comments

Show HN: LTXV 13B Distilled – Generate 5s Videos in Under 10s (lightricks.com)
An open-source AI model that generates high-quality video content in seconds, built for speed, storytelling and creative control

AI, Video Generation, Open Source, Machine Learning, Creative Tools

10 points by statusreport 66 days ago | 2 comments

LTXVideo 13B AI video generation (ltxv.video)
A groundbreaking 13B-parameter AI model by Lightricks, revolutionizing video creation with unprecedented speed and quality. 30x faster than comparable models, powered by advanced multiscale rendering technology.

Artificial Intelligence, Video Generation, Technology, AI Models

216 points by zoudong376 70 days ago | 64 comments

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation (lllyasviel.github.io)
Diffuse thousands of frames at a full 30 fps with 13B models using 6 GB of laptop GPU memory. Fine-tune a 13B video model at batch size 64 on a single 8xA100/H100 node for personal or lab experiments. A personal RTX 4090 generates at 2.5 seconds/frame (unoptimized) or 1.5 seconds/frame (with TeaCache). No timestep distillation. Video diffusion, but feels like image diffusion.

Video Generation, Deep Learning, Computer Vision, Artificial Intelligence

270 points by GaggiX 91 days ago | 27 comments

Tom and Jerry One-Minute Video Generation with Test-Time Training (test-time-training.github.io)
Adding TTT layers into a pre-trained Transformer enables it to generate one-minute videos with strong temporal consistency and motion smoothness.

Generative AI, Computer Vision, Machine Learning, Deep Learning, Video Generation

80 points by walterbell 102 days ago | 18 comments

Show HN: VaporVibe – auto-generate video demos for vibe-coded projects (influme.ai)

Software, Open Source, Video Generation, AI

5 points by gmdnn 108 days ago | 5 comments

Fast Video Generation with Sliding Tile Attention (hao-ai-lab.github.io)
TL;DR: Video generation with DiTs is painfully slow – HunyuanVideo takes 16 minutes to generate just a 5-second video on an H100 with FlashAttention3. Our sliding tile attention (STA) slashes this to 5 minutes with zero quality loss, no extra training required. Specifically, STA accelerates attention alone by 2.8–17x over FlashAttention-2 and 1.6–10x over FlashAttention-3.

Video Generation, Artificial Intelligence, Computer Vision, Optimization, Performance

12 points by zhisbug 150 days ago | 2 comments

Goku: Flow Based Video Generative Foundation Models (github.com/Saiyan-World)
Goku is a new family of joint image-and-video generation models based on rectified flow Transformers.

Generative AI, Video Generation, Deep Learning, Transformers

34 points by lastdong 158 days ago | 11 comments

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation (hila-chefer.github.io)
Despite tremendous recent progress, generative video models still struggle to capture real-world motion, dynamics, and physics.

Video Generation, Computer Vision, Artificial Intelligence, Machine Learning

8 points by geox 163 days ago | 0 comments

Veo 2: Our video generation model (deepmind.google)
Veo creates videos with realistic motion and high quality output, up to 4K. Explore different styles and find your own with extensive camera controls.

Video Generation, Artificial Intelligence, Deep Learning, Google, Computer Vision

587 points by mvoodarla 215 days ago | 327 comments

Veo and Imagen 3: Announcing new video and image generation models on Vertex AI (cloud.google.com)
Generative AI is leading to real business growth and transformation. Among enterprise companies with gen AI in production, 86% report an increase in revenue, with an estimated 6% growth. That’s why Google is investing in its AI technology with new models like Veo, our most advanced video generation model, and Imagen 3, our highest quality image generation model.

Generative AI, Google Cloud, Video Generation, Image Generation

30 points by srameshc 227 days ago | 15 comments

OpenAI's Sora has been leaked (techcrunch.com)
A group appears to have leaked access to Sora, OpenAI’s video generator, in protest of what they’re calling duplicity and “art washing” on OpenAI’s part.

OpenAI, Generative AI, Video Generation, Artificial Intelligence, Leaks

25 points by cut3 235 days ago | 3 comments

The Matrix: a foundation world model for generating infinite-length videos (twitter.com)

Generative AI, Computer Vision, Video Generation, Artificial Intelligence

4 points by outrun86 240 days ago | 0 comments

Mochi 1 preview. A new SOTA in open-source video generation. Apache 2.0 (twitter.com)

Open Source, Video Generation, Artificial Intelligence, Software

15 points by modeless 270 days ago | 6 comments

Sora-like text-to-video model from Chinese startup Minimax, 10 examples (twitter.com)

Generative AI, Video Generation, China, Artificial Intelligence, Startups

10 points by eh_why_not 318 days ago | 2 comments

CogVideoX: A Cutting-Edge Video Generation Model (medium.com)