Google、テキスト-to-ビデオAI「Imagen Video」を公開

xguru · 2022-10-07T10:52:01+09:00

Video Diffusion Modelにテキストを入力して動画を作成する「Text-conditional Video Generation System」テキストから低解像度の動画（24x48ピクセル、16フレーム、3fps）を生成し、7つの拡散モデルをカスケードしてアップスケールするのが特徴最終出力は1280x768、24fps。5.3秒の動画を生成可能論文: Imagen Video : High Definition Video Generation with Diffusion Models

(imagen.research.google)

9 ポイント投稿者 xguru 2022-10-07 | 1件のコメント | WhatsAppで共有

Video Diffusion Modelにテキストを入力して動画を作成する「Text-conditional Video Generation System」
テキストから低解像度の動画（24x48ピクセル、16フレーム、3fps）を生成し、7つの拡散モデルをカスケードしてアップスケールするのが特徴
最終出力は1280x768、24fps。5.3秒の動画を生成可能
論文: Imagen Video : High Definition Video Generation with Diffusion Models

1件のコメント

xguru 2022-10-07

Imagen - Googleのtext-to-image diffusion model
Imagen-pytorch - Google ImagenをPytorchで実装
 Make-A-Video : テキストでビデオを生成するAI

Google、テキスト-to-ビデオAI「Imagen Video」を公開

関連記事

1件のコメント