Seaweed, short for "Seed-Video," is a foundation model for video generation that uses a diffusion transformer architecture with approximately 7 billion parameters. Trained with compute equivalent to 1,000 H100 GPUs, Seaweed learns world representations from large-scale multi-modal data, including video, image, and text. Given a text description, the model generates videos at various resolutions, aspect ratios, and durations, which makes it suitable for a wide range of applications.