Minimax AI - компания по созданию видео с помощью искусственного интеллекта в Китае

Минимаксный искусственный интеллект

Minimax AI Company Overview

MiniMax AI is developing AI large-scale modeling technologies, infrastructure builders, and content application solutions. The latest foray into generative AI by the Alibaba- and Tencent-backed unicorn startup, MiniMax is dedicated to the development of General Artificial Intelligence (AGI) engine systems, which was founded in 2021 and is headquartered in Shanghai, China. One of its main products is a text-to-video generator that has made a splash for its ability to generate hyperrealistic footage of humans, including accurate hand movements.

Minimax AI Products

Video Generation Model: video-01

Video-01 is AI model that can generate high-resolution videos from text instructions, supporting a resolution of 1,280 x 720 pixels at 25 frames per second. Videos are currently limited to six seconds. Video-01 offers various styles, including anime, CGI, and video game graphics. The model shows relatively few image errors or artifacts and even seems capable of displaying text in videos. MiniMax video-01 is a good model, roughly equivalent to Luma Labs Dream Machine but not as good as Runway Gen-3

Music Generation Model: Music-01

Music-01 is text-to-music ai model, Key features include:

  • Highly anthropomorphic music generation: This model crafts intricate and emotional musical compositions, making it ideal for various creative scenarios and offering significant flexibility and innovation in music creation.
  • Multi-style support: The model adeptly handles a wide range of music styles — from traditional instruments to modern electronic music, and from Chinese classical to Western pop.

Text Large Model: abab 6.5s

abab 7 supports efficient training of vast datasets, significantly enhancing practicality and response speed while drastically reducing training and reasoning costs for large models. Compared to the traditional Transformer architecture, this new architecture cuts costs by over 90% at a sequence length of 128K, with even greater advantages as the sequence length increases.

Voice big model: speech-01

Variety of high-quality hyper-anthropomorphic tones, next-generation voice generation capabilities.

Поделиться

Добавить комментарий

Ваш адрес email не будет опубликован. Обязательные поля помечены *