Multi-modal models that can process both ...
OpenAI had been stung by the release of Google’s Nano Banana image models, and it appears that it has now come up ...
The arrival of OpenAI’s DALL-E 2 in the spring of 2022 marked a turning point in AI, when text-to-image generation suddenly became accessible to a select group of users, creating a community of ...
Generative artificial intelligence startup Stability AI Ltd. today announced the release of Stable Diffusion 3.5, which includes three next-generation open-source text-to-image AI model variants. The ...
Apple researchers presented UniGen 1.5, a system that can handle image understanding, generation, and editing within a single ...
Following the release of its DeepSeek-R1 AI model, which has taken the world by storm, DeepSeek has also introduced Janus Pro, a new open-source multimodal AI image generator that combines ...
Chinese cloud provider Alibaba has released four versions of its video-generation AI model as open source, allowing users to download and run them for free on capable PCs. The Wan2.1 text-to-video ...
After announcing the new Imagen 3 image generation model at Google I/O in May, Google has now made the text-to-image model available to all Gemini users. On Wednesday, Google said Imagen 3 "is now ...
ElevenLabs, an AI startup that just raised a $180 million mega-funding round, has been primarily known for its audio-generation prowess. The company took a step in another technological direction by ...