• Pivot 5
  • Posts
  • Black Forest Labs: new text-to-image model

Black Forest Labs: new text-to-image model

Pivot 5: 5 stories. 5 minutes a day. 5 days a week.

1. Black Forest Labs: new text-to-image model

Black Forest Labs

Black Forest Labs has launched its FLUX.1 suite of generative deep learning models for text-to-image synthesis. The team, consisting of AI researchers and engineers, aims to advance the field and make generative AI a fundamental building block for future technologies.

The FLUX.1 suite includes models for VQGAN, Latent Diffusion, Stable Diffusion, and Adversarial Diffusion Distillation. The company has secured a Series Seed funding round of $31 million, with angel investors and follow-up investments from General Catalyst and MätchVC. The FLUX.1 models are designed for various applications, including text-to-image synthesis, and are available for free.

Read the full story here

2. Gemini 1.5 Pro (Experimental 0801) has been tested

For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive score of 1300 (!), and also achieving #1 on our Vision Leaderboard.

Gemini 1.5 Pro (0801) excels in multi-lingual tasks and delivers robust performance in technical areas like Math, Hard Prompts, and Coding.

Read the full story here

3. Runway announces even faster, cheaper AI video model Gen-3 Alpha Turbo

Runway, a New York-based startup, has announced the launch of a faster version of its Gen-3 Alpha model, Gen-3 Alpha Turbo. The Turbo model is 7x faster than the original Gen-3 Alpha and allows users to generate new videos in real-time, producing a 10 second video in 11 seconds.

The company aims to maintain its lead in offering realistic, Hollywood-quality generative AI video models, despite competition from other startups like Pika Labs, Luma AI, Kling, and OpenAI's Sora. Runway is also working on an update to its mobile app to include support for image-to-video with Gen-3 Alpha.

Read the full story here

4. Google announces new open AI model Gemma 2 2B; Aims to beat other generative AI models

Google has released the new open-source AI model Gemma 2 2B, a compact yet more efficient language model designed for tasks like text generation, translation, and answering questions. The model has outperformed competitors like ChatGPT 3.5 in terms of speed and conversational abilities.

Google has also introduced new AI tools, ShieldGemma, and Gemma Scope, designed to enhance AI capabilities while prioritizing safety and transparency. The release comes after the U.S. Commerce Department endorsed open AI models, highlighting the need for capabilities to monitor potential risks.

Read the full story here

5. Sharks are congregating at a California beach. AI is trying to keep swimmers safe

 

Padaro Beach in California has launched SharkEye, an initiative using drones to monitor the presence of juvenile great white sharks. The initiative alerts locals, including lifeguards and surf shop owners, when a shark is spotted.

The technology uses video and AI to analyze shark behavior and train a computer vision machine learning model to detect great white sharks. Shark attacks are rare, with 69 people globally affected in 2023.

Read the full story here

Advertise with Pivot 5 to reach influential minds & elevate your brand

Get your brand in front of 80,000+ businesses and professionals who rely on Pivot 5 for daily AI updates. Book future ad spots here.