Weekly AI report

The most interesting news from the last week

The Symbiopreneur
October 20, 2023

Read time: 6 minutes | Sponsor this newsletter

Welcome to the weekly AI report

In this week's AI news:

Figure 01: Setting New Standards in Humanoid Robotics
DeepFake is Never Easier Thanks to FaceFusion
Real-time AI conversations are here
Google's Green Light: AI-Driven Traffic Lights for Sustainable Cities
The Techno-Optimist Manifesto
Google Image generation directly in the search bar
Learning AI from the very beginning (Google released 9 free courses)
DALL·E 3 is now available in ChatGPT Plus and Enterprise

Meet Figure 01: A Cool New Robot!

Have you heard about Figure 01? It's a cool new robot made by a company called Figure Inc. They built this amazing robot in just one year and with $70 million. That's super fast!

Walking Like a Pro

Figure 01 can walk on two legs just like us, and it's really good at it. Instead of being told where to go, it knows how much force to use in its legs to walk around. That means it can handle real-world stuff like walking on uneven ground.

Super Flexible

What's really cool is how flexible Figure 01 is. It has 41 ways it can move its body parts like arms, legs, and even fingers. That makes it super useful for different tasks.

Talking to Humans

This robot isn't just good at moving; it's also good at talking to people. It has a screen on its head to chat with us, which is pretty neat!

What’s Next?

The team behind Figure 01 is already thinking about what to do next. They've even built a special warehouse for it. So keep an eye on this robot; it's going to do some big things!

DeepFake Never Easier Thanks to FaceFusion

Riley Brown, an X user, shocked the world by revealing his discovery: FaceFusion. This is a DeepFake software that runs locally and at incredible speeds. All you need to create a deep fake is a video and a single photo of the person you want to mimic! Even the creators of the software acknowledge its potential for unethical use. In their GitHub repository disclaimer, they state, 'We acknowledge the unethical potential of FaceFusion and are resolutely dedicated to establishing safeguards against such misuse.

Ok THIS IS A DEEP FAKE of @garyvee for educational purposes....
This software was FREE, run locally on my CPU, and it took 2min 30 seconds to render..
But we need to start having a conversation about this...
This is going to get bad....
On the other hand. I'm going to be… twitter.com/i/web/status/1…
— Riley Brown (@rileybrown_ai)
6:01 AM • Oct 17, 2023

This is so cool but also kind of scary. Even the people who made FaceFusion said, "We know this can be used in bad ways." They're working on ways to keep people from misusing it.

Here is the GitHub repo. Don't engage in any immoral activities!

Real-Time AI Chats are Here with PlayHT 2.0 Turbo!

Realtime AI Conversations are here!
Introducing PlayHT 2.0 Turbo ⚡️
Our new blazing fast Conversational AI Text-to-Speech model with <300ms latency!
✅ Input text streaming from LLMs
✅ Output audio streaming
✅ Clone any voice & accent
Try here - play.ht/playground/
— PlayHT (@play_ht)
8:48 PM • Oct 17, 2023

PlayHT presented PlayHT 2.0 Turbo! It's their fastest voice AI ever. It turns your text into speech in less than a second!

What's New?

Super quick text-to-speech in under 300 milliseconds!
You can stream text input and get audio output right away.
It's like talking to a real human—their AI is that good!

Why It's Cool

Try it out on their Playground.
Use their new code toolkits for Python and NodeJS.

Make It Your Own

You can clone any voice or accent.
Add emotion to your text. Make it sound happy, sad, or anything in between.
Test it all on their Playground without having to code anything!

Get Started!

If you're looking to create AI that sounds like a real human, start using PlayHT 2.0 Turbo.

Google's Green Light: AI-Driven Traffic Lights for Sustainable Cities

Traffic congestion at city intersections contributes significantly to pollution and greenhouse gas emissions. Traditional methods of managing this issue are costly and inefficient.

The Solution: Green Light

Utilizing AI algorithms and Google Maps data, Green Light offers a sophisticated yet straightforward way to manage urban traffic. Preliminary tests show a 30% reduction in stops and a 10% decrease in emissions.

How it Works

Data Collection: Green Light uses existing city mapping to understand current traffic light configurations.
Traffic Modeling: Analyzes traffic patterns, including wait times and flow, to generate a model.
Recommendations: Suggests actionable changes to light timing that city engineers can implement quickly.
Impact Analysis: Measures and reports on the effectiveness of the implemented changes, allowing for ongoing improvements.

Why Choose Green Light?

Cost-Effective: No new hardware needed.
Automated Monitoring: Continual assessment of traffic patterns.
Trusted Data Source: Based on Google Maps.
User-Friendly Interface: Easy-to-read reports and recommendations.

Green Light is operational in 70 intersections across 12 global cities, showing promising results in reducing emissions and improving traffic flow. If you’re a city planner or engineer, consider joining the Green Light initiative.

The Techno-Optimist Manifesto

The Techno-Optimist Manifesto -- please read and Ask Me Anything! Post questions as replies to this xeet.
— Marc Andreessen -- e/acc (@pmarca)
2:49 PM • Oct 16, 2023

The Techno-Optimist Manifesto is a compelling argument for embracing technology as a catalyst for human advancement and liberty. It challenges prevailing societal attitudes that inhibit progress, advocating instead for ambition, innovation, and constructive risk-taking. Concluding with a resounding call to action, the manifesto urges us to seize the opportunities technology offers to create a future of abundance, adventure, and fulfillment. It is an essential read for anyone intrigued by the untapped potential of our symbiotic relationship with technology.

You can read The The Techno-Optimist Manifesto here.

Google Introduces Generative AI Features in Search

In a recent update, Google has expanded its search capabilities to include generative AI, allowing users to create custom images directly within the search interface. This feature is currently in the testing phase and available to users who opt into the SGE experiment.

How It Works

Users interested in accessing this feature can enable it through Google's experimental labs platform. Once activated, you can type a detailed search query to generate an image. For example, a search query like "Draw an image of a capybara wearing a chef's hat and cooking breakfast" can produce up to four AI-generated images matching the description.

Implications

This new capability aims to aid users in creative processes, offering a convenient tool for visualizing ideas without needing external software. It may prove beneficial for a range of users, from designers and content creators to educators and marketers.

Safety Measures

Google is taking steps to ensure responsible use of this technology by prohibiting the creation of harmful or misleading images. Generated images will also come with metadata labeling and embedded watermarking to indicate their AI-generated nature.

Availability

The feature is currently only available to U.S. users and in the English language. User feedback will be collected to refine the offering further.

For more in-depth analysis of AI's role in everyday life and its future potential, follow our regular updates.

You can read more about it here.

Learning AI from the very beggining (Google released 9 free courses)

You can find the courses here.

DALL·E 3 is now available in ChatGPT Plus and Enterprise

OpenAI has announced the release of DALL·E 3, an advanced image-generating model integrated into its ChatGPT Plus and Enterprise services. This update allows users to generate images through simple text-based conversations, offering a multitude of design possibilities in real-time.

Key Features

Real-Time Image Generation: Describe a concept or idea, and DALL·E 3 will instantly generate multiple image variants.
User Interaction: Users can ask for real-time revisions, allowing for a dynamic creative process.
Safety Measures: OpenAI has implemented a multi-tiered safety system to prevent the generation of harmful or misleading imagery.

Technological Advancements

DALL·E 3 represents significant progress in image generation, featuring more visually striking and detailed images compared to its predecessor. The model can handle complex prompts and supports both landscape and portrait aspect ratios.

User Feedback and Safety

OpenAI encourages users to provide feedback on generated outputs for continual improvement. The safety mitigation stack prevents potentially harmful image generation and has undergone thorough testing to cover edge cases.

Future Directions

OpenAI is also working on a provenance classifier, an internal tool aimed at identifying whether an image was generated by DALL·E 3. The classifier has shown promising results in early evaluations and may become part of a broader strategy to identify AI-generated content.

Creative Controls and Ethical Use

DALL·E 3 is programmed to decline requests mimicking the style of living artists, and OpenAI offers an opt-out option for creators who don't wish their work to be included in future model training.

With DALL·E 3, OpenAI continues to push the boundaries of what is possible in the realm of generative AI, all while considering the ethical implications and ensuring a safe user experience.

That is all for this week's Weekly AI report. If you liked this one, be sure to follow me on X and LinkedIn. Until the next Friday!