ThirdBrAIn.tech

❯

❯

❯

❯

❯

I built my own AutoGPT that makes videos

I built my own AutoGPT that makes videos

Apr 11, 20232 min read

auto-GPT
video-creation
AI-video-generator
AI-tools
scripting-automation
AI-content-generation
voice-cloning
video-editing-software
AI-APIs
automated-video-production
YT/2023/M04
YT/2023/W15

I built my own AutoGPT that makes videos

AI Nuggets

Detailed Instructions from the Video Transcript

Tools and APIs Used:

OpenAI’s GPT models

Pinecone DB (Vector Database)

Puppeteer (Headless Browser)

rey.so (Code Snippet Image Generation)

Giphy API (Animated GIFs)

11 Labs API (Voice Cloning)

Steps to Create an AI-Generated Video:

Script Writing with GPT-4:

Input a video idea to GPT-4.

GPT-4 writes a video script.

Asset Generation:

Use Puppeteer to navigate to rey.so and generate PNG images for code snippets.

Use GPT-4 to identify people or logos in the script and return a link to an image on the internet.

Download the identified images and save them to the file system.

Use GPT-4 to determine search terms for the Giphy API.

Use the Giphy API with an API key to obtain animated GIFs.

Voice Narration:

Clone a voice using the 11 Labs API with about two and a half hours of recordings.

Loop over each script file and make an API call to 11 Labs to generate the voice.

Save the generated voice as a WAV file.

Video Editing:

Initially, attempt to use GPT-4 to write an ffmpeg script to edit the video.

Due to reliability issues, write custom code with fluent-ffmpeg.

Loop over the assets and combine them into a single video.

CLI Commands and Tips:

Use Puppeteer for web scraping and automating browser tasks.

Use the Giphy API to fetch relevant animated GIFs.

Use fluent-ffmpeg for video editing and combining assets.

Additional Information:

The video titled “Rust in 100 seconds” was generated entirely by AI.

The video demonstrates the use of AI to create content that is indistinguishable from human-created content.

Important URLs:

rey.so: For generating PNG images of code snippets.

Giphy API: For fetching animated GIFs (exact URL not provided in the transcript).

11 Labs API: For voice cloning (exact URL not provided in the transcript).

Notes:

The code has not been released publicly due to concerns about its impact on humanity.

The quality of the AI-generated voice can still be improved.

Disclaimer:

Some of the capabilities mentioned, such as cloning voices, may have legal implications.

The creator of the video has expressed concerns about the potential impact of releasing the code on humanity.

Graph View

I built my own AutoGPT that makes videos
Detailed Instructions from the Video Transcript

🟡 Loading DB...

GitHub Discord Community Obsidian

Created with Quartz v4.5.0 © 2025