TheAITraveler & MultiverseDreaming

Tags: Software Development, Creator Economy, AI
Published: August 29, 2023
Author: Randall Hand
So over the last few weeks I've made some more improvements to the overall system, and built up a totally new channel with a few changes. Thought I would discuss what I've done and learned here.

Video Improvements

I really wasn't happy with the videos. They seemed a bit "empty". With a bit of work, I was able to modify the encoding to use a multi-layer approach. Now each "slide" contains, bottom to top:
  • A static copy of the image, zoomed to cover 100% of the screen, then blurred and darkened
  • The slow zoom-in of the image
  • The overlay caption
Without that background I was seeing some strange artifacts where the zoomed image wouldn't increase in size, except behind the text. I'm pretty sure it's an internal moviepy bug, but this was an easy way to fix it that led to a better video anyway.
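
To make the layering a bit more concrete, here is a rough sketch of the geometry behind each slide. This is not the actual encoding code, and it deliberately avoids moviepy calls. It only computes the scale factors for the three layers. The 1080x1920 frame size, the 1.15 maximum zoom, and all the function names are assumptions made up for the illustration.

```python
from dataclasses import dataclass


@dataclass
class Size:
    width: int
    height: int


def cover_scale(image: Size, frame: Size) -> float:
    """Scale that makes the image fill the whole frame (edges get cropped)."""
    return max(frame.width / image.width, frame.height / image.height)


def fit_scale(image: Size, frame: Size) -> float:
    """Scale that fits the whole image inside the frame (no cropping)."""
    return min(frame.width / image.width, frame.height / image.height)


def zoom_at(t: float, duration: float, max_zoom: float = 1.15) -> float:
    """Linear slow zoom: 1.0 at the start of the slide, max_zoom at the end."""
    progress = min(max(t / duration, 0.0), 1.0)
    return 1.0 + (max_zoom - 1.0) * progress


def slide_layers(image: Size, frame: Size, t: float, duration: float) -> list:
    """Describe the three layers of one slide at time t, bottom to top."""
    return [
        # Static background: always covers 100% of the screen.
        {"layer": "background", "scale": cover_scale(image, frame),
         "blurred": True, "darkened": True},
        # Foreground image: starts fitted to the frame, then slowly zooms in.
        {"layer": "image", "scale": fit_scale(image, frame) * zoom_at(t, duration),
         "blurred": False, "darkened": False},
        # Caption overlay: drawn last so it sits on top of everything.
        {"layer": "caption", "scale": 1.0, "blurred": False, "darkened": False},
    ]


if __name__ == "__main__":
    # Hypothetical square source image on a vertical shorts frame.
    for layer in slide_layers(Size(1000, 1000), Size(1080, 1920), t=2.5, duration=5.0):
        print(layer)
```

The point of the background layer is that something always fills the frame, so whatever the zooming layer does, there is never an empty region behind it.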

A new Channel

Now that the system was mostly working, I decided to copy-paste it into a slightly new format. Behold:
This channel uses much of the same technology with a few changes:
  • ChatGPT generally chooses a topic itself. I simply give it a theme of "fable", "dream", "nightmare", "myth", etc. and let it do the rest.
  • All of the imagery is generated with DALL-E, OpenAI's image generation system.
  • Some encoding tweaks to cross-fade between slides. Given the "dream" nature of the channel, that just seemed more appropriate.
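
For the cross-fade, the main bookkeeping is that each slide has to start a little before the previous one ends. Below is a small sketch of that timing, independent of any video library. The function names and the fade length are assumptions for illustration, not the values the channel actually uses.

```python
def crossfade_schedule(durations, fade=0.5):
    """Return (start, end) times per slide so neighbours overlap by `fade` seconds."""
    if any(d <= fade for d in durations):
        raise ValueError("every slide must be longer than the fade")
    schedule = []
    start = 0.0
    for d in durations:
        schedule.append((start, start + d))
        # The next slide begins `fade` seconds before this one ends.
        start += d - fade
    return schedule


def total_length(durations, fade=0.5):
    """Length of the finished video once the overlaps are accounted for."""
    schedule = crossfade_schedule(durations, fade)
    return schedule[-1][1] if schedule else 0.0


def incoming_opacity(t, start, fade):
    """Opacity of a slide that begins fading in at `start`: ramps from 0 to 1 over `fade`."""
    return min(max((t - start) / fade, 0.0), 1.0)


if __name__ == "__main__":
    # Three hypothetical 5 second slides with a 1 second cross-fade.
    print(crossfade_schedule([5, 5, 5], fade=1.0))
    print(total_length([5, 5, 5], fade=1.0))
```

One side effect worth keeping in mind: every overlap shortens the final video by the fade length, so slide durations need a little padding to still land on the target runtime.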
It took a bit of tweaking, but it's starting to generate some truly impressive stuff. I was really impressed with this one:
Video preview

Hardware Upgrade

Prior to this weekend, I was running everything on a Raspberry Pi. It was OK, but had a lot of problems.
  • The microSD storage was pretty slow.
  • A 55s video took about an hour to encode.
  • My Nagios & SNMP system kept throwing warnings as I was maxing out memory, and occasionally the encode would get oom_killed.
So I found a super-cheap micro PC on Amazon: a CyberGeek Intel quad-core with 16 GB of RAM for $129 on sale (up a bit now). It comes with Ubuntu Linux pre-installed, and has been a HUGE upgrade. Shorts now encode in about 5 minutes, and I don't have nearly as many compiler errors as I did with outdated Raspberry Pi distributions.

Prompt updates

Previously, I was using YAKE to convert the script captions into search queries for images. That worked OK, but led to some really bizarre videos when an individual caption didn't carry enough context on its own. When the script used pronouns in a sentence like "It has several beautiful vistas", YAKE had no idea what "it" meant, so you wound up with really random results.
With a bit of prompt engineering, I got ChatGPT to return a result array that pairs each sentence with a recommended image search query. With that, results have improved significantly. It still sometimes does weird things, but far less often.
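
As a rough idea of what consuming that result array can look like, here is a minimal parsing sketch. The JSON shape, the field names `sentence` and `image_query`, and the sample reply are all invented for the example. The real prompt and response format may well differ.

```python
import json


def parse_script(reply: str) -> list:
    """Turn a JSON array reply into (sentence, image_query) pairs.

    Assumed shape (hypothetical): [{"sentence": "...", "image_query": "..."}, ...]
    """
    data = json.loads(reply)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of slides")
    slides = []
    for item in data:
        if not isinstance(item, dict):
            continue  # skip anything that is not an object
        sentence = str(item.get("sentence", "")).strip()
        query = str(item.get("image_query", "")).strip()
        if not sentence:
            continue  # nothing to caption, so drop the entry
        # Fall back to the sentence itself if no query came back.
        slides.append((sentence, query or sentence))
    return slides


if __name__ == "__main__":
    # Made-up sample reply, only to show the pronoun case.
    sample = json.dumps([
        {"sentence": "Banff is a national park in Canada.",
         "image_query": "Banff national park"},
        {"sentence": "It has several beautiful vistas.",
         "image_query": "Banff mountain vista"},
    ])
    for sentence, query in parse_script(sample):
        print(query, "<-", sentence)
```

The win over per-sentence keyword extraction is that the query for the second sentence can still name the subject, because the model sees the whole script rather than one caption at a time.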

Where next?

So this is where I could use some advice. What next? I have 2 channels running, one procedural (travel) and one experimental (dreaming), both producing 55s shorts multiple times a day, completely automated. What do I do next?