The Gorilla and the Giraffe Walk Into a Bank

Fine-tuning a LoRA on Akira and discovering that style transfer is basically just theft, but a really good kind.

Been fine-tuning a cinematic LoRA.

The test prompt is a story about a gorilla and a giraffe who go on a bank robbing spree — which is, arguably, the correct prompt for evaluating any image model. If your model can't render armed primates in a heist scenario with a giraffe getaway driver, what are you even doing.

The original scenario image is fine. Competent. It has a gorilla, it has a giraffe, they appear to be up to something.

Then you run the same prompt through the Akira model.

And that's when it becomes a different kind of problem — the kind where you close your laptop, walk around the room, and come back to check if you actually saw what you think you saw. The heavy ink lines. The speed. The exact way neon reflects off wet pavement at 2am in a city that doesn't technically exist. The gorilla looks like it should be on a three-story wall mural in Neo-Tokyo.

This is the thing about style transfer that nobody talks about seriously enough: you're not really generating an image, you're excavating the visual logic of a human artist — the decisions Katsuhiro Otomo made in the eighties about how shadow works, how motion works, how dread works — and then repurposing that logic to render a giraffe holding a duffel bag full of cash.

Otomo didn't consent to this. Neither did the gorilla.

The Akira LoRA is cracking, though. That's the word for it. Not good, not impressive — cracking. Which is its own category.

The Gorilla and the Giraffe Walk Into a Bank

Counterpoints