Putting the 'role' back in role-playing games since 2002.

Bringing D&D/AD&D campaign settings to life with Stable Diffusion

deuxhero

Arcane
Joined
Jul 30, 2007
Messages
11,401
Location
Flowery Land
Can you try this one? One of Eberron's major characters suspiciously lacks any art (the other national leaders all have art somewhere).

Five Nations said:
Eleven-year-old Jaela Daran came from humble origins. [...] Jaela herself seems rather humble, modest, and meek for a young girl whose pronouncements alter the history of a nation. [...] Jaela usually dresses in simple gray or black clothes, walking barefoot on the marble steps of the Cathedral. She has gray eyes, short-cropped dark hair, and a chocolate-colored complexion [...] she carries the burden of an entire nation on her slim shoulders

(her equipment entry also mentions she wears a silver arrowhead as a holy symbol)
 

Zed Duke of Banville

Dungeon Master
Patron
Joined
Oct 3, 2015
Messages
11,901
"Eleven-year-old Jaela Daran came from humble origins. [...] Jaela herself seems rather humble, modest, and meek for a young girl whose pronouncements alter the history of a nation. [...] Jaela usually dresses in simple gray or black clothes, walking barefoot on the marble steps of the Cathedral. She has gray eyes, short-cropped dark hair, and a chocolate-colored complexion [...] she carries the burden of an entire nation on her slim shoulders"
"(her equipment entry also mention she wears a silver arrowhead as a holy symbol)"

Stable Diffusion can't even handle weapons correctly, so I didn't bother trying to manage a specific holy symbol. Putting the other characteristics into the prompts without being specific about age and using the common Mucha/Rutkowski/Artgerm combination yielded this portrait in the first batch of five images:
00006.png



Some of the images show her barefoot, but this kind of thing is probably a distraction from the more important prompts given the limited resolution of the output. Some other possibilities (2nd for an older version):
00041.png
00028.png



Specifying 11 years old seems to result in the images more accurately capturing that age but at the expense of other characteristics:
00059.png
00066.png

00070.png
00078.png
 

Non-Edgy Gamer

Grand Dragon
Patron
Glory to Ukraine
Joined
Nov 6, 2020
Messages
14,946
Strap Yourselves In
Zed Duke of Banville if you intend to spend any significant amount of time on this, you owe it to yourself to check out the Krita plugins for SD.
https://github.com/sddebz/stable-diffusion-krita-plugin

Basically turns Krita into Photoshop on crack. If there's something you don't like about an image, select that part, feather the selection to blend in the result and either alter the prompt, or scribble whatever you want there and regenerate it.

It's especially nice for fixing things like eyes, even if only as a first step to fix them with GFPGAN later.
 

Zed Duke of Banville

Dungeon Master
Patron
Joined
Oct 3, 2015
Messages
11,901
I've been using Stable Diffusion's text2img for other, non-gazetteer-related images, and have been attempting to use its img2img function as well. I also intend to work out how to improve Stable Diffusion's output using these kinds of plug-ins or add-ons, but the point of this thread was to demonstrate that text2img can be used to create, fairly quickly and fairly easily, worthwhile portraits reflecting the characters they are intended to represent.


GAZ8 The Five Shires returns to demi-humans, this time with the realm of the hobbits halflings. Since I've never cared for halflings, and this gazetteer was written by Ed Greenwood, I couldn't bring myself to create more than four portraits, even though there is an extended section on notable personages:

Jaervosz Dustyboots, one of the five sheriffs of the shires:
STdspfh.png



Joam Astlar, an adventurer:
0p9K2GC.png



Meermeera Jollybars, a full-figured and merry brunette:
mYZuLC7.png



Shandysar Lollos, a female ex-pirate now wandering the shires:
Tsffl8o.png




GAZ9 The Minrothad Guilds is unfortunately another gazetteer about a country lacking a firm identity and only containing details for a few people, but unlike Ierendi this gazetteer was also rather boring and the first to go out of print. For this one, I created two portraits each for four characters, with one portrait in the usual Mucha/Rutkowski/Artgerm combination and the other portrait based on John Singer Sargent, Eugene Galien-Laloue, and Edouard Leon Cortes:

Nosmo Beldan, a wealthy merchant:
eDX1QGk.png
toSNt6R.png



Harmon Caetros, a guild agent in Karameikos:
dzvkPSk.png
RGgxiIk.png



Ariana Demerick, a female pirate:
HvS3cmB.png
lFykxde.png



Generic male pirate:
cOeaOg0.png
JRs4rzv.png
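
The two-style comparison above could be scripted along these lines. This is only a rough sketch: the prompt template, the full artist names as spelled here, and the style labels are assumptions for illustration, not the exact prompts used for these portraits.

```python
from itertools import product

# Style labels are hypothetical; the artist groupings come from the post.
STYLES = {
    "mucha_rutkowski_artgerm": ["Alphonse Mucha", "Greg Rutkowski", "Artgerm"],
    "sargent_laloue_cortes": [
        "John Singer Sargent",
        "Eugene Galien-Laloue",
        "Edouard Leon Cortes",
    ],
}

# Character descriptions as given in the post.
CHARACTERS = {
    "Nosmo Beldan": "a wealthy merchant",
    "Harmon Caetros": "a guild agent",
    "Ariana Demerick": "a female pirate",
    "Generic pirate": "a male pirate",
}


def build_prompts(characters, styles):
    """Return one prompt per (character, style) pair."""
    prompts = {}
    for (name, description), (style, artists) in product(
        characters.items(), styles.items()
    ):
        # Assumed template: subject first, then the artist combination.
        prompts[(name, style)] = (
            f"portrait of {description}, art by {', '.join(artists)}"
        )
    return prompts


if __name__ == "__main__":
    for key, prompt in build_prompts(CHARACTERS, STYLES).items():
        print(key, "->", prompt)
```

Keeping the subject text identical across both styles makes it easier to see what the artist names alone contribute.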
 

Zed Duke of Banville

Dungeon Master
Patron
Joined
Oct 3, 2015
Messages
11,901
GAZ10 The Broken Lands poses a greater challenge since the inhabitants are humanoids: trolls, ogres, gnolls, kobolds, goblins, hobgoblins, bugbears, and three kinds of orcs. Stable Diffusion handles several of these names poorly in prompts:
  • Gnoll seems not to be interpreted, but "hyena humanoid" works well
  • Kobold seems not to be interpreted, and since kobolds have always been depicted as scaly dog-creatures it's extremely difficult to obtain something that looks right
  • Hobgoblin seems not to be interpreted, but they can be considered larger goblins anyway
  • Bugbears seem to be drawing on art that depicts them as bear-men rather than larger goblinoids with a vague resemblance to bears
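
The workarounds in the list above amount to a substitution table applied to the prompt before it is submitted. A minimal sketch, assuming whole-word replacement is enough: "hyena humanoid" is taken from the list, "large goblin" is an assumed paraphrase of "larger goblins", and kobolds and bugbears are left out because the list reports no working substitute for them.

```python
import re

# Only terms with a workaround mentioned in the list are included.
SUBSTITUTIONS = {
    "gnoll": "hyena humanoid",    # reported to work well
    "hobgoblin": "large goblin",  # assumed wording, not a tested prompt
}


def rewrite_prompt(prompt, table=SUBSTITUTIONS):
    """Replace whole-word monster names with terms the model seems to interpret."""
    # Longest keys first so a longer name wins over any shorter overlapping key.
    keys = sorted(table, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda m: table[m.group(1).lower()], prompt)


if __name__ == "__main__":
    print(rewrite_prompt("portrait of a gnoll pasha"))
```

Plurals such as "gnolls" are deliberately not matched here; they would need their own entries.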
Nonetheless, it was possible to mostly achieve decent results for the leaders of the Broken Lands tribes as described in the gazetteer:

Haa'k Hordar, the Troll Queen:
kYHa5mC.png



Alebane, Chief of the Ogres:
1ny1uMu.png



Nizam, Pasha of the Gnolls:
5HGD1Dg.png



Kol XV, High Doge of the Kobolds:
jItZ26S.png



Doth, King of the Goblins, is really the powerless consort of Queen Yazar:
t7Gefdf.png



Yazar, Queen of the Goblins, is a powerful warrior, goes about scantily dressed, is beautiful, and yet is a goblin; this might capture three of the four:
s0bNhmz.png



Hutal-Khan, of the Hobgoblins:
K03sWDF.png


Ohr'r, Chief of the Bugbears, shouldn't resemble a bear this much but looks somewhat cool:
UKIXg7b.png



Hoolg Red-Mane, Chief of the Red Orcs:
AuJ4cKi.png



Moghul-Khan, of the Yellow Orcs:
2LiktmQ.png



Thar, Chief of the Common Orcs and King of the Broken Lands:
WCMfmhV.png
 

Dexter

Arcane
Joined
Mar 31, 2011
Messages
15,655
Can this shit generate ink or pencil pictures?
Yes, you can even indulge in your Alien Horror scenes or pornographic fantasies as Japanese wood carvings or Renaissance Stained Glass Windows, you just have to specify the art style:
1662183234015226.jpg
 

Zed Duke of Banville

Dungeon Master
Patron
Joined
Oct 3, 2015
Messages
11,901
GAZ11 The Republic of Darokin details a confederation of former city-states dominated by merchant houses. Although the depiction of Darokin in this gazetteer is English in many ways, the earlier inspiration for this country seems to have been Renaissance Italy, and it also bears similarities to the Low Countries, so it is perhaps suited for a baroque art style based on painters such as Rembrandt and Caravaggio. As with some of the other gazetteers, there isn't a proper notable personages section, although it does detail a few scattered people. I also created a few scenic images with a 50% greater height or width, depending on the subject:

Member of the Darokin Diplomatic Corps:
t0AIHxr.png



Female merchant:
ElAwWqw.png



Beggar who engages in criminal activity on the side:
3zatp4g.png



Female cleric:
50US2K5.png



City Market:
1yv5BOP.png



Church exteriors and interiors:
m0mCSYx.png
GNMVREk.png
tv7kfEe.png

g34Hyr8.png
93b5hfn.png
sms16s2.png



Palace exteriors and interiors:
wE6j2G2.png
tFxVk1U.png
LIx8FDY.png
qtL3MvH.png
HELSPUX.png
0nO053g.png



Itheldown Castle, a cursed and haunted ruin, in the Mucha/Rutkowski/Artgerm style:
SNm2mcF.png
 

Zed Duke of Banville

Dungeon Master
Patron
Joined
Oct 3, 2015
Messages
11,901
The Dawn of the Emperors Box Set was published in 1989, describing the empires of Thyatis and Alphatia. Thyatis is based on the Roman or Byzantine empires, and therefore is suited for a style based on painters such as John William Waterhouse, Frederic Leighton, and Lawrence Alma-Tadema, who frequently depicted classical settings.

Emperor Thincol Torion, with dark brown hair, black eyes, tall, muscular, hawklike features, and a purple gold-lined toga (though he should be clean-shaven):
GALoNMl.png



Empress Gabriela Torion, middle-aged, black hair, brown eyes, careworn and depressed:
eBmxDwN.png



Prince Eusebius Torion, brown hair, brown beard, brown eyes, tall, muscular, craggy features, wearing Roman armor, impassive, military bearing, cold:
pHafksM.png



Princess Stefania Torion, with red hair, blue eyes, and green garb:
Hlr9Ctd.png



Demetrion Karagenterpolus, the Imperial court wizard, white hair, white beard, white robes with red trim, elderly, honorable and trustworthy:
TQVmCOP.png



Anaxibius, a gladiator, black hair, black eyes, tall, muscular, wearing Roman gladiator armor:
gd4XUOS.png



The Coliseum of Thyatis City (all four images from one batch of five images):
JxyZVwY.png
J8xag2q.png

qd52sHl.png
KFIboNc.png



Public baths:
tGcVRYD.png
42YJezH.png



The Great Library of Thyatis:
4Wvu1GW.png
ayon2h0.png

szhbSQ1.png
QzRWk9u.png



The Imperial Palace, a five-story building, huge and luxurious:
usQLs3H.png
clxUAbX.png

dTE9iOl.png
5dGmpkV.png
 

Lagole Gon

Arcane
Patron
Joined
Nov 4, 2011
Messages
7,292
Location
Retaken Potato
Insert Title Here RPG Wokedex Codex Year of the Donut Pathfinder: Wrath
Can this shit generate ink or pencil pictures?
Yes, you can even indulge in your Alien Horror scenes or pornographic fantasies as Japanese wood carvings or Renaissance Stained Glass Windows, you just have to specify the art style:
1662183234015226.jpg

It all looks like semi-animu.
Can this shit generate GOOD ink or pencil pictures?

Something like...

Frank Frazetta:
CAST3.jpg


Mark Schultz:
mark-schultz-mark-schultz-robert-e.-howards-conan-of-cimmeria:-1932-1933-vol.-1-original-art-(wandering-star.jpg
 
Joined
Jan 14, 2018
Messages
50,754
Codex Year of the Donut
1663.png

313.png


it sucks at hands & weapons though, definitely needs to be trained on those more
 

Dexter

Arcane
Joined
Mar 31, 2011
Messages
15,655
It all looks like semi-animu.
Can this shit generate GOOD ink or pencil pictures?

That's because the guy who made that picture to demonstrate chose a semi-anime style. I did a bunch of Frazetta-inspired stuff, along with Luis Royo and Kentaro Miura, in the General Gaming thread:
Ana-de-Armas-Fantasy-by-Luis-Royo.jpg
Scarlett-Johansson-Fantasy-by-Luis-Royo.jpg

Scarlett-Johansson-Fantasy-by-Frank-Frazetta-Kentaro-Miura.jpg


00003-50-k-lms-2523544200.png
00007-50-k-lms-3551332941.png
00010-50-k-lms-3551332944.png
00019-50-k-lms-3551332953.png
00024-50-k-lms-3933925620.png
00030-50-k-lms-3933925626.png
00032-50-k-lms-3933925628.png
00005-50-k-lms-3551332939.png


Some more on the fantasy side with different modifiers and character classes:
00001-50-k-lms-1207806503.png
00004-50-k-lms-1207806506.png
00000-50-k-lms-1758006704.png


00000-50-k-lms-3263220990.png
00002-50-k-lms-3263220992.png
00000-50-k-lms-713508853.png
00002-50-k-lms-713508855.png
00004-50-k-lms-713508857.png
00001-50-k-lms-346474401.png
00002-50-k-lms-346474402.png
00004-50-k-lms-346474404.png


00000-50-k-lms-1923435543.png
00001-50-k-lms-1923435544.png
00002-50-k-lms-1923435545.png
00003-50-k-lms-1923435546.png
00005-50-k-lms-2498225060.png
00006-50-k-lms-2498225061.png

I can try to do some Barbarians for you, starting with:
"Close portrait of a male human barbarian (((Conan))) DnD pencil/ink art by Frank Frazetta"
grid-0357.png
grid-0358.png

Okay, so what happened here? Frazetta is a good artist and a strong vector, as are Conan (which I chose to emphasize) and DnD, so this already gives some pretty good non-cherrypicked results. He's good with body proportions, composition and attire, but he's not the strongest with faces.

So what do we do? We complement him with other strong vectors to guide the AI toward better results. Things like Conan, DnD, LotR and similar can be used as a style guide, but if we want better faces we need a good portrait artist, e.g. John William Waterhouse: https://www.wikiart.org/en/john-william-waterhouse/
"Close portrait of a male human barbarian (((Conan))) DnD LotR pencil art by John William Waterhouse and Frank Frazetta"
"Close portrait of a male human barbarian (((DnD))) LotR ink art by John William Waterhouse and Frank Frazetta"

Here are our next non-cherrypicked results:
More Conan:
grid-0353.png

grid-0354.png


More DnD:
grid-0356.png
grid-0355.png


Adding in some Crosshatching for Style, more Conan:
grid-0352.png
grid-0351.png


More DnD:
grid-0350.png
grid-0349.png

If we want an even stronger vector for the image, with an even more distinct and clear face, a well-known and often-photographed celebrity is useful, so we can try something like:
"Close portrait of Arnold Schwarzenegger as Conan the Barbarian ink art by John William Waterhouse and Frank Frazetta"
grid-0362.png
grid-0361.png


If we add DnD/LotR:
grid-0359.png
grid-0360.png

It's all about what you put in the prompt and how you balance, emphasize and complement things.
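
The prompts above can also be assembled programmatically. A minimal sketch, assuming the AUTOMATIC1111 web UI convention in which each pair of parentheses multiplies a term's attention by 1.1 (so triple parentheses come to roughly 1.33); other front ends may treat parentheses differently, and the helper names are made up for illustration.

```python
def emphasize(term, levels=1):
    """Wrap a term in `levels` pairs of parentheses, e.g. (((Conan)))."""
    return "(" * levels + term + ")" * levels


def emphasis_weight(levels, factor=1.1):
    """Approximate attention multiplier, assuming 1.1x per parenthesis pair."""
    return factor ** levels


def build_prompt(subject, modifiers, medium, artists):
    """Join subject, style modifiers, medium and artists into one prompt string."""
    return f"{subject} {' '.join(modifiers)} {medium} by {' and '.join(artists)}"


if __name__ == "__main__":
    prompt = build_prompt(
        "Close portrait of a male human barbarian",
        [emphasize("Conan", 3), "DnD", "LotR"],
        "pencil art",
        ["John William Waterhouse", "Frank Frazetta"],
    )
    print(prompt)
    print(round(emphasis_weight(3), 3))
```

Swapping which modifier gets wrapped by emphasize is all it takes to go from the "more Conan" variant to the "more DnD" one.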

What it can't do well (yet) is action shots and the like, or images where several objects have complex relations to or interact with one another. For instance, you'll usually get garbage if you try to replicate the image above by typing something like:
"Male human barbarian holding a sexy Amazon woman is jumping for a vine in the jungle with an Aztec temple and trees in the background pencil art by Frank Frazetta"
grid-0363.png


Even if you add several style vectors:
grid-0365.png
 

Lagole Gon

Arcane
Patron
Joined
Nov 4, 2011
Messages
7,292
Location
Retaken Potato
Insert Title Here RPG Wokedex Codex Year of the Donut Pathfinder: Wrath
Welp. I must admit, it did much better than I expected. It has a strong soulless uncanny valley whiff about it, but it is impressive.
I guess next step is to make AI paint over simple 3d shapes. If somebody does that I can see this shit exploding.

Art students on suicide watch.

LimpUnequaledChafer-max-1mb.gif


I'm trying to make a Justin Sweet/Vance Kovacs style Icewind Dale portrait, but it seems the AI is mostly inspired by shitty fanart.
 

Non-Edgy Gamer

Grand Dragon
Patron
Glory to Ukraine
Joined
Nov 6, 2020
Messages
14,946
Strap Yourselves In
The hands kind of ruin every picture for me.
Until it's properly trained on them, a temporary solution is to put "hands" in as a negative prompt, causing the AI to try to stay away from drawing them. So you'll get more shots with the hands out of frame, behind their back etc.
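
For front ends that take the negative prompt as a separate field, the idea can be sketched like this. The GenerationRequest class and the avoid helper are hypothetical names for illustration, not part of any particular tool.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class GenerationRequest:
    """Hypothetical container for the two text fields of a generation job."""
    prompt: str
    negative_prompt: str = ""


def avoid(request, *terms):
    """Return a copy of the request with extra terms in the negative prompt."""
    existing = [t.strip() for t in request.negative_prompt.split(",") if t.strip()]
    for term in terms:
        if term not in existing:  # skip terms that are already listed
            existing.append(term)
    return replace(request, negative_prompt=", ".join(existing))


if __name__ == "__main__":
    request = avoid(GenerationRequest("portrait of a barbarian"), "hands")
    print(request)
```

With Hugging Face diffusers, for example, the two strings would be passed as the prompt and negative_prompt arguments of the pipeline call.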
 

Dexter

Arcane
Joined
Mar 31, 2011
Messages
15,655
It has a strong soulless uncanny valley whiff about it, but it is impressive.
A lot of the double heads/double arms/sword-in-the-wrong-place artifacts are because the image isn't 512x512, which is the resolution the model was trained at and what Zed has been using. But square portraits are kind of boring, so it's usually fine if only every 2nd or 3rd image is usable; just make larger batches.
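
As a back-of-the-envelope sketch of the "make larger batches" advice (the function names are made up, and the keep rate is just the every-2nd-or-3rd estimate above, not a measured figure):

```python
NATIVE_SIDE = 512  # training resolution mentioned above


def is_native_size(width, height, side=NATIVE_SIDE):
    """True only for the square size the model was trained at."""
    return width == side and height == side


def batch_size_needed(wanted, one_in_n):
    """Images to generate if roughly one in every `one_in_n` is usable."""
    if wanted < 0 or one_in_n < 1:
        raise ValueError("wanted must be >= 0 and one_in_n must be >= 1")
    return wanted * one_in_n


if __name__ == "__main__":
    width, height = 512, 768  # a non-square portrait
    if not is_native_size(width, height):
        # Pessimistic case: only every 3rd image is usable.
        print(batch_size_needed(4, 3), "images for 4 keepers")
```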
I guess next step is to make AI paint over simple 3d shapes. If somebody does that I can see this shit exploding.
It's already kind of exploding and chances are some AI tool will be able to do things like 3D models from pictures within a few years if that's what you mean: https://rpgcodex.net/forums/threads...ted-images-as-art.143986/page-11#post-8108268
I'm trying to make a Justin Sweet/Vance Kovacs style Icewind Dale portrait, but it seems the AI is mostly inspired by shitty fanart
They're probably not famous enough and not prominently included in the training data, can't find them listed here for instance: https://github.com/AUTOMATIC1111/stable-diffusion-webui/blob/master/artists.csv
I can only find a few images in the LAION-Aesthetics v2 6+ database, some of which are repeated up to 7 times because they were scraped from different websites:
https://laion-aesthetic.datasette.io/laion-aesthetic-6pls/images?_search=Justin+Sweet&_sort=rowid
https://laion-aesthetic.datasette.io/laion-aesthetic-6pls/images?_search=Vance+Kovacs&_sort=rowid

SD has, afaik, been trained on a subset of LAION-5B picked for aesthetics (I've heard conflicting figures between 800M and 2B images), and I can't even find them in the full LAION-5B. See for instance what comes up for "Frank Frazetta" compared to their names:
https://rom1504.github.io/clip-retrieval/?back=https://knn5.laion.ai&index=laion5B&useMclip=false&query=Frank+Frazetta
https://rom1504.github.io/clip-retrieval/?back=https://knn5.laion.ai&index=laion5B&useMclip=false&query=Justin+Sweet
https://rom1504.github.io/clip-retrieval/?back=https://knn5.laion.ai&index=laion5B&useMclip=false&query=Vance+Kovacs

You could try Textual Inversion and teach it their style yourself from 5 strong example images, if you're set on it and have enough VRAM: https://rentry.org/textard
 

Non-Edgy Gamer

Grand Dragon
Patron
Glory to Ukraine
Joined
Nov 6, 2020
Messages
14,946
Strap Yourselves In
I guess next step is to make AI paint over simple 3d shapes. If somebody does that I can see this shit exploding.
Unless you mean texturing 3D objects, it can technically do that with img2img, depending on how high you set your diffusion.

Works on sketches too.

ODef3hu.png

JVa9qeR.png

It's not an exact science yet though. And it's obviously limited by the source to some degree.
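
To make the "how high you set your diffusion" part concrete: assuming that setting corresponds to what the Hugging Face diffusers img2img pipeline calls strength (a value from 0 to 1), the number of denoising steps actually applied to the source image scales with it. A small sketch of that arithmetic, with a made-up function name:

```python
def effective_steps(num_inference_steps, strength):
    """Denoising steps actually run on the source image for a given strength."""
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    # Only a fraction of the schedule is run; the rest is skipped because
    # the source image stands in for the early, noisier steps.
    return min(int(num_inference_steps * strength), num_inference_steps)


if __name__ == "__main__":
    for strength in (0.25, 0.5, 0.75, 1.0):
        print(strength, effective_steps(50, strength))
```

Low values keep the output close to the sketch or render; values near 1 mostly ignore it.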
 

Bigfass

Learned
Patron
Joined
Oct 9, 2020
Messages
561
Location
Florida
Codex Year of the Donut
Fuck's sake, at the rate this is accelerating, most of the digital artists/concept artists/so on will be begging on the streets in like, what, 2 to 3 years.
Art students on suicide watch.
Not really.

All the publicly known text-to-image AI systems have a serious problem with what they call compositionality. When you tell the AI to draw a picture of a man on a horse, it has no idea who's the man, who's the horse, and what their relationship is supposed to be in the picture. It will give you a result that will very likely have both a man and a horse, and the man's likely to be riding the horse (and not the other way around) but only because that is the scenario that it's familiar with based on its training data.

If you try anything even a bit more complicated, it will fall on its ass. A prompt for "a man on a horse holding a cat that's wearing a top hat" is unlikely to put the hat on the cat, even if it puts the cat in the man's hands (which is far from guaranteed).

Asking for something unusual is likely to give you nonsense; "a baby holding up his mother" did not give me a single picture of a super-strong baby lifting a woman. But hey, at least the result was diverse.

1663929029807.png


This doesn't seem to be a problem that can be "fixed" as the issue is likely to be foundational. When the "AI" processes language, there's no resulting mental model (like there is with humans), no understanding of what's been communicated. It's not even a text-to-image problem, but an "AI" problem in general.

1663930123557.png


This guy writes a lot about this stuff:

https://garymarcus.substack.com/archive
 

Non-Edgy Gamer

Grand Dragon
Patron
Glory to Ukraine
Joined
Nov 6, 2020
Messages
14,946
Strap Yourselves In
It was just a few months ago that people were posting DALL-E Mini memes and laughing at the idea of image generation ever being useful.

Now there are a dozen articles a week from Increasingly Nervous Man telling you that AI is a dead end.
 

Bigfass

Learned
Patron
Joined
Oct 9, 2020
Messages
561
Location
Florida
Codex Year of the Donut
Now there are dozens of articles from Increasingly Nervous Man telling you that AI is a dead end.
To be fair, most of the criticism is aimed at journalists and bloggers who play with DALL-E for an hour and then proclaim that it'll imminently change the world as we know it. Which in turn fuels stuff like this:

1663933416824.png


But yeah, Increasingly Nervous Man is saying exactly that, because intelligence requires a cognitive model of the world, or at least the given task.

A household robot that does the laundry but never puts the dog in the dryer is a great thing to aspire to but it does require intelligence.
 
