Firen Word Generator

Words	Gloss
gugk	g_g_k/u_y
kyjaj	k_j_j/y_a
poxd	p_x_d/o_o
fak'	f_k_'/a_o
wy'as	w_'_s/y_a
fagg	f_g_g/a_e
jalt	j_l_t/a_y
'ejj	'_j_j/e_e
jyxiz	j_x_z/y_i
xethis	x_th_s/e_i
wedup	w_d_p/e_u
we'ab	w_'_b/e_a
'yxg	'_x_g/y_y
zegil	z_g_l/e_i
kypid	k_p_d/y_i
wotad	w_t_d/o_a
fags	f_g_s/a_y
zubt	z_b_t/u_o
wetux	w_t_x/e_u
papib	p_p_b/a_i
wysax	w_s_x/y_a
gytis	g_t_s/y_i
subas	s_b_s/u_a
bup'	b_p_'/u_e
gasp	g_s_p/a_e
gekuw	g_k_w/e_u
tuzal	t_z_l/u_a
lizf	l_z_f/i_y
tusut	t_s_t/u_u
judl	j_d_l/u_e
wyzuw	w_z_w/y_u
jidug	j_d_g/i_u
za''	z_'_'/a_y
pekaf	p_k_f/e_a
dothk	d_th_k/o_e
bowd	b_w_d/o_e
jejk	j_j_k/e_e
thysk	th_s_k/y_y
zutk	z_t_k/u_y
sys'	s_s_'/y_y
zez'	z_z_'/e_y
fejw	f_j_w/e_y
bekth	b_k_th/e_o
xyst	x_s_t/y_e
bilth	b_l_th/i_e
giku'	g_k_'/i_u
wadt	w_d_t/a_e
jyts	j_t_s/y_e
sazx	s_z_x/a_o
biji'	b_j_'/i_i
liduf	l_d_f/i_u
zetuj	z_t_j/e_u
wexuth	w_x_th/e_u
jytut	j_t_t/y_u
gigug	g_g_g/i_u
xalap	x_l_p/a_a
'ozaz	'_z_z/o_a
fidut	f_d_t/i_u
byzal	b_z_l/y_a
je'w	j_'_w/e_e
kosj	k_s_j/o_e
tozt	t_z_t/o_o
dytb	d_t_b/y_e
jeps	j_p_s/e_o
xywj	x_w_j/y_e
'ojw	'_j_w/o_y
tojib	t_j_b/o_i
wyzx	w_z_x/y_o
lakd	l_k_d/a_e
gesuk	g_s_k/e_u
biwk	b_w_k/i_o
petg	p_t_g/e_e
kubis	k_b_s/u_i
kosij	k_s_j/o_i
xabz	x_b_z/a_o
pikux	p_k_x/i_u
zypf	z_p_f/y_o
zefil	z_f_l/e_i
'ets	'_t_s/e_o
bowub	b_w_b/o_u
jelal	j_l_l/e_a
wodip	w_d_p/o_i
dexf	d_x_f/e_o
dugb	d_g_b/u_o
thazx	th_z_x/a_o
taxux	t_x_x/a_u
jezug	j_z_g/e_u
lepx	l_p_x/e_o
tegul	t_g_l/e_u
xip'	x_p_'/i_e
dekak	d_k_k/e_a
lyzuj	l_z_j/y_u
fowat	f_w_t/o_a
supf	s_p_f/u_o
dytik	d_t_k/y_i
jus'	j_s_'/u_e
tolag	t_l_g/o_a
sazid	s_z_d/a_i
faku'	f_k_'/a_u
wujif	w_j_f/u_i

Process returned 0


Noteworthy nodes in each datafile include:

Language	Datafile name	Root nodes (Click a root to generate from it)	Remarks
Firen	syllables.yml	`Sentence`, `Noun`, `Verb`, `NominalRoot`, `VerbalRoot`,	More information about Firen can be found on the Wiki.
Sajem Tan	sajemtan.yml	`Word`, `Root`, `Suffix`, `UnlikelyWord`, `UnlikelyRoot`, `UnlikelySuffix`,	Sajem Tan is a collaborative conlang. It has a website here.
English	english.yml	`Sentence`,	My (possibly poorly-considered) attempt to encode basic English grammar in WordGen. I apologise in advance to anyone who tries to make sense out of it.
Dab vi Suxi Kidap	ffb.yml	`Sentence`, `Word`, `Compound`, `Syllable`,	DVSK is a very simple isolating language created as a collaboration between me and 4 other people from the Sajem Tan tribe. It was abandoned after the foundations were worked out.
Xanz	xanz.yml	`word`, `tricons`, `root`, `word1`,	Another collaborative language in the Sajem Tan universe. It is the source of triconsonantal roots in Sajem Tan.
Jafren	jafren.yml	`Sentence`, `Word`, `ChordL`, `Chord`, `Chord1`, `Chord2`, `Chord3`, `Chord4`, `Chord5`,	A musical language used in the same setting as Firen. It is currently much less well-developed.
Jokes	jokes.yml	`Gender`,	Someone on Mastodon posted a silly CFG for making gender jokes, so I encoded it as a WordGen datafile. Nothing more to it.
Numbers	numbers.yml	`number`, `phoneNumber`, `internationalPhoneNumber`,	This is one of the first files I ever wrote, and it shows. It makes use of outdated and deprecated features of WordGen and makes the very questionable choice of using 'val' for a phonetic English reading of the number and 'ipa' for the digits.
Tests	CFGs.yml	`Dyck`, `binPalindrome`, `Node`,	This file exists as a testing ground for things that are too simple to need their own files, and for new or experimental features. You will need to increase the recursion depth to use some of these roots, particularly `Node`, or else you will get a million errors.

Note that CFGs.yml is not allowed on this web interface because it uses more resources than the other files and relies on WordGen/Cpp features.

Feel free to look at the sources for WordGen/Py and WordGen/Cpp. wordgen.py is the current version of the script, and syllables.yml is the current version of the Firen data file.

This is the web frontend for a Python program that will produce random words using a (rather nifty) weighted-randomized macro expansion approach. IPA transcriptions are generated from the same file, and are not directly attached to the orthography. This means that "digraph recognition" is not even a concept to worry about.
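
As a rough illustration of weighted-randomized macro expansion, here is a minimal sketch. It is not taken from wordgen.py: the `GRAMMAR` table, the node names in it, the `expand` function and the `max_depth` parameter are all hypothetical, and the real datafiles are YAML with their own format. Each node lists weighted alternatives, one alternative is picked at random according to its weight, and any symbol that names another node is expanded in turn.

```python
import random

# Hypothetical grammar, not the contents of any real datafile.
# Each node maps to a list of (weight, expansion) pairs. Symbols that are
# keys of GRAMMAR are expanded recursively; anything else is emitted as-is.
GRAMMAR = {
    "Word": [(3, ["Syllable"]), (1, ["Syllable", "Syllable"])],
    "Syllable": [(2, ["Onset", "Vowel"]), (1, ["Onset", "Vowel", "Coda"])],
    "Onset": [(1, ["k"]), (1, ["th"]), (1, ["z"])],
    "Vowel": [(2, ["a"]), (1, ["y"])],
    "Coda": [(1, ["s"]), (1, ["'"])],
}


def expand(symbol, rng, depth=0, max_depth=16):
    """Expand one symbol into a string by weighted random choice."""
    if symbol not in GRAMMAR:
        return symbol  # literal text, nothing to expand
    if depth >= max_depth:
        # Guard against runaway recursion in self-referential grammars.
        raise RecursionError("recursion depth exceeded at " + symbol)
    weights, options = zip(*GRAMMAR[symbol])
    chosen = rng.choices(options, weights=weights, k=1)[0]
    return "".join(expand(s, rng, depth + 1, max_depth) for s in chosen)


if __name__ == "__main__":
    rng = random.Random(12345)  # fixed seed so the run is repeatable
    for _ in range(5):
        print(expand("Word", rng))
```

Passing an explicit `random.Random` instance keeps the output reproducible for a given seed, which is the property a permalink would rely on.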

In a second phase, regular expressions and Mealy-type finite state machines are applied to transform the output.
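
To make the second phase concrete, here is a small sketch of what a regex pass followed by a Mealy machine could look like. The rules are invented for illustration and are not Firen's actual sound changes; `REGEX_RULES`, `mealy_step` and `transform` are hypothetical names, and the order of the two passes is an assumption. A Mealy machine emits output on each transition, so the output for a character depends on both the current state and the character itself.

```python
import re

# Hypothetical regex rule: collapse any doubled character into one.
REGEX_RULES = [
    (r"(.)\1", r"\1"),
]

VOWELS = set("aeiouy")


def mealy_step(state, ch):
    """One Mealy transition: returns (next_state, output).

    States: "V" = previous character was a vowel, "C" = anything else.
    Invented rule: "k" is output as "g" when it directly follows a vowel.
    """
    out = "g" if (state == "V" and ch == "k") else ch
    next_state = "V" if ch in VOWELS else "C"
    return next_state, out


def transform(word):
    """Apply the regex rules, then run the Mealy machine over the result."""
    for pattern, repl in REGEX_RULES:
        word = re.sub(pattern, repl, word)
    state = "C"  # start state: no preceding vowel
    pieces = []
    for ch in word:
        state, out = mealy_step(state, ch)
        pieces.append(out)
    return "".join(pieces)


if __name__ == "__main__":
    for w in ["fagg", "pekaf", "kyjaj"]:
        print(w, "->", transform(w))
```

The state machine is handy for context-dependent rules that would otherwise need lookbehind in a regex, while the regex pass suits simple pattern rewrites.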

The Firen datafile is quite well-developed and generally produces good results. The IPA transcriptions are sometimes non-obvious because they include synchronic sound changes, and sometimes unnatural but still generally correct, as with the overzealous syllabification.

The other datafiles are in various stages of development.

Not that it matters or anything, but unless you provide your own seeds, this web frontend has worse randomness because it is simply using Unix time as the seed. (It's required that the server generates the seed for the permalink to work, and time is the standard easy choice for these things.) When run from the command line without an explicit seed parameter, the randomness is much better (Python seeds its random generator from the system's main entropy source). Maybe I could make this Base64-encode some bytes from /dev/urandom or something for the seed instead; it wouldn't change too much.
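
The Base64 idea could be sketched roughly as follows. This is not the frontend's code: `make_seed` and its `n_bytes` parameter are hypothetical, and `os.urandom` stands in for reading /dev/urandom directly. The server would generate the string once, seed the generator with it, and put the same string in the permalink.

```python
import base64
import os
import random


def make_seed(n_bytes=9):
    """Return a URL-safe text seed built from OS-provided random bytes.

    9 bytes encode to exactly 12 Base64 characters with no '=' padding.
    """
    return base64.urlsafe_b64encode(os.urandom(n_bytes)).decode("ascii")


if __name__ == "__main__":
    seed = make_seed()
    rng = random.Random(seed)  # string seeds are accepted by random.Random
    print("seed:", seed)
    print("first draw:", rng.random())
```

Reusing the same seed string with `random.Random` reproduces the same sequence of draws, so a permalink only needs to carry the seed.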

Working-1.py is a less flexible earlier (Python 2 only) draft, which technically knows nothing about words, and only generates syllables. You may find it interesting or even useful. syllables1.yml is the data file for that version. The two versions are not compatible, but are mostly similar and a single file could in theory be agnostic between them.

Once this is "done", my next plan is to implement something with Markov chains, the more classical way to generate natural language.


Words: (Limit 250)
Show IPA:
Show glosses:
Show old orthography: (Sajem Tan only)
Show original IPA: (Sajem Tan only)
Show Ðab Tan: (Sajem Tan only)
Show Ðab Tan IPA: (Sajem Tan only)
Show ABC notation: (Jafren only)
Show alphabetical notation: (Jafren only)
Datafile:
Root Node:

Show paths (debug):
Show regex steps (debug):
Enable seed (debug):
Seed:
Recursion depth (debug):