Firen Word Generator

Words: (Limit 250)
Show IPA:
Show glosses:
Show old orthography: (Sajem Tan only)
Show original IPA: (Sajem Tan only)
Show Ðab Tan: (Sajem Tan only)
Show Ðab Tan IPA: (Sajem Tan only)
Show ABC notation: (Jafren only)
Show alphabetical notation: (Jafren only)
Datafile:
Root Node:
Show All Options:

WordsGloss
áfwathiTO-f_w_th/e_a-GEN
amagaddTO-b_g_d/a_a-LOC
ewi'e'dFROM-w_'_'/i_e-LOC
jijbij_j_b/i_o-GEN
á'ubegiTO-'_b_g/u_e-GEN
elijaldFROM-l_j_l/i_a-LOC
toxwt_x_w/o_y-DIR
á'uje'iTO-'_j_'/u_e-GEN
añhgosTO-s_g_s/o_o-DIR
zutkz_t_k/u_e-DIR
áxijywTO-x_j_w/i_y-DIR
e'putFROM-'_p_t/o_u-DIR
álikelTO-l_k_l/i_e-DIR
dothfid_th_f/o_y-GEN
fufagif_f_g/u_a-GEN
añhlokiTO-s_l_k/y_o-GEN
jexujenj_x_j/e_u-LOC
thabutenth_b_t/a_u-LOC
eszoziFROM-s_z_z/o_o-GEN
tygjt_g_j/y_e-DIR
gijakg_j_k/i_a-DIR
añhtildTO-s_t_l/y_i-LOC
pazutip_z_t/a_u-GEN
edtuzFROM-d_t_z/o_u-DIR
elikuzdFROM-l_k_z/i_u-LOC
edjojFROM-d_j_j/e_o-DIR
se'gens_'_g/e_o-LOC
ezijyfdFROM-z_j_f/i_y-LOC
á'uthaziTO-'_th_z/u_a-GEN
ebisasFROM-b_s_s/i_a-DIR
e'safFROM-'_s_f/o_a-DIR
áxthasiTO-x_th_s/y_a-GEN
añhjejiTO-s_j_j/o_e-GEN
ekidalFROM-k_d_l/i_a-DIR
ekugediFROM-k_g_d/u_e-GEN
'eji'i'_j_'/e_i-GEN
edletFROM-d_l_t/e_e-DIR
áwatofiTO-w_t_f/a_o-GEN
áthulyxTO-th_l_x/u_y-DIR
'ogl'_g_l/o_o-DIR
ethijefFROM-th_j_f/i_e-DIR
áwwaddTO-w_w_d/o_a-LOC
tyzathit_z_th/y_a-GEN
bexuthenb_x_th/e_u-LOC
ekkylFROM-k_k_l/o_y-DIR
thyldith_l_d/y_y-GEN
áxudabTO-x_d_b/u_a-DIR
añhabajiTO-s_b_j/a_a-GEN
ekuge'dFROM-k_g_'/u_e-LOC
añhaxalTO-s_x_l/a_a-DIR
áfa'oziTO-f_'_z/a_o-GEN
ezsitdFROM-z_s_t/e_i-LOC
pothudenp_th_d/o_u-LOC
bydafb_d_f/y_a-DIR
áxpodiTO-x_p_d/o_o-GEN
ellyldFROM-l_l_l/o_y-LOC
buduxenb_d_x/u_u-LOC
pytibenp_t_b/y_i-LOC
áfjojTO-f_j_j/o_o-DIR
petjip_t_j/e_o-GEN
amdytdTO-b_d_t/e_y-LOC
anhfexTO-t_f_x/e_e-DIR
angikopdTO-g_k_p/i_o-LOC
'ygidi'_g_d/y_i-GEN
susibs_s_b/u_i-DIR
anhisygdTO-t_s_g/i_y-LOC
áwugiwdTO-w_g_w/u_i-LOC
etusakiFROM-t_s_k/u_a-GEN
ezuzypiFROM-z_z_p/u_y-GEN
fo''enf_'_'/o_y-LOC
anthasTO-d_th_s/e_a-DIR
añhabisTO-s_b_s/a_i-DIR
e'usybFROM-'_s_b/u_y-DIR
elupi'dFROM-l_p_'/u_i-LOC
pylajp_l_j/y_a-DIR
á'dylTO-'_d_l/e_y-DIR
lyjlenl_j_l/y_o-LOC
ejujeziFROM-j_j_z/u_e-GEN
egtytdFROM-g_t_t/o_y-LOC
amizusdTO-b_z_s/i_u-LOC
'ewj'_w_j/e_y-DIR
eptedFROM-p_t_d/y_e-DIR
efxawFROM-f_x_w/o_a-DIR
á'dajTO-'_d_j/e_a-DIR
likgl_k_g/i_y-DIR
thozxenth_z_x/o_o-LOC
álatoliTO-l_t_l/a_o-GEN
angiguthdTO-g_g_th/i_u-LOC
jyzufij_z_f/y_u-GEN
anhsotdTO-t_s_t/e_o-LOC
xukthix_k_th/u_y-GEN
añapathdTO-z_p_th/a_a-LOC
añupygTO-z_p_g/u_y-DIR
áxizigiTO-x_z_g/i_i-GEN
ájifuddTO-j_f_d/i_u-LOC
amzy'iTO-b_z_'/o_y-GEN
eb'ufiFROM-b_'_f/e_u-GEN
babusenb_b_s/a_u-LOC
e'uxibFROM-'_x_b/u_i-DIR
ze'kenz_'_k/e_e-LOC

Process returned 0

Utility Functions: Clear, Permalink

Noteworthy nodes in each datafile include:

LanguageDatafile nameRoot nodes (Click a root to generate from it)Remarks
Firensyllables.ymlSentence, Noun, Verb, NominalRoot, VerbalRoot, More information about Firen can be found on the Wiki.
Sajem Tansajemtan.ymlWord, Root, Suffix, UnlikelyWord, UnlikelyRoot, UnlikelySuffix, Sajem Tan is a collaborative conlang. It has a website here.
Englishenglish.ymlSentence, My (possibly poorly-considered) attempt to encode basic English grammar in WordGen. I apologise in advance to anyone who tries to make sense out of it.
Dab vi Suxi Kidapffb.ymlSentence, Word, Compound, Syllable, DVSK is a very simple isolating language that was created as a collaboration between me and 4 other people from the Sajem Tan tribe, however it was abandoned after working out the foundations.
Xanzxanz.ymlword, tricons, root, word1, Another collaborative language in the Sajem Tan universe. It is the source of triconsonantal roots in Sajem Tan.
Jafrenjafren.ymlSentence, Word, ChordL, Chord, Chord1, Chord2, Chord3, Chord4, Chord5, A musical language used in the same setting as Firen. It is currently much less well-developed.
Jokesjokes.ymlGender, Someone on Mastodon posted a silly CFG for making gender jokes, so I encoded it as a WordGen datafile. Nothing more to it.
Numbersnumbers.ymlnumber, phoneNumber, internationalPhoneNumber, This is one of the first files I ever wrote, and it shows. It makes use of outdated and deprecated features of WordGen and made the very questionable choice of using 'val' for a phonetic English reading of the number and 'ipa' for the digits.
TestsCFGs.ymlDyck, binPalindrome, Node, This file exists as a testing ground for things that are too simple to need their own files, and for new or experimental features. You will need to uncrease the recursion depth to use some of these roots, particularly Node or else get a million errors.

Note that CFGs.yml is not allowed on this web interface due to higher resource use than the other files and its reliance on WordGen/Cpp features.

Feel free to look at the sources for WordGen/Py and WordGen/Cpp. wordgen.py is the current version of the script, and syllables.yml is the current version of the Firen data file.

This is the web frontend for a Python program that will produce random words using a (rather nifty) weighted-randomized macro expansion approach. IPA transcriptions are generated from the same file, and are not directly attached to the orthography. This means that "digraph recognition" is not even a concept to worry about.

In a second phase, regular expressions and Mealy-type finite state machines are applied to transform the output.

The Firen datafile is generally quite well-developed, and produces generally good results. The IPA transcriptions are sometimes non-obvious because they include synchronic sound changes, and sometimes unnatural but generally still correct, such as with the overzealous syllabification.

The other datafiles are in various stages of development.

Not that it matters or anything, but unless you provide your own seeds, this web frontend has worse randomness because it is simply using Unix time as the seed. (It's required that the server generates the seed for the permalink to work, and time is the standard easy choice for these things.) When run from the command line without an explicit seed parameter, the randomness is much better (Python seeds its random generator from the system's main entropy source). Maybe I could make this Base64-encode some bytes from /dev/urandom or something for the seed instead, it wouldn't change too much.

Working-1.py is a less flexible earlier (Python 2 only) draft, which technically knows nothing about words, and only generates syllables. You may find it interesting or even useful. syllables1.yml is the data file for that version. The two versions are not compatible, but are mostly similar and a single file could in theory be agnostic between them.

Once this is "done", my next plan is to implement something with Markov chains, the more classical way to generate natural language.

Top