Firen Word Generator
Word | Gloss |
---|---|
zůjakkalu | PP-[PFV]fall[IND]-INC |
nad slannaro | PSV [C][PFV]sing[IND]-TERM |
nad nům | PSV APV |
nad zůhanalu | PSV PP-[PFV]speak[IND]-INC |
zůfama | PP-[PFV]sleep[IND] |
sikůkitta | DP-HAB-verb[IND] |
nům kůčunna | APV [C]HAB-birth[IND] |
nad půt nepalu | PSV DPV [C][PFV]lend[IND]-INC |
sůtebičo | DP-[PFV]verb:SBJ-MOMT |
půt gůta | DPV [C][PFV]give[IND] |
půt nepa | DPV [C][PFV]lend[IND] |
nad nům | PSV APV |
nad nům | PSV APV |
nad nům | PSV APV |
nad bůloṙnali | PSV F-[PFV]verb[IND]-INC |
zůzůttamčo | PP-[PFV]verb:DEO-MOMT |
sůskoṙffam | DP-[PFV]overdo_farewell:DEO |
nad nům | PSV APV |
nům bůsaigamlu | APV F-[PFV]wrap:DEO-INC |
nad nům | PSV APV |
nad nům | PSV APV |
nům zůsůggalu | APV PP-[PFV]be_sick[IND]-INC |
nům bikůčunna | APV F-HAB-birth[IND] |
nům bůgůlamro | APV F-[PFV]mark:DEO-TERM |
nům sůzaimaro | APV DP-[PFV]turn[IND]-TERM |
zůskoṙffamli | PP-[PFV]overdo_farewell:DEO-INC |
nad gůtůtafu | PSV [C]CONT-touch:COND |
kůskoṙffafu | [C]HAB-overdo_farewell:COND |
důda | [C][PFV]verb[IND] |
zegůčudfu | PP-CONT-shine:COND |
gůčudam | [C]CONT-shine:DEO |
nad nům | PSV APV |
čučaṙ | [C][PFV]sneeze:SPEC |
nad nům | PSV APV |
nad sůtůtao | PSV DP-[PFV]touch:SBJ |
nad nům | PSV APV |
gůčuda | [C]CONT-shine[IND] |
zůskoṙffa | PP-[PFV]overdo_farewell[IND] |
nad sigůpetla | PSV DP-CONT-grasp[IND] |
nad nům | PSV APV |
jakkadesk | [C][PFV]fall:DISS |
nad nům | PSV APV |
nad nům | PSV APV |
nad nům | PSV APV |
nům zůlůttaṙ | APV PP-[PFV]remove:SPEC |
nům sas kůskoṙffa | APV RCP [C]HAB-overdo_farewell[IND] |
nům gůpittalu | APV [C]CONT-feel_pain[IND]-INC |
jakka | [C][PFV]fall[IND] |
nad nům | PSV APV |
nad bůpetlamro | PSV F-[PFV]grasp:DEO-TERM |
nad zegůkogfu | PSV PP-CONT-push:COND |
nům bůvollaṙ | APV F-[PFV]change:SPEC |
nad nům | PSV APV |
bikůtupu | F-HAB-verb:SBJ |
nad nům | PSV APV |
nad bigůhisppaṙro | PSV F-CONT-find:SPEC-TERM |
čudure | [C][PFV]shine:SBJ-TERM |
nad sigůvůmma | PSV DP-CONT-create[IND] |
nad kůkogfu | PSV [C]HAB-push:COND |
nad saosao | PSV [C][PFV]count:SBJ |
nům gůlůtta | APV [C]CONT-remove[IND] |
sůjakkamro | DP-[PFV]fall:DEO-TERM |
nům volla | APV [C][PFV]change[IND] |
nům kůzaimůṙ | APV [C]HAB-turn:SPEC |
nad půt neparo | PSV DPV [C][PFV]lend[IND]-TERM |
nům kůvollare | APV [C]HAB-change[IND]-TERM |
nad sůfosa | PSV DP-[PFV]help[IND] |
nům bikůčonna | APV F-HAB-verb[IND] |
nad nům | PSV APV |
nad nům | PSV APV |
nad tozzare | PSV [C][PFV]love[IND]-TERM |
zůčučače | PP-[PFV]sneeze[IND]-MOMT |
nad bikůtůtai | PSV F-HAB-touch:SBJ |
nům bůvosta | APV F-[PFV]lose[IND] |
nad nům | PSV APV |
gessačo | [C][PFV]verb[IND]-MOMT |
nad sas fosa | PSV RCP [C][PFV]help[IND] |
nad sas sůtoṙsesk | PSV RCP DP-[PFV]shout:DISS |
sanoṙ | [C][PFV]verb:SPEC |
půt gůtamro | DPV [C][PFV]give:DEO-TERM |
bůlaska | F-[PFV]verb[IND] |
nad nům | PSV APV |
bikůjakkai | F-HAB-fall:SBJ |
jakka | [C][PFV]fall[IND] |
nad zekůkoṙeskro | PSV PP-HAB-govern:DISS-TERM |
kůčučamče | [C]HAB-sneeze:DEO-MOMT |
nad nům | PSV APV |
nad bůgessaṙ | PSV F-[PFV]verb:SPEC |
nad zůvůmmadesk | PSV PP-[PFV]create:DISS |
nům gůgůlaro | APV [C]CONT-mark[IND]-TERM |
nům sas gůskoṙffare | APV RCP [C]CONT-overdo_farewell[IND]-TERM |
nům bůgůla | APV F-[PFV]mark[IND] |
čučam | [C][PFV]sneeze:DEO |
nům čunnače | APV [C][PFV]birth[IND]-MOMT |
nům bigůgůla | APV F-CONT-mark[IND] |
nad nům | PSV APV |
nům zůčunnali | APV PP-[PFV]birth[IND]-INC |
nad nům | PSV APV |
sůskoṙffa | DP-[PFV]overdo_farewell[IND] |
nad nům | PSV APV |
Process returned 0
Utility Functions: Clear, Permalink
Noteworthy nodes in each datafile include:
Language | Datafile name | Root nodes (Click a root to generate from it) | Remarks |
---|---|---|---|
Firen | syllables.yml | Sentence , Noun , Verb , NominalRoot , VerbalRoot , | More information about Firen can be found on the Wiki. |
Sajem Tan | sajemtan.yml | Word , Root , Suffix , UnlikelyWord , UnlikelyRoot , UnlikelySuffix , | Sajem Tan is a collaborative conlang. It has a website here. |
English | english.yml | Sentence , | My (possibly poorly-considered) attempt to encode basic English grammar in WordGen. I apologise in advance to anyone who tries to make sense out of it. |
Dab vi Suxi Kidap | ffb.yml | Sentence , Word , Compound , Syllable , | DVSK is a very simple isolating language that was created as a collaboration between me and 4 other people from the Sajem Tan tribe, however it was abandoned after working out the foundations. |
Xanz | xanz.yml | word , tricons , root , word1 , | Another collaborative language in the Sajem Tan universe. It is the source of triconsonantal roots in Sajem Tan. |
Jafren | jafren.yml | Sentence , Word , ChordL , Chord , Chord1 , Chord2 , Chord3 , Chord4 , Chord5 , | A musical language used in the same setting as Firen. It is currently much less well-developed. |
Jokes | jokes.yml | Gender , | Someone on Mastodon posted a silly CFG for making gender jokes, so I encoded it as a WordGen datafile. Nothing more to it. |
Numbers | numbers.yml | number , phoneNumber , internationalPhoneNumber , | This is one of the first files I ever wrote, and it shows. It makes use of outdated and deprecated features of WordGen and made the very questionable choice of using 'val' for a phonetic English reading of the number and 'ipa' for the digits. |
Tests | CFGs.yml | Dyck , binPalindrome , Node , | This file exists as a testing ground for things that are
too simple to need their own files, and for new or experimental
features. You will need to uncrease the recursion depth to use some of
these roots, particularly Node or else
get a million errors. |
Note that CFGs.yml is not allowed on this web interface due to higher resource use than the other files and its reliance on WordGen/Cpp features.
Feel free to look at the sources for WordGen/Py and WordGen/Cpp.
wordgen.py
is the current version of the script, and
syllables.yml
is the current version of the Firen data file.
This is the web frontend for a Python program that will produce random words using a (rather nifty) weighted-randomized macro expansion approach. IPA transcriptions are generated from the same file, and are not directly attached to the orthography. This means that "digraph recognition" is not even a concept to worry about.
In a second phase, regular expressions and Mealy-type finite state machines are applied to transform the output.
The Firen datafile is generally quite well-developed, and produces generally good results. The IPA transcriptions are sometimes non-obvious because they include synchronic sound changes, and sometimes unnatural but generally still correct, such as with the overzealous syllabification.
The other datafiles are in various stages of development.
Not that it matters or anything, but unless you provide your own seeds, this web frontend has worse randomness because it is simply using Unix time as the seed. (It's required that the server generates the seed for the permalink to work, and time is the standard easy choice for these things.) When run from the command line without an explicit seed parameter, the randomness is much better (Python seeds its random generator from the system's main entropy source). Maybe I could make this Base64-encode some bytes from /dev/urandom or something for the seed instead, it wouldn't change too much.
Working-1.py
is a less flexible earlier (Python 2 only) draft,
which technically knows nothing about words, and only generates syllables.
You may find it interesting or even useful. syllables1.yml
is
the data file for that version. The two versions are not compatible, but
are mostly similar and a single file could in theory be agnostic between
them.
Once this is "done", my next plan is to implement something with Markov chains, the more classical way to generate natural language.
Top