Markov scrambled a world lexicon

This was the final for Reading and Writing Electronic Text, and it was mostly reading text.

Dealing mostly in context and meaning for most of the semester pushed my focus toward the phonetics and sounds of words when spoken. I had a feeling from the beginning that my project would be incoherent simply due to my lack of proper pronunciation. I am obsessed with the meaning that words have and how all our meaning has been built somewhat arbitrarily, albeit to me some words do match “the feeling” of what they mean. For example, flojo (flow ho) means lazy in Spanish. There are a great number of onomatopoeic words out there, but I was hoping to make more of an ideophone prose, which would probably end up sounding like random noises clopped together due to my lack of native phonetic proficiency.

On that idea of understanding: if the audience speaks or understands English, Spanish, Italian and Japanese, then the phonetics of the project won’t be paid much attention and it will simply be our minds trying to find meaning. So I rendered all the sounds to be phonetically mixed between languages, so you would have hybrids such as TA MÉ TOL (“table” fragmented between three languages: English/German, Spanish, Croatian).

So, without knowing it I had made a “super conceptual project.” Yet, I was still hoping for the best and that what was read would be heard more like an experimental jazz set.

To break it down, this is what my project looked like mapped out:

1. Run a Markov chain on Western-alphabet text and phonetic transliterations of non-Western languages. (This blends the source text across languages to give us more of a phonetic smoothie when it comes to the final spoken piece.)

from random import choice,uniform
import re
 
def w_choice(l):
    t = float(sum(y for x,y in l.items()))
    n = uniform(0,1)
    i = ''
    for i,w in l.items():
        if n < (w/t):
            break
        n -= (w/t)
    return i
 
class Markov:
    def __init__(self,f=None):
        self.splitre = re.compile(r"(\S+\s*)")
        self.dic = None
        if f is not None:
            self.addfile(f)
 
    def addfile(self,f):
        # Close the file promptly instead of leaking the handle.
        with open(f) as fh:
            txt = fh.read()
        self.add(txt)
 
    def add(self,txt,split=True):
        if self.dic is None:
            self.dic = {}
 
        if split is True:
            words = [x for x in self.splitre.split(txt) if x != '']
        else:
            words = txt
 
        # Count how often each word is followed by each next word.
        for pos,word in enumerate(words):
            subd = self.dic.setdefault(word,{})
            if pos+1 < len(words):
                nxt = words[pos+1]
                subd[nxt] = subd.get(nxt,0)+1
 
    def genline(self):
        if self.dic is None:
            return ""
 
        # list() is needed because choice() cannot index a dict view in Python 3.
        last = choice(list(self.dic.keys()))
        txt = last.capitalize()
        while( txt[-1] != '\n' ):
            if last == '': 
                last = choice(list(self.dic.keys()))
            word = w_choice(self.dic[last])
            last = word
            txt += word
 
        return txt[:-1]
 
    def gen(self,l,sep=None):
        if self.dic is None:
            return ""
 
        if sep is None:
            sep = ''
 
        # list() is needed because choice() cannot index a dict view in Python 3.
        last = choice(list(self.dic.keys()))
        txt = ""
        while( len(txt) < l ):
            if last == '': last = choice(list(self.dic.keys()))
            word = w_choice(self.dic[last])
            last = word
            if( len(txt + word + sep) > l ): 
                break
            txt += word + sep
 
        return txt
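
To show how the blending in step 1 plays out, here is a minimal sketch in Python 3. It is separate from the class above and is not the script I ran; `SOURCES`, `build_chain` and `blend` are names made up for this illustration. It pools three of the translations quoted in this post into one word-to-next-word table, so the walk can jump between languages wherever they share a word (here "mente", "morte" and "grande").

```python
import random
import re
from collections import defaultdict

# Three of the translations quoted in this post.
SOURCES = [
    "After all, to the well-organized mind, death is but the next great adventure.",
    "Dopo tutto, per la mente ben organizzata, ma la morte è la prossima grande avventura.",
    "Afinal, para a mente bem organizada, a morte é apenas a próxima grande aventura.",
]

def build_chain(texts):
    """Count word -> next-word transitions, pooling every text in one table."""
    chain = defaultdict(lambda: defaultdict(int))
    for text in texts:
        words = re.findall(r"\S+", text.lower())
        for current, following in zip(words, words[1:]):
            chain[current][following] += 1
    return chain

def blend(chain, length, rng):
    """Walk the chain for exactly `length` words, restarting at dead ends."""
    word = rng.choice(sorted(chain))
    out = [word]
    while len(out) < length:
        followers = chain.get(word)
        if not followers:
            # A sentence-final word has no successor: jump to a random start.
            word = rng.choice(sorted(chain))
        else:
            candidates = sorted(followers)
            weights = [followers[c] for c in candidates]
            word = rng.choices(candidates, weights=weights, k=1)[0]
        out.append(word)
    return " ".join(out)

if __name__ == "__main__":
    print(blend(build_chain(SOURCES), 12, random.Random()))
```

With only three sentences the English line never mixes with the others unless the walk restarts, which hints at why the blend needs shared tokens (or a character-level chain) to really cross languages.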

2. Use Logios from CMU for the final phonetic output.

First run without markov treatment:
random and matched output
 
 
After all, to the well-organized mind, death is but the next great adventure.
 
Saba kē bāda, acchī taraha sē āyōjita mana mēṁ, mr̥tyu lēkina agalē mahāna sāhasika hai.
 
baul de khun shek il oh ow tun koohn de mutuvun de kezein de al de mulv de kawldin.
 
Afinal, para a mente bem organizada, a morte é apenas a próxima grande aventura.
Kekkyoku no tokoro, yoku soshiki no kokoro ni,-shi wa tsugi no idaina bōkendesu.
 
Nachdem alle, die gut organisierte Geist ist der Tod, sondern das nächste große Abenteuer.
 
Italian	
Dopo tutto, per la mente ben organizzata, ma la morte è la prossima grande avventura.
 
Polish	
Po tym wszystkim, do umysłu dobrze zorganizowany, ale śmierć jest obok wielkiej przygody.
 
Ukrainian
Vreshti-resht , do dobre orhanizovanym rozumom , smertʹ , ale nastupnyy̆ velyka pryhoda .
 
Malay
Setelah semua, ke fikiran yang terancang dengan baik, kematian hanyalah petualangan besar berikutnya.
 
Filipino
Matapos ang lahat, sa mga well-organized isip, ang kamatayan ay ngunit ang malaking adventure.

3. Repeat the same phrase over and over again in different languages, so as to work on the idea of “incantation” or magic.

Phrase: “After all, to the well-organized mind, death is but the next great adventure.”

4. Try to pronounce result text without pulling a muscle, attempt to give a good spectrum of language:

Result text: 
 
Logios Cleaned up phonetics
 
NACHDEM ALLE, DIE GUT ORGANISIERTE GEIST IST DER TOD, ABER DAS NäCHSTE GROßE ABENTEUER.
 
1. N AE CH D AH M AO L D AY G AH T AO R G AE N AH S IY R T G AY S T IH S T D ER T AA D EY B ER D AE S NG K S T G R OW AH B EH N T Y UW AH R
 
APRèS TOUT, à L'ESPRIT BIEN ORGANISé, MAIS LA MORT EST LA GRANDE AVENTURE. 
 
2. AE P R Z T AW T L EH S R AH T B IY N AO R G AH N IY Z M EY Z L AA M AO R T AH S T EY T L AA G R AE N D AH V EH N CH ER
 
DESPUéS DE TODO, CON LA MENTE BIEN ORGANIZADA, LA MUERTE NO ES SINO LA PRóXIMA AVENTURA.	
 
3. D EH S AH S D IY T OW D OW K AA N L AA M EH N T B IY N AO R G AH N IH Z AH D AH L AA M Y UW AH R T N OW EH S S AY N OW L AA P R K S IH M AH AH V EH N T Y AH R AH
 
AFTER ALL, TO THE WELL-ORGANIZED MIND, DEATH IS BUT THE NEXT GREAT ADVENTURE. 
 
4. AE F T ER AO L T UW DH AH W EH L AO R G AH N AY Z D M AY N D D EH TH IH Z B AH T DH AH N EH K S T G R EY T AH D V EH N CH ER
 
KEKKYOKU NO TOKORO, YOKU SOSHIKI NO KOKORO NI,-SHI WA TSUGI NO IDAINA BōKENDESU.	
 
5. K EH K AY AH K UW N OW T AA K AH R OW Y OW K UW S AA SH AH K IY N OW K AA K AH R OW N IY SH IY W AA T S UW JH IY N OW AH D EY N AH B K EH N D AH S UW
 
IMMERS, DE GOED GEORGANISEERDE GEEST, DE DOOD IS, MAAR DE VOLGENDE GROTE AVONTUUR.
 
6. IH M EH R Z D IY G OW D JH IY AH R G AE N AH S AH R D JH IY AH S T D IY D UW D IH Z M EY AH R D IY V OW L JH AH N D G R OW T AH V AA N T Y UW AH R
 
V KONTSE KONTSOV, K HOROSHO ORGANIZOVANNYM UMOM , SMERTʹ , NO SLEDUYUSHCHIĭ BOLʹSHOE PRIKLYUCHENIE.
 
7. V IY K AA N T S K AA N T S AH V K EY HH AO R AH SH OW AO R G AE N AH Z AH V AE N AH M Y UW M AH M S M AH R T N OW S L EH D Y UW AY AH SH IY B OW L SH OW P R IH K L AY AH CH IY N AY
 
PO TYM WSZYSTKIM, DO UMYSłU DOBRZE ZORGANIZOWANY, ALE śMIERć JEST OBOK WIELKIEJ PRZYGODY.
 
8. P OW T IH M W S Z IH S T AH M D UW Y UW M AH S UW D AA B R Z Z AO R G AH N AH Z OW AH N IY EY L M AY AH R JH EH S T AA B AH K W IY L K IY JH P R Z IH G AH D IY
 
AFINAL, PARA A MENTE BEM ORGANIZADA, A MORTE é APENAS A PRóXIMA GRANDE AVENTURA.	
 
9. AH F AY N AH L P EH R AH EY M EH N T B EH M AO R G AH N IH Z AH D AH EY M AO R T AE P AH N AH Z EY P R K S IH M AH G R AE N D AH V EH N T Y AH R AH
 
EFTER ALT, AT DET VELORGANISEREDE TANKERNE DøDEN ER, MEN DET NæSTE STORE EVENTYR.	
 
10. EH F T AH R AE L T AE T D EH T V AH L AH R G AH N IH S AH R IY D T AE NG K AH R N D EH N EH R M EH N D EH T N S T S T AO R AH V EH N T AH R
 
DOPO TUTTO, PER LA MENTE BEN ORGANIZZATA, MA LA MORTE è LA PROSSIMA GRANDE AVVENTURA.	
 
11. D OW P OW T AH T OW P ER L AA M EH N T B EH N AO R G AH N IH Z AH T AH M AA L AA M AO R T L AA P R OW S IH M AH G R AE N D AE V EH N T Y AH R AH
 
DOPO TUTTO, PER LA MENTE BEN ORGANIZZATA, MA LA MORTE è LA PROSSIMA GRANDE AVVENTURA.
 
12. D OW P OW T AH T OW P ER L AA M EH N T B EH N AO R G AH N IH Z AH T AH M AA L AA M AO R T L AA P R OW S IH M AH G R AE N D AE V EH N T Y AH R AH
 
BìJìNG, ZǔZHī LIáNGHǎO DE TóUNǎO, SǐWáNG ZHǐ BùGUò SHì XIà YīGè WěIDà DE MàOXIǎN.
 
13. B IY JH EY EH N JH IY Z IY Z IY EY CH L IH N OW D IY T Y UW N OW EH S D AH B AH L Y UW EH N G Z IY EY CH B G Y UW EH S EY CH Z IY Y G W IH D D IY M AA K S AH N
 
VéGTéRE IS, A JóL SZERVEZETT SZEM ELőTT TARTVA, A HALáL, DE A KöVETKEZő NAGY KALAND.	
 
14. V G T AH R IH Z EY JH EY L S Z AH R V AH Z AH T S Z EH M EH L T T AA R T V AH EY HH AO L D IY EY K V EH T AH Z N EY G IY K AE L AH N D
 
LOPPUJEN LOPUKSI, HYVIN ORGANISOITU MIELESSä, KUOLEMA ON VAAN SEURAAVA SUURI SEIKKAILU.	
 
15. L AA P Y UW JH AH N L AH P AH K S IY HH IH V AH N AO R G AH N AH S OY T UW M IY L AH S K Y UW AA L AH M AH AA N V EY AH N S UH R AH V AH S UW AH R IY S AY K EY L UW
 
TROSS ALT, TIL DEN VELORGANISERTE SINN, ER DøDEN, MEN DEN NESTE STORE EVENTYRET.
 
16. T R AA S AE L T T IH L D EH N V EH L AH R G AE N AH S AH R T S IH N ER D EH N M EH N D EH N N EH S T S T AO R AH V EH N T AH R AH T
 
NäR ALLT KOMMER TILL DEN VäLORGANISERADE åTANKE äR DöD MEN NäSTA STORA äVENTYR.
 
17. EH N R AE L T K AA M ER T IH L D EH N V L AO R G AH N IH S AH R EY D T AE NG K R D IY D M EH N N S T AA S T AO R AH V EH N T AH R
 
V KRAĭNA SMETKA, S DOBRE ORGANIZIRANI UM , SMŭRTTA E , NO NA SLEDVASHTIYA GOLYAMO PRIKLYUCHENIE.
 
18. V IY K R AE N AH S M EH T AH EH S D OW B AH R AO R G AH N AH Z IH R AH N IY AH M S M R T AA IY N OW N AA S L EH D AH SH T IY AY AH G AH L AY AH M OW P R IH K L AY AH CH IY N AY
 
METá APó óLA, ME TI̱N KALá ORGANO̱MéNI̱ NOU , O THáNATOS , ALLá I̱ EPóMENI̱ MEGáLI̱ PERIPéTEIA.
 
19. M EH T AE P L AA M IY T IH N K AE L AO R G AH N AA M N IY N UW OW TH N AE T AH Z AO L AY EH P M AH N IY M EH G L IY P AH R IH P T IY AH
 
GYEOLGUG, JAL JOJIG SINGYEONG , JUG-EUM EUN GEU DA-EUM KEUN MOHEOM IDA .
 
20. JH AY IY OW L G AH G JH AE L JH AA JH AH G S IH N JH AY IY AH NG JH AH G Y UW M Y UW N JH UW D AA Y UW M K Y UW N M AA HH IY AH M AY D AH
 
PO TOM VšETKOM, ABY DOBRE-ORGANIZOVANý MYSEľ JE SMRť, ALE NA ďALšIE VEľKé DOBRODRUžSTVO.
 
21. P OW T AA M V EH T AH M AH B AY D OW B AH R AO R G AH N IH Z AH V AH N M AY Z JH IY EH S EH M AA R EY L N AA AH L AY V EH K D AA B R AH D R AH S T V OW
 
LA URMA URMEI, PENTRU MINTEA BINE ORGANIZATE, DAR MOARTEA ESTE URMăTOAREA MARE AVENTURA.
 
22. L AA AH R M AH AH R M IY P EH N T R UW M IH N T IY AH B AY N IY AO R G AH N AY Z AH T D AA R M OW R T IY AH EH S T AH R M T OW R IY AH M EH R AH V EH N T Y AH R AH
 
DESPRéS DE TOT, AMB LA MENT BEN ORGANITZADA, LA MORT NO éS SINó LA PROPERA AVENTURA.
 
23. D EH S R Z D IY T AA T AE M L AA M EH N T B EH N AO R G AH N IH T S AH D AH L AA M AO R T N OW S S IH N L AA P R AA P IH R AH AH V EH N T Y AH R AH
 
NAKON SVEGA, DOBRO ORGANIZIRANA UM, ALI SMRT JE SLJEDEćA VELIKA PUSTOLOVINA.
 
24. N AE K AH N S V EH G AH D AA B R OW AO R G AH N AH Z IH R AH N AH AH M AA L IY EH S EH M AA R T JH IY S L JH IY D IY AH V EH L IH K AH P AH S T AH L AA V AH N AH
 
GALų GALE, SU GERAI ORGANIZUOTU PROTAS, MIRTIS, TAčIAU ARTIMIAUSIAS DIDELIS NUOTYKIS.
 
25. G AE L G EY L S UW JH EH R EY AO R G AH N AH Z UW W AH T UW P R OW T AE Z M AH R T AH S T EY AH AA R T AH M IY AO S IY AH Z D IH D AH L AH S N Y UW AA T AH K AH S
 
KONEC KONCEV, ZA DOBRO ORGANIZIRANO UM, SMRT JE PA NASLEDNJA VELIKA AVANTURA.
 
26. K AA N AH K K AA N S AH V Z AA D AA B R OW AO R G AH N AH Z IH R AH N OW AH M EH S EH M AA R T JH IY P AA N AE Z L EH D N JH AH V EH L IH K AH AH V AE N T Y AH R AH
 
SONUçTA, IYI ORGANIZE AKLA, öLüM ANCAK BIR SONRAKI BüYüK MACERA.
 
27. S AA N Y UW T AH AY AY IY AO R G AH N AY Z AE K L AH EH L M AE NG K AH K B ER S AA N R AH K IY B IH K M AE S AH R AH
 
POSLE SVEGA, DOBRO ORGANIZOVANI UM , SMRT JE SAMO SLEDEćU VELIKU AVANTURU .
 
28. P OW Z AH L S V EH G AH D AA B R OW AO R G AH N AH Z AA V AH N IY AH M EH S EH M AA R T JH IY S AE M OW S L IY D Y UW V EH L AH K UW AH V AE N T Y AH R UW
 
NAKON SVEGA, DOBRO ORGANIZIRANA UM, ALI SMRT JE SLJEDEćA VELIKA PUSTOLOVINA.
 
29. N AE K AH N S V EH G AH D AA B R OW AO R G AH N AH Z IH R AH N AH AH M AA L IY EH S EH M AA R T JH IY S L JH IY D IY AH V EH L IH K AH P AH S T AH L AA V AH N AH
 
GALU GALā, AR LABI ORGANIZēTU PRāTU, NāVE IR TIKAI NāKAMAIS LIELAIS PIEDZīVOJUMS.
 
30. G AE L UW G AE L AA R L AE B IY AO R G AH N IH Z T UW P R T UW N V IY IH R T IH K EY NG K AE M EY AH S L AY L EY Z P IY D Z AH JH AH M Z
 
EFTIR ALLT SAMAN, Að VEL SKIPULöGð HUGA, ER DAUðINN EN STóRKOSTLEGT æVINTýRI.
 
31. EH F T AH R AE L T S EY M AH N AH V EH L S K IH P AH L G HH Y UW G AH ER D AO AH N EH N S T R K AA S T L AH G T V IH N T R IY
 
PAS Të GJITHA, Në MENDJEN E ORGANIZUAR MIRë, VDEKJA ëSHTë VETëM AVENTURë TJETëR E MADHE.
 
32. P AA Z T IY G JH IH TH AH EH N M EH N JH AH N IY AO R G AE N AH Z UW W AH R M IH R V D EH K JH AH EH S EY CH T IY V EH T M AH V EH N T AH R T JH EH T R IY M AE D HH
 
VRESHTI-RESHT , DO DOBRE ORHANIZOVANYM ROZUMOM , SMERTʹ , ALE NASTUPNYY̆ VELYKA PRYHODA.
 
33. V R EH SH T IY R EH SH T D UW D OW B AH R AO R AH N AH Z AA V AH N AH M R AH Z UW M AH M S M AH R T EY L N AE S T AH P N AY IY V EH L AH K AH P R IH HH AH D AH
 
SETELAH SEMUA, KE PIKIRAN YANG TERORGANISIR DENGAN BAIK, KEMATIAN HANYALAH PETUALANGAN BESAR BERIKUTNYA.
 
34. S EH T AH L AH S EH M Y UW AH K IY P IH K AH R AH N Y AE NG T EH R AH R G AE N AH S AH R D EH N G AH N B EY K K IY M EY SH AH N HH AH N AY AH L AH P EH T Y UW AH L AE N G AH N B IY Z ER B AH R AH K AH T N AY AH
 
SABA Kē BāDA, ACCHī TARAHA Sē āYōJITA MANA Mēṁ, MR̥TYU LēKINA AGALē MAHāNA SāHASIKA HAI.
 
35. S AA B AH K EY B D AA AE K T AE R AH HH AH EH S Y JH IH T AH M AA N AH EH M M R T AY UW L K IH N AH EY JH AH L M AE N AH SH AE S AH K AH HH EY
 
AZKEN FINEAN, ONDO ANTOLATUTA KONTUAN, HERIOTZA BAINA HURRENGO ABENTURA HANDI DA.
 
36. AE Z K AH N F AY N IY AH N AO N D OW AE N T AH L AH T Y UW T AH K AA N T Y UW AH N HH IY R IY AA T S AH B EY N AH HH AH R EH N G OW AH B EH N T Y AH R AH HH AE N D IY D AA
 
VPROčEM, NA DOBRO - ORGANIZIRAN UM , SMRTTA E , NO SLEDNIOT GOLEMA AVANTURA .
 
37. V P R OW AH M N AA D AA B R OW
 
TROSS ALT, TIL DEN VELORGANISERTE SINN, ER DøDEN, MEN DEN NESTE STORE EVENTYRET.
 
38. T R AA S AE L T T IH L D EH N V EH L AH R G AE N AH S AH R T S IH N ER D EH N M EH N D EH N N EH S T S T AO R AH V EH N T AH R AH T
 
WEDI'R CYFAN, ER Y MEDDWL YN DREFNUS, MARWOLAETH YW OND MAE'R ANTUR FAWR NESAF.
 
39. W EH D AH R S AY F AH N ER W AY M EH D W L Y N D R EH F N AH S M AA R W AA L EY AH TH Y W AA N D M IY R AE N T AH R F AO R N EH S AH F
 
QOVELIVE AMIS SHEMDEG , ROM KARGAD ORGANIZEBULI AZRIT’, SIKVDILI , MAGRAM MOMDEVNO DIDI T’AVGADASAVALI .
 
40. K AA V AH L AH V AE M IH S SH EH M D AH G R AA M K AA R G AH D AO R G AE N AH Z AH B Y UW L IY AE Z R AH T S IH K V D AH L IY M AE G R AH M M AA M D EH V N OW D IY D IY T AE V AH D AH S AE V AH L IY
 
I VERJO , VORPESZI LAV KAZMAKERPVATS MTK’I , MAH E, BAYTS’ HAJORD METS ARKATSAYIN.
 
41. AY V AH R JH OW V AO R P EH Z IY L AE V K AE Z M AE K AH R P V EY T S M T IY M AA B EY T S HH AE JH AH R D M EH T S AA R K AE T S EY AH N

In the end I decided the Markov version wasn’t worth using for my spoken piece: the phonetic blending I had hoped for didn’t survive the limited dictionary that CMU Logios uses. I did end up giving that weakness a different conceptual spin: the American lexicon for the Festival Speech Synthesis System attempting to phonetically translate 41 different languages, and the errors it created. I chose a phrase from Harry Potter because of the gargantuan number of languages it has been translated into. For any language I couldn’t find, I used Google Translate, with the closest sister language as a basis for the translation. Any phonetic translation from a non-Western alphabet also used Google’s phonetic/spoken feature on “internationally acceptable” translated source text. I aimed for the top 50 languages spoken in the world, a number based on the time it would take me to read around 13 words 50 times. In the end I only found ways of getting accurate translations for 41 of the 50 languages I wanted.
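
One pattern in the result text above is that accented letters leave no phoneme behind: BōKENDESU comes out as B K EH N D AH S UW, with nothing for the ō. The sketch below is only a guess at that one behavior, under the assumption that anything outside plain A-Z is discarded before pronunciation; it does not model Logios itself or its letter-to-sound rules, and `surviving_letters` and `dropped_letters` are hypothetical helper names.

```python
import string

ASCII_UPPER = set(string.ascii_uppercase)

def surviving_letters(word):
    """Return only the letters of `word` that fall inside plain A-Z."""
    return "".join(ch for ch in word if ch in ASCII_UPPER)

def dropped_letters(word):
    """List the alphabetic characters that the A-Z filter would discard."""
    return [ch for ch in word if ch.isalpha() and ch not in ASCII_UPPER]

if __name__ == "__main__":
    # Words taken from the result text in this post.
    for word in ["BōKENDESU", "APRèS", "PRóXIMA"]:
        print(word, "->", surviving_letters(word), "dropped:", dropped_letters(word))
```

Checking which letters would be dropped before feeding a line to the tool is a quick way to predict where the pronunciation will collapse into consonant clusters.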

Any feedback on this project would be helpful, because right now I believe it’s going in two different directions: the phonetic blending of phrases to form a super pidgin language, and the chaos of phonetic translation when you use the wrong phonetic tool on a language.
