@rommix0@mindly.social

Thread context 2 posts in path

Parent @erion@tardis.pw Open

@erion@tardis.pw

This is the worst # audio editing and mastering for CD that I have ever seen. There is a quite popular audio drama series and it turns out they have released audio cd versions of it, which should be b

Current reply

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · 16h ago

@erion here's the thing though. You'll most likely need a vintage Mac that has a daw with that kind of capability, and an old school teac cd burner that connects to scsi.

View full thread on mindly.social

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 10, 2026

here are a couple words from the phoneme dictionary in BeSTspeech:

.data:100964B4 unk_100964B4 db 0Ch ; jh
.data:100964B5 db 28h ; O
.data:100964B6 db 36h ; '
.data:100964B7 db 12h ; r
.data:100964B8 db 0Ch ; jh
.data:100964B9 db 0h ;
.data:100964BA db 0h ;
.data:100964BB db 0h ;

.data:100964BC unk_100964BC db 1h ; f
.data:100964BD db 12h ; r
.data:100964BE db 26h ; E
.data:100964BF db 36h ; '
.data:100964C0 db 14h ; d
.data:100964C1 db 0h ;
.data:100964C2 db 0h ;
.data:100964C3 db 0h ;

.data:100964C4 unk_100964C4 db 14h ; d
.data:100964C5 db 1Eh ; e
.data:100964C6 db 36h ; '
.data:100964C7 db 4h ; v
.data:100964C8 db 24h ; =
.data:100964C9 db 14h ; d
.data:100964CA db 0h ;
.data:100964CB db 0h ;

View on mindly.social

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 10, 2026

The phoneme dictionary for BeSTspeech has been found in the DLL.

The phonemes are simply stored as indexes pointing to the phoneme inventory table within array entries for such words like "one" "two" "three" and so on. This is looking promising.

View on mindly.social

2

0

3

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 08, 2026

BestSpeech uses bit masking for retrieving phoneme segmental features from a lookup table. Here are my notes thus far:

first bit is v/c switch (0 - vowel, 1 - consonant)

second bit is plosive flag

third bit is voicing flag

fourth bit is alveolar flag

fifth bit is affricate flag

sixth bit is nasal flag

seventh bit is strident flag (pertains to sibilants and some fricative sounds)

eighth bit is v/c switch but in reverse (0 - consonant, 1 - vowel)

View on mindly.social

1

0

3

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 08, 2026

This was quite amusing. BeSTspeech would work well as a wolf simulator.

View on mindly.social

3

0

7

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 07, 2026

So turns out BeSTspeech has a couple feature translation tables for ASCII. One of them being a bit masked letter class feature. That class table determines whether an ASCII symbol is a vowel, letter, number, or punctuation.

View on mindly.social

0

2

0

Thread context 2 posts in path

Parent @fastfinge@fed.interfree.ca Open

@fastfinge@fed.interfree.ca

Thanks, AI diversity proofreader! Because yes of course that’s a thing that exists, sigh. Highlighting the word “Walkman” and suggesting “Consider a more gender neutral term, like walkperson” was sure

Current reply

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 06, 2026

@fastfinge Eh. I blame the data curators for that more than the AI. Virtue signaling on both sides has been quite a thing for the last decade.

View full thread on mindly.social

0

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Mar 05, 2026

Someone confused my github profile as a business yesterday. Had a freelancer send me an email introducing himself. I assumed he was looking for a freelance gig.

View on mindly.social

0

Thread context 2 posts in path

Parent @spacepup@mastodon.stickbear.me Open

on mastodon.stickbear.me

Open ancestor post

Current reply

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

Anthony C. Bartman

@rommix0@mindly.social

Part of the text to speech and AI image generation communities

mindly.social

@rommix0@mindly.social · Feb 27, 2026

@spacepup@mastodon.stickbear.me @datajake1999@dragonscave.space There always was a full version. The full CD of Monologue 97 is on the Internet Archive.

View full thread on mindly.social

0

Anthony C. Bartman

Posts