How would nanochan anonymize a spoken recording of someone in a way completely resistant to analysis from glow in the dark adversaries? I imagine that even if you fucked with the pitch of the voice and all that you could still be identified your general cadence and style of phrasing.
All I can think of is something like speech to text > anonymouth > text to speech which should be pretty secure but would be potentially error prone and probably would have a fairly significant delay. Can any nanons think of something better?
What's the best privacy respecting text to speech option?
The nanochan internet radio thread got me interested in whether or not we could have a functional anonymous DJ. Having an actual girl as the host doesn't seem very on brand to me but having a female digital voice seems cyberpunk as fuck.
Speech synthesizers don't leak information in speech patterns (as long as you didn't use your own voice for it and the crowd you're trying to blend into also uses the same speech synthesizer), but they do leave you vulnerable to textual analysis if you haven't got that covered.
>>4483 >textual analysis
That's what I put anonymouth in the middle of the process for, although last I looked into the project it was pretty much dead and also written in java. It's the only software I know about that protects against that sort of thing too. I would have thought there'd be enough paranoid security enthusiasts to create an variety of different alternatives but for whatever reason we're stuck with anonymouth.
>>4485 If you're feeding it self-made data and restricting it from the internet, what's the problem with using a Java program? I think that the reason there is no other alternatives is because, in the end, each anon will have his own training corpus and it will still leak your style (at least it's my uneducated guess). There's a more up-to-date R library for stylo, but I'm not sure there are any non-manual anonymization options in it: https://github.com/computationalstylistics/stylo
>>4482 I want to add that if it IS a requirement, you're opening yourself for a tracking attack.
>something like speech to text > anonymouth > text to speech
Why not just type what you want to say, feed it to a speech synthesizer and play a respective file over voicecomms or, better, just send it via 3rd-party file hosting, preferrably through a proxy?
I just want to say that talking to actual random people through voice synthesizers/voice scramblers really draws attention to you. You're better off just never doing it ever. Just type.
>>4497 Outside of general interest in the topic, I want to come up with a solution that would work for the proposed radio show. The biggest downside of typing is that it creates further delay between when you think the word and when the other person hears/reads the word, as they have to wait for you to fully finish your sentence or paragraph before they can even hear/read it, then start on their response. What would be best is if you could send each word or a small chunk of words at a time to be queued up for the voice synthasizer to read out on the radioshow, which is a system well suited for speech to text I think. That way you could have live discussions with only a couple of seconds of delay instead of 30+ seconds, which wouldn't really resemble a live conversation at that point.
Also I found some open source tensorflow based stuff for each, such as google's tacotron for text to speech and mozilla's deepspeech for speech to text. Something not from those specific companies would be way more optimal for cyberpunk larp but they seem like quality options that fit the requirements. Tacotron has a handful of different versions made by different people on github too
https://github.com/pagea/unstyle I found a small project for anonymizing writing but the author warns that it's still too easly in development and also it's dead
>>4500 I see it as you have contradictory goal.
One goal is to escape the glow, to scramble the REAL LIFE speech in a way for glows not being able to do anything, "completely resistant".
Another one is to actually preserve the realtime speech.
Realtime communication would be even more individual than personal writing style, as I see it, with a personal way to pronounce, to build phrases, to listen, etc.
Of course, we're looking into very advanced glowers here, but you said "completely resistant", so… Just drop something man.
>>4506 If the same processes that would anonymize the speech after being recorded could be applied to real time speech (assuming it's even possible to properly subvert stylometric analysis without the whole sentence to work with), then then I don't see why you wouldn't want the convenience of something that works in real time.
In practice I'll accept that it wouldn't be practical especially for this hypothetical nanochan radio, and honestly a text to speech on air discussion would probably be extremely messy even if both voices were able to smoothly talk to eachother.
>>4507 > If the same processes that would anonymize the speech after being recorded
Well, OBVIOUSLY not everything can be applied. You will need to send each word separately, essentially, which means no styling fixes, no pauses fixes, possibly no even tone fixes. I think it's so obvious that you didn't give this much thought at all, and I'm this close to flaming you. xD
>>4481 It just sounds like you want to be le ebin death note style 1337 h4x0rm4n with le ebin voice changer and spoopy iconz.
Just use text and stop being a larper.
>>4508 Okay, disregard the sending single words at a time part and instead just think of a small batch of words, because I just want to limit the delay to a couple of seconds and it doesn't need to convert every every word live in real time as I talk like a babblefish. When I used the phrase real time I meant to make the destinction between full prerecorded sentences but it can have a small time buffer for having enough information to work with. A couple of seconds of words should be enough to fuzz most minor pauses and even switch around word choice, not sure what you mean by tone. You don't need the entire text to anonymize a section of it so surely there's some ideally small snippet you could pass through without making it completely useless.
>>4509 I want an actual anonymous way to convert speech, but yeah it's for larp purposes
>>4510 >ideally small snippet you could pass through without making it completely useless.
Well, I think, for "ideal anonymization" (or rather, close to ideal) you need the whole sentence, no less. Ideally the whole text should be restyled, but I think it's way too paranoid myself.
I say the whole sentence because sometimes the meaning on the whole sentence depends on the last word, and word meanings might become invalid or rather awkward if you don't take the context of the whole sentence into account. Like, it may become legitimately hard to understand, as with poor machine translation.
So, probably your originally proposed solution might be just the right one. You'd just need a better text anonymizer than anonymouth probably. Not that I know any. Like, I clearly have no degree in linguistics and I have no idea why you reply to me even, as I cannot add much else other than what already've been said.
Oh the memes created with speech synthesizers: rRbY3TMUcgQ
>>4481 <start libary of audio/video snippets (randomized sources)
<create program converting txt input to composition of audio/video src/lib
<upgrade to voice to text recognition + combine with previous version
something like this ? ^
also(vid somewhat related)
what about cutting words and phrases from tv and movies and editing them together, this would be completely anonymous.
this would be the process:
>you say "i love nanochan"
>text-to-speech trasforms to text
>search in audio database
>finds i_love.mp3 scraped from a random movie
>nano.mp3 scraped from the anime Nichijou
>chan.mp3 from a random anime
>puts them together with correct pauses
>play the result
next to no processing is involved since the audio scraping is done beforehand, you would probably need to assign metatadata to each audio file, and as long as the scraped sources are public is completely anonymous, you can also do this by hand with a video editor if you don't need realtime.
>>4485 I think the best solution is beginning practice writing in voice that is separate from one we usually use like what is observable in star posters posts :))))).
No it wouldn't be perfect or totally 100% l33t anonymous, but some kinda real voice changing thing would be terrific, that I can tell you, believe me.
Maybe (and im outside my retarded bounds of retarded knowledge, like a retard here), but could you not just pipe the audio through something like Ardour (which I don't know how to use), mess it up a little, and spew it back out?
How would nanochan anonymize a spoken recording of someone in a way completely resistant to analysis from glow in the dark adversaries? I imagine that even if you fucked with the pitch of the voice and all that you could still be identified your general cadence and style of phrasing.
All I can think of is something like speech to text > anonymouth > text to speech which should be pretty secure but would be potentially error prone and probably would have a fairly significant delay. Can any nanons think of something better?
What's the best privacy respecting text to speech option?
The nanochan internet radio thread got me interested in whether or not we could have a functional anonymous DJ. Having an actual girl as the host doesn't seem very on brand to me but having a female digital voice seems cyberpunk as fuck.