by rottytooth on 11/2/13, 4:27 PM with 18 comments
by lelf on 11/2/13, 5:20 PM
Λ̊1 → ⊻∪ά → Λ̊⋌
𝄞 → 뤔뷾 → 駴點
Edit: anyway, even with a correct (a+b)%n it's a plain bad idea.
Unicode is not the English alphabet. Everything outside the Basic Multilingual Plane is broken automatically. And even in the BMP there's going to be a bag of glitches, starting with hanging combining characters and ending with ‘oops someone normalised our string and it's now different’ (different for the site, not for the user or for Unicode).
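To make the two failure modes concrete, here is a rough sketch in Python. It is not the site's actual algorithm: `rot_utf16_units` and the `HALF` offset are hypothetical, picked only so the arithmetic is easy to follow. It rotates each UTF-16 code unit on its own, which is roughly what a per-"character" rotation does in a language with UTF-16 strings.

```python
import unicodedata

HALF = 0x8000  # hypothetical offset, for illustration only


def rot_utf16_units(text: str, offset: int = HALF) -> str:
    """Rotate every UTF-16 code unit independently, modulo 0x10000."""
    # "surrogatepass" lets lone surrogates through instead of raising.
    data = text.encode("utf-16-be", errors="surrogatepass")
    units = [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]
    rotated = [(u + offset) % 0x10000 for u in units]
    raw = b"".join(u.to_bytes(2, "big") for u in rotated)
    return raw.decode("utf-16-be", errors="surrogatepass")


# 1. Outside the BMP: one visible character is two code units (a surrogate
#    pair), so it turns into two unrelated characters.
clef = "\U0001D11E"  # MUSICAL SYMBOL G CLEF
scrambled = rot_utf16_units(clef)
print(len(clef), len(scrambled))

# 2. Normalisation: these two strings are canonically equivalent, but they
#    rotate to different outputs of different lengths.
decomposed = "A\u030A"                                # A + COMBINING RING ABOVE
composed = unicodedata.normalize("NFC", decomposed)   # single code point U+00C5
print(rot_utf16_units(decomposed) == rot_utf16_units(composed))
```

With this particular offset and a correct modulus the clef does survive a round trip, but the intermediate text has split one symbol into two, and anything that normalises the input before rotating changes what comes out.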
by mischanix on 11/2/13, 5:09 PM
by aculver on 11/2/13, 7:08 PM
[ArgumentException: Error serializing value 'ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫ�' of type 'System.String.']
After realizing it was "?" that was breaking everything, I ended up with this round trip: "こんにちは。元気ですか。" → "ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫტ" → "こんにちは。ጃですか。"
It's broken. I suspect Unicode requires more careful manipulation than OP anticipated. :-)
by peterwaller on 11/2/13, 6:40 PM
It also bypasses 32 control characters, technically making it rot7968, sometimes with an additional offset.
-> It also bypasses ⋍2 control characters, technically making it rot⋏⋬68, sometimes with an additional offset.
by rottytooth on 11/7/13, 4:15 AM
by njharman on 11/3/13, 12:54 AM
I miss "obvious" stuff like that all the time.
by jloughry on 11/2/13, 4:47 PM