from Hacker News

Rot8000 – Rot13 for the Unicode generation

by rottytooth on 11/2/13, 4:27 PM with 18 comments

by lelf on 11/2/13, 5:20 PM
It's broken.
Λ̊1 → ⊻∪ά → Λ̊⋌
𝄞 → 뤔뷾 → 駴點
Edit: anyway, even with correct (a+b)%n it's plain bad idea.
Unicode is not English alphabet. Everything not in basic multilingual plane is broken automatically. And even in BMP there's going to be bag of glitches starting from hanging combining characters and ending to ‘oops someone normalised our string and it's now different’ (for site, not for user / Unicode).
by mischanix on 11/2/13, 5:09 PM
Not reciprocal for CJK input, e.g. "한글" takes 5 iterations to reach stability. I believe this has to do with the utf-16 encoding of codepoints > 0x10000
by aculver on 11/2/13, 7:08 PM
Inputting "こんにちは。元気ですか？" caused an application error:
```
    [ArgumentException: Error serializing value 'ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫ�' of type 'System.String.']
```
After realizing it was "？" that was breaking everything, I ended up with this round trip:
"こんにちは。元気ですか。" → "ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫტ" → "こんにちは。ጃ⷗ですか。"
It's broken. I suspect Unicode requires more careful manipulation than OP anticipated. :-)

by peterwaller on 11/2/13, 6:40 PM

Copy-pasting the contents of rot8000.com/info in and hitting cypher twice ends up scrambling the contents quite a bit..

  It also bypasses 32 control characters, technically making it rot7968, sometimes with an additional offset.

  It also bypasses ⋍2 control characters, technically making it rot⋏⋬68, sometimes with an additional offset.

by rottytooth on 11/7/13, 4:15 AM
I put in a fix for CJK and the result is: nearly everything that's not CJK now rotates into it and back out; CJK is an huge section of the Basic Multilingual Plane. The fix invalidates rotations done with rot8000 before the fix, unfortunately.
by njharman on 11/3/13, 12:54 AM
I just realized that 13 was probably chosen for rot13 cause that's half the number of letters in English alphabet.
I miss "obvious" stuff like that all the time.
by jloughry on 11/2/13, 4:47 PM
Why not call it Rot8192 or Rot0x7777 ?