This is not a duplicate of Why are usernames allowed to be composed entirely of non-visible Unicode characters? as that simply asks why this ability exists. This is a feature request asking for this ability to be removed. This answer by former employee @waffles doesn't really apply either; while I do agree that some problematic usernames can only be detected by humans (especially when they depend on context), this type of usernames can be blocked automatically.
Consider this post.
It's made by a user who decided to put some strange characters in their username, which results in them not being rendered. Where can you click to navigate to that user?
...
It took me a while to realize you can click the (blank) avatar as well, I've never used that feature before.
I propose to disallow usernames like this; there should always be something of a name to click on. I mean, it's called a 'display name', so it should be ... displayed, right?
The characters are from the Latin-1 Supplement, alternating between a LATIN CAPITAL LETTER I WITH ACUTE (U+00CD) and a SINGLE SHIFT THREE (U+008F). The first one is unusual but rather innocent (just a vowel with a diacritical mark), the second one is 'suspicious' in as far as Unicode characters can be like that. The Unicode code point is still visible in the URL, encoded as %cd%8f%cd%8f%cd%8f; their username has been reset networkwide by an unknown ♦ moderator (whoever it was, thank you!).
2 Answers 2
I consider this as abuse of the system, plain and simple.
As also mentioned in comments, one way to handle it is by custom flagging such a user (i.e. flagging one of their posts) and ask the moderators to reset the name, mod on a single site can also reset network wide.
While there is no official policy banning such names, it's common sense that using such a name is harmful and anything but funny.
That byte sequence is UTF-8 for U+034F COMBINING GRAPHEME JOINER. According to the UCD, this is not a whitespace character, but everyone knows the UCD is full of errors. Including the UCD, considering the note on this very character:
the name of this character is misleading; it does not actually join graphemes
An easy solution would be to grab Florian Pigorsch's characters.json, convert it to a regex, and ensure that all usernames contain at least one non-whitespace character not on that list.
-
1We should be much stricter about characters permissible in names, at a system-automated level. It's not just a matter of annoyance; it increases security risks such as impersonating other users or exploiting text rendering bugs. For a start I would suggest requiring at least one character explicitly recognized as a letter; and disallowing "other" characters and separators except for space, ideographic (full-width) space and possibly soft hyphen.Karl Knechtel– Karl Knechtel2024年05月20日 08:03:41 +00:00Commented May 20, 2024 at 8:03
-
3@KarlKnechtel Usernames aren't unique. You don't need to do anything fancy to impersonate another user; if text rendering can create a security vulnerability, post bodies are surely a bigger attack surface.wizzwizz4– wizzwizz42024年05月20日 08:19:19 +00:00Commented May 20, 2024 at 8:19
You must log in to answer this question.
Explore related questions
See similar questions with these tags.
CD 8F CD 8F CD 8F%) shouldn't be allowed in usernames? It might get complicated to assemble a list of every possible character combination that could result in something strange.