Cantonese swearing characters and other Chinese neologisms

When I lived in Hong Kong, I was aware that key Cantonese swear words - and the Cantos love to swear; they are masters of the art - were officially unwriteable. There simply weren’t the characters to do it.

And yet, I noticed graffiti that was using character neologisms that represented key swear words.

Many of these were a benign character inserted into the word for door - 門 (mun). The implication being that this mutates the sound of the benign word into a taboo one.

The most obvious one was diu - “to fuck”. The character comprised the word for small - 小 (siu) inserted into 門.

Another one was the word for west - 西 (sai) inside 門, which made it into hai, or “cunt”.

A brief search reveals this site which has a few of these ‘door’ characters - including a third one for lun, “fucking”, which uses a character that I recognise but can’t remember. Is it a simplified dragon - 龍 (loong) or am I barking up the wrong tree?

Note that the site uses images rather than Unicode characters.

Does anyone know if this phenomenon has been documented anywhere? Anyone know how old these characters are - are they actually neologisms or are they ancient but taboo? Am I correct in assuming that these are not official, and therefore haven’t been encoded into official character sets? Finally, what is the mechanism for charater neologisms? When a new character arises, or is officially coined, how long before Unicode etc. incorporates it? Is there a schedule? Or are neologisms ignored in favour of word combining to express new concepts?

Not sure about the swearing aspects, but that third character, , means “can” (as in “being able to”).

Hmm… “nang” - doesn’t sound much like lun, so there’s 33.3333% of my theory out of the water.

Native Cantonese speaker here…

These characters belong to the “Hong Kong Supplementary Character Set”, and they are now officially incorporated into Unicode. They are not universally used on the Internet, however, because they are standardised relatively late. On most computers, the user needs to install additional software in order to view and input them.

Wiki
Microsoft

These characters have been in use for a long time. I was quite familiar with them while growing up in the 80s and early 90s. :smiley: They follow a path of development similar to other non-swearing Cantonese characters. The Wiki article on written Cantonese should provide more info on the mechanism at work.

𨶙 (門+能) actually means “penis” literally, but more vulgar. It is like the masculine version of “cunt”. Its pronunciation is lan2, compared to nang4 for 能. If you consider the tendency for younger speakers to mispronounce n and the -ng ending, they do sound pretty similar.