...
There are many different categories of characters that a programmer might decide to exclude. For example, the Unicode Standard [Unicode 2012] defines the following categories of characters, all of which can be matched using an appropriate regular expression:
...
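As a minimal sketch of matching characters by Unicode general category with a regular expression, the Java snippet below uses the `\p{...}` category syntax of `java.util.regex.Pattern`. The class name `CategoryFilter`, the method `containsExcluded`, and the particular set of categories (Cc, Cf, Cs, Co, Cn) are assumptions chosen for illustration; they are not taken from the text, and a real application would select the categories that fit its own policy.

```java
import java.util.regex.Pattern;

public class CategoryFilter {
    // Hypothetical exclusion set (an assumption for this sketch):
    // Cc = control, Cf = format, Cs = surrogate,
    // Co = private use, Cn = unassigned (includes noncharacters).
    private static final Pattern EXCLUDED =
        Pattern.compile("[\\p{Cc}\\p{Cf}\\p{Cs}\\p{Co}\\p{Cn}]");

    // Returns true if the input contains at least one character
    // belonging to any of the excluded categories.
    public static boolean containsExcluded(String input) {
        return EXCLUDED.matcher(input).find();
    }

    public static void main(String[] args) {
        System.out.println(containsExcluded("hello"));      // false
        System.out.println(containsExcluded("a\u0007b"));   // true: U+0007 is a control character
        System.out.println(containsExcluded("a\uFDEFb"));   // true: U+FDEF is a noncharacter
    }
}
```

The sketch only detects excluded characters and leaves the decision about what to do with such input to the caller.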
[API 2006] |
3.5, Deletion of Noncharacters | |
Handling the Unexpected: Character-deletion | |
...