...
Applications that accept untrusted input, especially Unicode-based input, should normalize the input before validating it. Character information in The following table shows the version of Unicode supported by the previous three release of Java SE.
Java Version | Unicode Version |
---|---|
Java SE 6 |
...
Unicode Standard, version 4.0 [Unicode 2003] |
...
Java SE 7 |
...
Unicode Standard, version 6.0.0 [Unicode 2011] | |
Java SE 8 | Unicode Standard, version 6.2.0 [Unicode 2012] |
Normalization . Normalization is important because in Unicode, the same string can have many different representations. According to the Unicode Standard [Davis 2008], annex #15, Unicode Normalization Forms:
...