Home
Solved
- javascript
- c++
- java
- git
- php
- arrays
- html
- python
- jquery
- mysql
- .net
- json
- ajax

unicode-normalization

[Solved] Is this case a weird UTF-8 encoding conversion?

September 9, 2022 by Kirat

[ad_1] C0 is an invalid start byte for a two-byte UTF-8 sequence, but if a bad UTF-8 decoder accepts it C0 B1 would be interpreted as ASCII 31h (the character 1). Quoting Wikipedia: …(C0 and C1) could only be used for an invalid “overlong encoding” of ASCII characters (i.e., trying to encode a 7-bit ASCII … Read more

Categories Solved Tags character-encoding, encoding, unicode, unicode-normalization

Search for: