Hacker News with Generative AI: UTF-8

Branchless UTF-8 Encoding (cceckman.com)
Can you encode UTF-8 without branches?
The WTF-8 Encoding (simonsapin.github.io)
WTF-8 (Wobbly Transformation Format − 8-bit) is a superset of UTF-8 that encodes surrogate code points if they are not in a pair. It represents, in a way compatible with UTF-8, text from systems such as JavaScript and Windows that use UTF-16 internally but don’t enforce the well-formedness invariant that surrogates must be paired.
You probably don't need to validate UTF-8 strings (viralinstruction.com)
Decoding UTF8 with parallel extract (nrk.neocities.org)
PEP 686 – Make UTF-8 mode default (python.org)