4.1 Encodings

All textual entities shall be UTF-8 encoded. The API docs state that “‘characters’ means logical characters rather than bytes”. pin interprets “logical characters” to mean grapheme clusters, so lengths and offsets are counted in grapheme clusters rather than in bytes or code points. It uses the unicode_segmentation crate to identify them.