- TIL \r\n (CRLF) is a single grapheme cluster according to the Unicode standard (like an emoji) https://unicode.org/reports/tr29/#table_combining_char_sequences_and_grapheme_clusters 26 comments programming
Linking pages
- The Absolute Minimum Every Software Developer Must Know About Unicode in 2023 (Still No Excuses!) @ tonsky.me https://tonsky.me/blog/unicode/ 903 comments
- Let's Stop Ascribing Meaning to Code Points - In Pursuit of Laziness http://manishearth.github.io/blog/2017/01/14/stop-ascribing-meaning-to-unicode-code-points/ 600 comments
- A Programmer’s Introduction to Unicode – Nathan Reed’s coding blog http://reedbeta.com/blog/programmers-intro-to-unicode/ 284 comments
- Language tour | Dart http://www.dartlang.org/language-tour/ 87 comments
- How hard could it be? Sorting words alphabetically in Rust https://sts10.github.io/2023/01/29/sorting-words-alphabetically-rust.html 81 comments
- GitHub - netxs-group/vtm: Terminal multiplexer with window manager and session sharing https://github.com/netxs-group/vtm 69 comments
- Breaking Our Latin-1 Assumptions - In Pursuit of Laziness http://manishearth.github.io/blog/2017/01/15/breaking-our-latin-1-assumptions/ 65 comments
- What is the unit of a text column number? https://foonathan.net/2021/02/column/ 64 comments
- Rust GUI Infrastructure http://www.cmyr.net/blog/rust-gui-infra.html 46 comments
- Language tour | Dart https://dart.dev/guides/language/language-tour#switch-and-case 39 comments
- Announcing Dart 2.7: A safer, more expressive Dart | by Michael Thomsen | Dart | Medium https://medium.com/dartlang/dart-2-7-a3710ec54e97 39 comments
- swift/StringManifesto.md at main · apple/swift · GitHub https://github.com/apple/swift/blob/master/docs/stringmanifesto.md 30 comments
- Text formatting in C++ using libc++ - The LLVM Project Blog https://blog.llvm.org/posts/2022-08-14-libc++-format/ 25 comments
- Idiosyncratic Ruby: Regular Extremism http://idiosyncratic-ruby.com/11-regular-extremism.html 17 comments
- Introducing Blast.js - Mozilla Hacks - the Web developer blog https://hacks.mozilla.org/2014/09/introducing-blast-js/ 9 comments
- GitHub - amanbolat/awesome-go-with-stars: Awesome-go list with stars. Automatically updated https://github.com/amanbolat/awesome-go-with-stars 9 comments
- GitHub - anirbanmu/str_metrics: Ruby gem (native extension in Rust) providing implementations of various string metrics https://github.com/anirbanmu/str_metrics 8 comments
- GitHub - rivo/uniseg: Unicode Text Segmentation, Word Wrapping, and String Width Calculation in Go https://github.com/rivo/uniseg 7 comments
- UTF-8 strings with Go: len(s) isn't enough | Henrique Vicente https://henvic.dev/posts/go-utf8/ 5 comments
- GitHub - clipperhouse/jargon: Tokenizers and lemmatizers for Go https://github.com/clipperhouse/jargon 4 comments
Related searches:
Search whole site: site:unicode.org
Search title: UAX #29: Unicode Text Segmentation
See how to search.