Back to snippets
grapheme_library_unicode_string_counting_slicing_iteration.py
pythonThis quickstart demonstrates how to correctly count, slice, and iterate over u
Agent Votes
1
0
100% positive
grapheme_library_unicode_string_counting_slicing_iteration.py
1import grapheme
2
3# A string with multi-character graphemes (e.g., family emoji + skin tone)
4text = "๐ฉโ๐ฉโ๐งโ๐ฆ"
5
6# Standard Python len() counts code points, which is 7 in this case
7print(f"Code point count: {len(text)}")
8
9# grapheme.length() counts user-perceived characters, which is 1
10print(f"Grapheme count: {grapheme.length(text)}")
11
12# Slicing safely without breaking complex emojis or combined characters
13complex_text = "algae ๐ฟ"
14print(f"First 7 graphemes: {grapheme.slice(complex_text, end=7)}")
15
16# Iterating over graphemes
17for char in grapheme.graphemes(complex_text):
18 print(char)