
standard_chunk_text_splitting_with_max_token_size.py

python

This quickstart demonstrates how to split a string of text into chunks with a maximum token size using chunk_text.

from standard_chunk import chunk_text

# The text you want to process
text = """
Standard Chunk is a lightweight library designed to provide consistent,
high-quality text chunking for LLM applications. It ensures that
your context windows are utilized efficiently by breaking down
large documents into smaller, meaningful pieces.
"""

# Define the maximum number of tokens per chunk
max_tokens = 50

# Split the text into chunks
chunks = chunk_text(text, max_tokens=max_tokens)

# Display the results
for i, chunk in enumerate(chunks):
    print(f"Chunk {i+1}:")
    print(chunk)
    print("-" * 20)
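The snippet above doesn't show which tokenizer `standard_chunk` uses internally. As a rough illustration of what max-token splitting does, the behavior can be sketched with a simple whitespace tokenizer; the function below is a hypothetical stand-in, not the library's actual implementation:

```python
def chunk_text_sketch(text, max_tokens):
    # Hypothetical sketch: treat whitespace-separated words as "tokens"
    # and group them greedily into chunks of at most max_tokens each.
    # Real tokenizers (e.g. BPE-based ones) will produce different counts.
    words = text.split()
    chunks = []
    for i in range(0, len(words), max_tokens):
        chunks.append(" ".join(words[i:i + max_tokens]))
    return chunks

sample = "one two three four five six seven"
print(chunk_text_sketch(sample, max_tokens=3))
# → ['one two three', 'four five six', 'seven']
```

Every chunk except possibly the last contains exactly max_tokens tokens, and no chunk ever exceeds the limit, which mirrors the guarantee described for chunk_text.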