What is semantic chunking?
Semantic chunking is the process of splitting content at boundaries of meaning and structure instead of at fixed token counts.
A semantic chunk should preserve a coherent unit of information.
Such a unit can be:
- a section under one heading
- a paragraph group
- a code example with explanation
- a table and its surrounding context
- a list with its introduction
- an API endpoint's documentation and its response shape
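
As a rough illustration of the first case in the list (a section under one heading), here is a minimal sketch of heading-based chunking. It assumes the input is Markdown-style text whose sections begin with `#` headings. The function name `chunk_by_heading` is hypothetical and not taken from any library. A real pipeline would likely also handle tables, oversized sections, and other formats.

```python
import re

# Assumption: sections start with Markdown-style "#" headings (levels 1-6).
HEADING = re.compile(r"^#{1,6}\s+\S")
FENCE = "`" * 3  # code-fence marker, built this way to avoid a literal fence here


def chunk_by_heading(text: str) -> list[str]:
    """Split text into chunks, each holding one heading and the content under it."""
    chunks: list[str] = []
    current: list[str] = []
    in_fence = False
    for line in text.splitlines():
        # Track fenced code so a "#" comment inside code is not treated as a heading.
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
        # A heading outside a code fence closes the chunk collected so far.
        if HEADING.match(line) and not in_fence and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    # Drop chunks that ended up empty (for example, leading blank lines).
    return [c for c in chunks if c]


doc = "# Intro\nSome text.\n\n## Usage\n- step one\n- step two\n"
for chunk in chunk_by_heading(doc):
    print(repr(chunk))
```

Each chunk keeps its heading together with the paragraphs, lists, and code that follow it, so a list is never separated from its introduction and a code example stays with its explanation, as long as they sit under the same heading.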