Tokenization
Breaking text into small chunks (tokens) that an AI model can read and process.
In Plain English
Tokenization is how AI language models prepare text for processing. Before an AI can understand or respond to your question, it breaks your words into bite-sized pieces called tokens: whole words, parts of words, or punctuation marks. Think of it like a scanner at a grocery store reading barcodes instead of looking at entire products. Different AI models use different tokenization rules, which is why the same sentence might break into different-sized chunks depending on which AI system reads it. This step matters because it determines how the AI "sees" your input and directly affects how well it understands and responds to you.
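To see this in action, here is a minimal sketch using the open-source tiktoken library (our choice for illustration; the explanation above doesn't name a specific tokenizer). It runs the same sentence through two different encodings and prints how each one splits it:

```python
# A minimal sketch, assuming the tiktoken library is installed
# (pip install tiktoken). "gpt2" and "cl100k_base" are two real
# tiktoken encodings; the sample sentence is illustrative.
import tiktoken

sentence = "Tokenization splits text into chunks."

# Two different tokenization rule sets split the same sentence differently.
for name in ("gpt2", "cl100k_base"):
    enc = tiktoken.get_encoding(name)
    ids = enc.encode(sentence)                 # token IDs the model would see
    pieces = [enc.decode([i]) for i in ids]    # readable text of each token
    print(f"{name}: {len(ids)} tokens -> {pieces}")
```

Running this typically shows the two encodings producing different piece boundaries, and often different token counts, for identical input.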
💡 Real-World Example
When you type "I can't wait for coffee!" into an AI chatbot, tokenization breaks it into pieces the model can process, perhaps "I", " can", "'t", " wait", " for", " coffee", "!" (leading spaces often attach to the token that follows). The AI counts these tokens because every model has a context window, a hard limit on how many tokens it can handle in one conversation; a complex request might use hundreds of tokens, while a simple one uses just a handful.
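A short, hedged sketch of that counting step, again assuming tiktoken as the tokenizer; the exact pieces for this sentence depend on which model's encoding is used:

```python
# Counting tokens before sending a message, using tiktoken's
# cl100k_base encoding as an example; real splits vary by model.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
message = "I can't wait for coffee!"
ids = enc.encode(message)

print([enc.decode([i]) for i in ids])  # the pieces the model actually "sees"
print(f"{len(ids)} tokens used")       # counted against the context-window limit
```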
