Perplexity open-sources Unigram tokeniser to cut CPU usage by 5-6x
Short by Jessica Rajan on Friday, 29 May 2026
Perplexity has open-sourced a rebuilt Unigram tokeniser designed to cut CPU usage by a factor of 5-6 and improve inference efficiency for smaller AI models. The tool targets the 250,000-token vocabulary of XLM-RoBERTa, a model widely used in ranking and retrieval tasks. It produces the same output as the reference implementation while reducing processing overhead by avoiding costly string rebuilding and hash maps.
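For readers unfamiliar with Unigram tokenisation, here is a minimal sketch of the textbook approach: score every way of splitting a string into vocabulary pieces and keep the highest-scoring split, found with a Viterbi-style pass. This is not Perplexity's code. The vocabulary, its scores, and the names `TOY_VOCAB` and `unigram_segment` are made up for illustration, and the sketch deliberately uses the substring slices and dictionary lookups that the rebuilt tokeniser is described as avoiding.

```python
import math

# Hypothetical toy vocabulary: piece -> log-probability (values are made up).
TOY_VOCAB = {
    "uni": math.log(0.07),
    "un": math.log(0.10),
    "gram": math.log(0.08),
    "i": math.log(0.05),
    "u": math.log(0.02),
    "n": math.log(0.02),
    "g": math.log(0.02),
    "r": math.log(0.02),
    "a": math.log(0.03),
    "m": math.log(0.02),
}


def unigram_segment(text, vocab, max_piece_len=8):
    """Return the highest-scoring split of `text` into pieces from `vocab`."""
    n = len(text)
    # best_score[k] is the best total log-probability for text[:k].
    best_score = [-math.inf] * (n + 1)
    # best_start[k] is where the last piece of that best split begins.
    best_start = [-1] * (n + 1)
    best_score[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_piece_len), end):
            if best_score[start] == -math.inf:
                continue  # text[:start] cannot be segmented at all
            # Naive step: build a substring and look it up in a hash map.
            score = vocab.get(text[start:end])
            if score is None:
                continue
            candidate = best_score[start] + score
            if candidate > best_score[end]:
                best_score[end] = candidate
                best_start[end] = start

    if best_score[n] == -math.inf:
        raise ValueError("text cannot be segmented with this vocabulary")

    # Walk the back-pointers from the end to recover the pieces.
    pieces = []
    end = n
    while end > 0:
        start = best_start[end]
        pieces.append(text[start:end])
        end = start
    pieces.reverse()
    return pieces


print(unigram_segment("unigram", TOY_VOCAB))
```

With these made-up scores, the split "uni" + "gram" beats "un" + "i" + "gram" because its combined probability is higher. A production tokeniser would also need a fallback for unknown characters, which this sketch leaves out.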
read more at Perplexity