DeepSeek has unveiled DSpark, a speculative decoding framework, that can speed up AI response generation (or inference) by up to 85% and reduce the need for advanced chips. It uses a draft model that quickly suggests answers and a main model that checks them. The Chinese AI startup has open-sourced DSpark implementation, a joint effort with Peking University, via GitHub.