Nvidia and Mistral AI’s super-accurate small language model works on laptops and PCs

Nvidia and Mistral AI’s super-accurate small language model works on laptops and PCs

August 22, 2024



Nvidia and Mistral AI have released a new small language model that purportedly features “state-of-the-art” accuracy in a tiny footprint. The new LM is known as the Mistral-NemMo-Minitron 8B, a miniaturized version of NeMo 12B that has been pruned from 12 billion to 8 billion parameters.

The new 8 billion-parameter small language model was shrunken down through two different AI optimization methods, said Bryan Catanzaro, VP of deep learning research at Nvidia, in a blog post. The team behind the new LM used a process that combines pruning and distillation. “Pruning downsizes a neural network by removing model weights that contribute the least to accuracy. During distillation, the team retrained this pruned model on a small dataset to significantly boost accuracy, which had decreased through the pruning process.”



Source link

You May Also Like…

0 Comments