Created by pruning the MLP (feedforward) layers of Llama models, reducing their size while improving performance. A hedged sketch of what such structured MLP pruning can look like follows below.
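
The snippet below is a minimal illustration of one way to prune MLP intermediate channels in a Llama-style model, not the exact method used here. It assumes the Hugging Face `transformers` layer names (`gate_proj`, `up_proj`, `down_proj`) and uses a simple L2-norm importance score, both of which are assumptions for illustration only; it runs on a tiny randomly initialized config so no weights need to be downloaded.

```python
# Sketch: structured pruning of the MLP intermediate dimension in a
# Llama-style model. The importance heuristic (L2 norm of up_proj rows)
# is an illustrative assumption, not the method from this repository.
import torch
from transformers import LlamaConfig, LlamaForCausalLM


def prune_mlp(model, keep_ratio=0.5):
    """Shrink each layer's MLP intermediate dimension to keep_ratio of its size."""
    keep = model.config.intermediate_size
    for layer in model.model.layers:
        mlp = layer.mlp
        inter = mlp.gate_proj.out_features
        keep = max(1, int(inter * keep_ratio))
        # Score each intermediate channel by the L2 norm of its up_proj row
        # (one simple heuristic among many possible importance measures).
        scores = mlp.up_proj.weight.norm(dim=1)
        idx = torch.topk(scores, keep).indices.sort().values
        # Slice all three projections along the intermediate dimension.
        mlp.gate_proj.weight = torch.nn.Parameter(mlp.gate_proj.weight[idx].clone())
        mlp.up_proj.weight = torch.nn.Parameter(mlp.up_proj.weight[idx].clone())
        mlp.down_proj.weight = torch.nn.Parameter(mlp.down_proj.weight[:, idx].clone())
        mlp.gate_proj.out_features = keep
        mlp.up_proj.out_features = keep
        mlp.down_proj.in_features = keep
    model.config.intermediate_size = keep
    return model


# Tiny random config so the sketch runs without downloading real weights.
cfg = LlamaConfig(hidden_size=64, intermediate_size=256,
                  num_hidden_layers=2, num_attention_heads=4, vocab_size=1000)
model = LlamaForCausalLM(cfg)
model = prune_mlp(model, keep_ratio=0.5)
out = model(torch.randint(0, 1000, (1, 8)))  # forward pass still works after pruning
print(out.logits.shape)
```

In practice a pruned model like this is usually fine-tuned (or distilled) afterward to recover quality, since removing channels outright degrades the raw checkpoint.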