Resoureces:

Pruning

unstructured pruning

https://arxiv.org/pdf/1506.02626.pdf

set zero weights in a weight matrix → increase sparsity in architecture

structured pruning

Quantization

Low-rank factorization

Knowledge distillation