01-31 Paper-Analysis-TensorGPT: Efficient Compression of the Embedding Layer in LLMs based on the Tensor-Train Decomposition
01-26 Paper-Analysis-LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
01-18 Paper Analysis: ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models