07-18 Paper-Analysis-Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization
04-29 Paper-Analysis-SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
01-31 Paper-Analysis-TensorGPT: Efficient Compression of the Embedding Layer in LLMs based on the Tensor-Train Decomposition
01-26 Paper-Analysis-LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
01-18 Paper Analysis: ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models