Abstract: The deployment of large language models (LLMs) on edge hardware presents multilayered challenges, including compression, compiler behavior, and system-level trade-offs. This survey provides ...