Skip to content

Pull requests: xlite-dev/Awesome-LLM-Inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update README.md - add KVTC
#156 by CStanKonrad was merged Nov 11, 2025 Loading…
Add siiRL
#155 by liao1995 was merged Jul 29, 2025 Loading…
add two papers
#154 by JoursBleu was merged Jul 10, 2025 Loading…
Add SDMPrune paper
#153 by sccbhxc was merged Jun 30, 2025 Loading…
Add Inference-Time Hyper-Scaling
#152 by CStanKonrad was merged Jun 16, 2025 Loading…
Add STAND
#151 by woominsong was merged Jun 6, 2025 Loading…
Add a new paper (GuidedQuant)
#150 by jusjinuk was merged Jun 6, 2025 Loading…
Update new paper (KVzip)
#149 by Janghyun1230 was merged Jun 5, 2025 Loading…
Add 4 papers
#148 by woominsong was merged Jun 4, 2025 Loading…
Update Multi-GPUs/Multi-Nodes Parallelism
#141 by DefTruth was merged Apr 26, 2025 Loading…
Add SeerAttention and SlimAttention Paper
#135 by sunshinemyson was merged Apr 16, 2025 Loading…
Update Mooncake-v3 paper link
#130 by DefTruth was merged Mar 30, 2025 Loading…
Update README.md
#129 by DefTruth was merged Mar 30, 2025 Loading…
Add download_pdfs.py
#128 by DefTruth was merged Mar 30, 2025 Loading…
ProTip! no:milestone will show everything without a milestone.