Skip to content

Pull requests: ModelTC/LightLLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Cache aware for pd
#1379 opened Jul 2, 2026 by kingder Collaborator Loading…
feat(visual): reserve ViT worst-case activation memory
#1378 opened Jul 2, 2026 by sufubao Collaborator Loading…
fix: correct fialed->failed typo in Exception message
#1376 opened Jun 30, 2026 by Jah-yee Loading…
static cost
#1374 opened Jun 30, 2026 by shihaobai Collaborator Loading…
feat: opt autotuner
#1373 opened Jun 30, 2026 by blueswhen Collaborator Loading…
Support GLM-5.2 (glm_moe_dsa, DeepSeek-V3.2-style DSA MoE)
#1370 opened Jun 25, 2026 by sufubao Collaborator Loading…
feat: fuse add and rmsnorm
#1368 opened Jun 24, 2026 by blueswhen Collaborator Loading…
feat(quant): support tensorwise fp8 w8a8 (--quant_type fp8w8a8-pt)
#1366 opened Jun 22, 2026 by sufubao Collaborator Loading…
fix stream fc for qwen3_coder
#1364 opened Jun 18, 2026 by shihaobai Collaborator Loading…
Support default chat template kwargs
#1363 opened Jun 18, 2026 by sufubao Collaborator Loading…
support ds4
#1355 opened Jun 15, 2026 by WANDY666 Contributor Loading…
feat: add gguf support
#1354 opened Jun 15, 2026 by zhangts20 Collaborator Loading…
feat(qwen3_5_mtp): Qwen3.5 / Qwen3.5-MoE MTP speculative decoding
#1338 opened Jun 9, 2026 by sufubao Collaborator Loading…
feat: add multi-platform support with ascend and maca
#1335 opened Jun 8, 2026 by zhangts20 Collaborator Loading…
feat: update disk cache params and benchmark_multiturn.py
#1333 opened Jun 8, 2026 by blueswhen Collaborator Loading…
Fa4 support
#1327 opened Jun 2, 2026 by blueswhen Collaborator Loading…
add in-process URL pool caching
#1325 opened Jun 1, 2026 by Owleye4 Contributor Loading…
update cpu cache load use async way.
#1318 opened May 25, 2026 by hiworldwzj Collaborator Loading…
support mtp for gemma4
#1316 opened May 22, 2026 by WANDY666 Contributor Loading…
feat(RL): add RL support for verl
#1298 opened May 8, 2026 by shihaobai Collaborator Loading…
import flashqla to speedup gdn prefill
#1295 opened May 8, 2026 by WANDY666 Contributor Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.