llm_engine
llm_engine import atexit from dataclasses import f…
utils/contex
utils/context.py 这段代码定义了一个全局上下文管理器(Global Context …
scheduler
Scheduler Scheduler是一个推理调度器,,其核心功能是协调序列在等待队列(waiti…
block_manager
block_manager.py class Block: def __init__(self, b…
linear
linear.py LinearBase class LinearBase(nn.Module): …
utils/loader
  loader.py import os from glob import glob i…
engine/sequence
  engine/sequence.py 单个请求进来以后被封存成Sequence对象,这…