- Add transformers and torch dependencies
- Create bert_router.py to wrap the RouteLLM BERT classifier
- Add select_model_by_bert() to replace token-length routing
- Map BERT output: strong -> qwen-max, weak -> qwen-flash
- Keep token-length routing as a fallback
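
The routing-with-fallback behavior described in this commit can be sketched roughly as follows. This is not the contents of bert_router.py, which are not shown here. The `classify` callable, the `TOKEN_THRESHOLD` value, the whitespace token counter, and the rule that long prompts go to the stronger model are all assumptions made for illustration. Only the strong/weak to qwen-max/qwen-flash mapping and the function name select_model_by_bert() come from the commit message.

```python
from typing import Callable, Optional

# Label-to-model mapping as stated in the commit message.
LABEL_TO_MODEL = {"strong": "qwen-max", "weak": "qwen-flash"}

# Hypothetical threshold; the real value is not given in the commit.
TOKEN_THRESHOLD = 500


def count_tokens(text: str) -> int:
    """Stand-in counter (whitespace split); the project itself lists tiktoken."""
    return len(text.split())


def select_model_by_length(prompt: str, threshold: int = TOKEN_THRESHOLD) -> str:
    """Fallback router. Assumes long prompts should go to the stronger model."""
    return "qwen-max" if count_tokens(prompt) > threshold else "qwen-flash"


def select_model_by_bert(
    prompt: str,
    classify: Optional[Callable[[str], str]] = None,
) -> str:
    """Route by classifier label, falling back to token-length routing.

    `classify` is a hypothetical callable returning "strong" or "weak".
    The fallback is used when it is missing, raises, or returns any
    other value.
    """
    if classify is not None:
        try:
            label = classify(prompt)
        except Exception:
            label = None  # classifier unavailable or failed
        if isinstance(label, str) and label in LABEL_TO_MODEL:
            return LABEL_TO_MODEL[label]
    return select_model_by_length(prompt)
```

Passing the classifier in as a callable keeps the sketch testable without loading torch or downloading a model. In the real module the callable would presumably wrap the RouteLLM BERT classifier.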
12 lines
196 B
Plaintext
fastapi>=0.104.0
uvicorn[standard]>=0.24.0
pydantic>=2.5.0
litellm>=1.0.0
tiktoken>=0.5.0
httpx>=0.25.0
python-dotenv>=1.0.0
transformers>=4.30.0
torch>=2.0.0
pytest>=7.4.0
pytest-asyncio>=0.21.0