Running CodeLlama-7b-Instruct-hf with FastChat

1. Confirm the model is supported by FastChat

Check model_support.md and confirm that codellama/CodeLlama-7b-Instruct-hf appears in the list of supported models.


2. Upgrade dependencies

pip install -e ".[model_worker,webui]"
pip install git+https://github.com/huggingface/transformers.git@main accelerate

3. Start the controller

python -m fastchat.serve.controller

4. Start the CodeLlama model worker

The --model-names flag registers the aliases under which this worker is exposed; clients must address the model by one of these names. Registering OpenAI-style names (gpt-3.5-turbo, gpt-4, etc.) lets tools that expect the OpenAI API talk to this local model without changes.

python -m fastchat.serve.model_worker --model-names "codellama-34b-instruct,gpt-3.5-turbo,gpt-3.5-turbo-16k,gpt-4,gpt-4-32k,text-davinci-003" --model-path codellama/CodeLlama-7b-Instruct-hf

5. Start the OpenAI-compatible API server

python -m fastchat.serve.openai_api_server --host 0.0.0.0 --port 8000
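With the controller, model worker, and API server all running, the stack can be exercised with a short script against the OpenAI-compatible endpoint. The following is a minimal sketch using only the Python standard library; the model alias "gpt-3.5-turbo" is assumed to be one of the names registered via --model-names in step 4.

```python
import json
import urllib.request

# The FastChat OpenAI-compatible server started in step 5
API_BASE = "http://localhost:8000/v1"


def build_chat_request(prompt, model="gpt-3.5-turbo"):
    # Request body in the OpenAI chat-completions format; the model name
    # must be one of the aliases passed to --model-names in step 4.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def ask(prompt, model="gpt-3.5-turbo", api_base=API_BASE):
    # POST the request and return the assistant's reply text.
    req = urllib.request.Request(
        f"{api_base}/chat/completions",
        data=json.dumps(build_chat_request(prompt, model)).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer EMPTY",  # FastChat ignores the key by default
        },
    )
    with urllib.request.urlopen(req, timeout=120) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Calling `ask("Write a Python function that reverses a string.")` should return a completion generated by the local CodeLlama worker.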

6. Use CodeLlama in VS Code

Reference links:

  • https://continue.dev/docs/walkthroughs/codellama
  • https://continue.dev/docs/customization#local-models-with-openai-compatible-server

Configure the Continue plugin's config as follows:

from continuedev.src.continuedev.libs.llm.openai import OpenAI
...
config = ContinueConfig(
    ...
    models=Models(default=OpenAI(
        api_key="EMPTY",
        model="CodeLlama-7b-Instruct-hf",
        api_base="http://localhost:8000/v1")
    ),
)
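Before pointing Continue at the server, it can help to confirm which model IDs the API server actually exposes; GET /v1/models is part of the OpenAI-compatible API that FastChat implements, and the model name in the Continue config must match one of the returned IDs. A small stdlib-only sketch:

```python
import json
import urllib.request

# The FastChat OpenAI-compatible server started in step 5
API_BASE = "http://localhost:8000/v1"


def extract_model_ids(models_payload):
    # Payload shape of GET /v1/models:
    # {"object": "list", "data": [{"id": "..."}, ...]}
    return [m["id"] for m in models_payload["data"]]


def list_model_ids(api_base=API_BASE):
    # Fetch the model list from the running API server.
    with urllib.request.urlopen(f"{api_base}/models", timeout=30) as resp:
        return extract_model_ids(json.load(resp))
```

If the name used in the Continue config does not appear in this list, requests will be rejected; in that case use one of the aliases passed to --model-names in step 4 instead.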

Done!


Reposted from blog.csdn.net/engchina/article/details/132661826