videoChatWithLLM

Video chat with an LLM

Live video is captured from the camera. When the user speaks, the last frame is grabbed and combined with the user's words into a prompt, which is then sent to the large model.

Model used: internlm-xcomposer2-vl-7b
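
The capture-and-prompt flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `LastFrameBuffer`, `MultimodalPrompt`, and `build_prompt` are hypothetical names, and the frames are stand-in byte strings. In a real setup the frames would presumably come from a camera library and the text from speech recognition, and the resulting prompt would be passed to the model.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, Optional


@dataclass
class MultimodalPrompt:
    """Hypothetical container pairing one image with the user's words."""
    image: bytes
    text: str


class LastFrameBuffer:
    """Keeps only the most recent frame from a continuous stream."""

    def __init__(self) -> None:
        # maxlen=1 means every push discards the previous frame.
        self._frames: Deque[bytes] = deque(maxlen=1)

    def push(self, frame: bytes) -> None:
        self._frames.append(frame)

    def latest(self) -> Optional[bytes]:
        return self._frames[-1] if self._frames else None


def build_prompt(buffer: LastFrameBuffer, transcript: str) -> MultimodalPrompt:
    """Combine the last captured frame with what the user said."""
    frame = buffer.latest()
    if frame is None:
        raise ValueError("no frame captured yet")
    return MultimodalPrompt(image=frame, text=transcript.strip())


# Stand-in frames (assumption: real frames would be encoded camera images).
buffer = LastFrameBuffer()
for frame in (b"frame-1", b"frame-2", b"frame-3"):
    buffer.push(frame)

# Called when the user finishes speaking; only the last frame is used.
prompt = build_prompt(buffer, "  What am I holding?  ")
print(prompt.text)
```

Keeping a single-slot buffer means the capture loop can run continuously without accumulating frames, and the prompt always reflects what the camera saw at the moment the user spoke.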

Todo

· Optimize the video chat pipeline

· Add memory

· Implement facial expressions to give the LLM a visible persona

· Fine-tune the model to reduce its power consumption and speed up generation

· If all else fails, train a new video generation model (Sora looks promising)

Test

Test video


otoTree/videoChatWithLLM

Folders and files

Latest commit

History

Repository files navigation

videoChatWithLLM

视频对话与llm

Todo

Todo

测试/test

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages