@@ -0,0 +1,22 @@
### 卢林军

### Internship Project

Building the inference Predictor and IR Passes

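### This Week's Work

1. Studied the Auto-class design in Hugging Face, then designed and implemented the AutoInferenceModel class in PaddleNLP, which loads inference models automatically.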

- https://github.com/PaddlePaddle/PaddleNLP/pull/9416

2. Studied the TRT-LLM inference pipeline and learned the user-facing inference workflow. Made a first attempt at designing AutoPredictor in PaddleNLP.

- https://github.com/PaddlePaddle/PaddleNLP/pull/9445
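
The automatic loading described in item 1 generally follows a registry-and-dispatch pattern: a config names an architecture, and an "Auto" entry point resolves it to a concrete class. The sketch below is only an illustration of that pattern under stated assumptions; every name in it (`AutoInferenceModelSketch`, `register_inference_model`, the `architectures` config key, and the two model classes) is hypothetical and not taken from the PaddleNLP PRs linked above.

```python
from typing import Dict, Type


class BaseInferenceModel:
    """Hypothetical base class; a real model would also load weights."""

    def __init__(self, config: dict):
        self.config = config


# Hypothetical registry mapping an architecture name to an inference class.
_REGISTRY: Dict[str, Type[BaseInferenceModel]] = {}


def register_inference_model(architecture: str):
    """Class decorator that records a class under the given architecture name."""

    def decorator(cls: Type[BaseInferenceModel]) -> Type[BaseInferenceModel]:
        _REGISTRY[architecture] = cls
        return cls

    return decorator


@register_inference_model("LlamaForCausalLM")
class LlamaInferenceModel(BaseInferenceModel):
    pass


@register_inference_model("Qwen2ForCausalLM")
class Qwen2InferenceModel(BaseInferenceModel):
    pass


class AutoInferenceModelSketch:
    """Resolves a config to a registered class, so callers never name it directly."""

    @classmethod
    def from_config(cls, config: dict) -> BaseInferenceModel:
        # Assumption: the config lists candidate architecture names under "architectures".
        for name in config.get("architectures") or []:
            model_cls = _REGISTRY.get(name)
            if model_cls is not None:
                return model_cls(config)
        raise ValueError(f"No inference model registered for config: {config!r}")


model = AutoInferenceModelSketch.from_config({"architectures": ["LlamaForCausalLM"]})
print(type(model).__name__)
```

With this layout, supporting a new architecture only requires registering one more class; the caller-facing entry point stays unchanged.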

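### Next Week's Plan

1. Refine the AutoPredictor design in PaddleNLP and implement it.
2. Using the append_atten integration PR as a reference, work out how to integrate SageAttention into PaddleNLP.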

### Mentor Comments