goodmodeler commited on
Commit
07a7833
·
1 Parent(s): ee07cd3

organize files

Browse files
README.md CHANGED
@@ -60,11 +60,22 @@ python sft_train.py
60
  # 3 Build RAG index
61
  python build_embeddings.py
62
 
63
- # 4 (可选) 收集偏好 → 训练 reward model
64
  python reward_model.py
65
 
66
- # 5 PPO RLHF 微调
67
  python ppo_tune.py
68
 
69
  # 6 Inference with RAG
70
- python rag_infer.py
 
 
 
 
 
 
 
 
 
 
 
 
60
  # 3 Build RAG index
61
  python build_embeddings.py
62
 
63
+ # 4 (可选) 收集偏好 → 训练 reward model
64
  python reward_model.py
65
 
66
+ # 5 PPO RLHF 微调LLM
67
  python ppo_tune.py
68
 
69
  # 6 Inference with RAG
70
+ python rag_infer.py
71
+
72
+
73
+
74
+ system flow:
75
+ input: business or product description text
76
+ 1. 根据input用RAG取embedding
77
+ 1. GPT‑OSS 生成 4 个广告文案 + 标题 + 口号(可选语气:专业/活泼/极简)
78
+ 2. GPT‑OSS 基于选中文案生成 扩展视觉提示词(主体、配色、镜头、艺术风格)
79
+ 3. stablediffusion model 生成 4 张草图(可选 ControlNet-Layout/Logo 插入)
80
+ 4. 返回4张海报+后处理
81
+ output: an advertisement sentence and post image
download.py → data_loader/download.py RENAMED
File without changes
download_dataset.py → data_loader/download_dataset.py RENAMED
File without changes
ppo_tune.py → fine_tune_llm/ppo_tune_llm.py RENAMED
File without changes
reward_model.py → fine_tune_llm/reward_model.py RENAMED
File without changes
sft_train.py → fine_tune_llm/sft_llm_train.py RENAMED
File without changes
train_lora.py → fine_tune_stablediffusion/train_lora.py RENAMED
File without changes
build_embeddings.py → retrieval_augmented_generation/build_embeddings.py RENAMED
File without changes
train_model.py → train_model_test.py RENAMED
File without changes