Taka2024/gemma-2-27b-dpo-1


Taka2024 committed 30 days ago

Commit b329e2e · verified · 1 Parent(s): cb653d5

Update README.md

Files changed (1)

README.md +6 -6

README.md CHANGED

@@ -12,25 +12,25 @@ base_model:
 - google/gemma-2-27b
 ---
-# Training datasets
 Because this model builds on gemma-2, no datasets with potential license-restriction concerns were used.
-## SFT data
 - [llm-jp/magpie-sft-v1.0](https://huggingface.co/datasets/llm-jp/magpie-sft-v1.0) (apache-2.0)
 - [DeL-TaiseiOzaki/Tengentoppa-sft-qwen2.5-32b-reasoning-100k](https://huggingface.co/datasets/DeL-TaiseiOzaki/Tengentoppa-sft-qwen2.5-32b-reasoning-100k) (apache-2.0)
 - [weblab-GENIAC/Open-Platypus-Japanese-masked](https://huggingface.co/datasets/weblab-GENIAC/Open-Platypus-Japanese-masked) (MIT)
   - Only the MIT-licensed data was extracted and used.
-## DPO data
 - [weblab-GENIAC/aya-ja-nemotron-dpo-masked](https://huggingface.co/datasets/weblab-GENIAC/aya-ja-nemotron-dpo-masked) (apache-2.0)
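-# Model creation procedure
 - Train a LoRA adapter on the base model (google/gemma-2-27b) using a sampled subset of the SFT data (Taka2024/gemma-2-27b-it-2_lora).
 - Merge the base model with the LoRA adapter (Taka2024/gemma-2-27b-it-2_lora_merged).
 - Train a DPO adapter on the merged model using a sampled subset of the DPO data (Taka2024/gemma-2-27b-dpo-1).
-# Inference procedure
 Based on the unsloth sample code (run on a Google Colab L4 GPU); inference is set up to finish within one hour.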
 ```
@@ -126,7 +126,7 @@ with open(f"/content/{json_file_id}_output_IF.jsonl", 'w', encoding='utf-8') as
 ```
-# Uploaded  model
 - **Developed by:** Taka2024
 - **License:** gemma

 - google/gemma-2-27b
 ---
+## Training datasets
 Because this model builds on gemma-2, no datasets with potential license-restriction concerns were used.
+### SFT data
 - [llm-jp/magpie-sft-v1.0](https://huggingface.co/datasets/llm-jp/magpie-sft-v1.0) (apache-2.0)
 - [DeL-TaiseiOzaki/Tengentoppa-sft-qwen2.5-32b-reasoning-100k](https://huggingface.co/datasets/DeL-TaiseiOzaki/Tengentoppa-sft-qwen2.5-32b-reasoning-100k) (apache-2.0)
 - [weblab-GENIAC/Open-Platypus-Japanese-masked](https://huggingface.co/datasets/weblab-GENIAC/Open-Platypus-Japanese-masked) (MIT)
   - Only the MIT-licensed data was extracted and used.
+### DPO data
 - [weblab-GENIAC/aya-ja-nemotron-dpo-masked](https://huggingface.co/datasets/weblab-GENIAC/aya-ja-nemotron-dpo-masked) (apache-2.0)
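+## Model creation procedure
 - Train a LoRA adapter on the base model (google/gemma-2-27b) using a sampled subset of the SFT data (Taka2024/gemma-2-27b-it-2_lora).
 - Merge the base model with the LoRA adapter (Taka2024/gemma-2-27b-it-2_lora_merged).
 - Train a DPO adapter on the merged model using a sampled subset of the DPO data (Taka2024/gemma-2-27b-dpo-1).
+## Inference procedure
 Based on the unsloth sample code (run on a Google Colab L4 GPU); inference is set up to finish within one hour.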
 ```
 ```
+## Uploaded  model
 - **Developed by:** Taka2024
 - **License:** gemma
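
The model creation procedure in the README names two operations without showing them: merging a LoRA adapter into the base weights, and training with DPO. The following is a minimal pure-Python sketch of the arithmetic behind each, not the author's training code. It assumes the standard LoRA scaling `alpha / r` and the standard DPO objective. The function names `merge_lora` and `dpo_loss` and the default `beta=0.1` are hypothetical and chosen for illustration only.

```python
import math


def merge_lora(W, A, B, alpha, r):
    """Return W + (alpha / r) * (B @ A) using plain nested lists.

    Shapes (illustrative): W is d_out x d_in, B is d_out x r, A is r x d_in.
    The input W is not modified.
    """
    scale = alpha / r
    merged = [row[:] for row in W]  # copy so the base weights stay untouched
    for i in range(len(W)):
        for j in range(len(W[0])):
            delta = sum(B[i][k] * A[k][j] for k in range(r))
            merged[i][j] += scale * delta
    return merged


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    margin compares how much more the policy prefers the chosen response
    over the rejected one, relative to the frozen reference model.
    """
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    x = beta * margin
    # Numerically stable form of log(1 + exp(-x)).
    return max(-x, 0.0) + math.log1p(math.exp(-abs(x)))
```

In this sketch the reference model corresponds to the merged SFT model from the second step, and the policy is that same model with the DPO adapter applied. When policy and reference agree, the margin is zero and the loss is log 2; it falls as the policy favors the chosen response more strongly than the reference does.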