DeL-TaiseiOzaki
/

Tengentoppa-llm-jp-13B-reasoning-it-adapter

+---
+license: apache-2.0
+language:
+- ja
+- en
+base_model:
+- llm-jp/llm-jp-3-13b
+pipeline_tag: text-generation
+library_name: transformers
+datasets:
+- DeL-TaiseiOzaki/Tengentoppa-sft-elyza-news-specialized
+- DeL-TaiseiOzaki/Tengentoppa-sft-reasoning-v2.0
+- DeL-TaiseiOzaki/Tengentoppa-sft-base-v1.0
+---
+# Enhanced LLM-JP Model with Extended Tokenizer and Chat Template
+This is an enhanced version of [llm-jp-13B](https://huggingface.co/llm-jp-13B) with an extended tokenizer that includes additional special tokens for structured conversations and advanced prompting.
+![image/jpg](tengentoppa.jpg)
+## Model Information
+- Base Model: [llm-jp-13B](https://huggingface.co/llm-jp-13B)
+- Added Features: Extended tokenizer with special tokens for structured conversations and chat template
+- Vocabulary Size: Extended from the base model
+## Special Tokens
+### Basic Tokens
+- UNK Token: `{token_config.unk_token}`
+- BOS Token: `{token_config.bos_token}`
+- EOS Token: `{token_config.eos_token}`
+- PAD Token: `{token_config.pad_token}`
+- CLS Token: `{token_config.cls_token}`
+- SEP Token: `{token_config.sep_token}`
+- MASK Token: `{token_config.mask_token}`
+### Conversation Structure Tokens
+- System: `{token_config.system_token}` and `{token_config.system_end_token}`
+- User: `{token_config.user_token}` and `{token_config.user_end_token}`
+- Assistant: `{token_config.assistant_token}` and `{token_config.assistant_end_token}`
+### Reasoning Process Tokens
+- Reasoning: `{token_config.reasoning_token}` and `{token_config.reasoning_end_token}`
+- Solution: `{token_config.solution_token}` and `{token_config.solution_end_token}`
+- Response: `{token_config.response_token}` and `{token_config.response_end_token}`
+### Hint and Supplementary Information Tokens
+- Hint: `{token_config.hint_token}` and `{token_config.hint_end_token}`
+- Note: `{token_config.note_token}` and `{token_config.note_end_token}`
+- Context: `{token_config.context_token}` and `{token_config.context_end_token}`
+- Reference: `{token_config.reference_token}` and `{token_config.reference_end_token}`
+- Example: `{token_config.example_token}` and `{token_config.example_end_token}`
+### Control Tokens
+- Important: `{token_config.important_token}` and `{token_config.important_end_token}`
+- Warning: `{token_config.warning_token}` and `{token_config.warning_end_token}`
+- Error: `{token_config.error_token}` and `{token_config.error_end_token}`
+## Chat Template Usage
+このモデルは以下の役割（roles）をサポートしています：
+- system: システムプロンプト用
+- user: ユーザーの入力用
+- hint: ヒントやガイダンス用
+- reasoning: 推論プロセス用
+- assistant: アシスタントの応答用
+### Basic Usage:
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("{model_name}")
+tokenizer = AutoTokenizer.from_pretrained("{model_name}")
+# チャット形式での使用例
+messages = [
+    {
+        "role": "system",
+        "content": "あなたは親切で有能なAIアシスタントです。"
+    },
+    {
+        "role": "user",
+        "content": "次の数学の問題を解いてください：2x + 3 = 7"
+    },
+    {
+        "role": "hint",
+        "content": "方程式を解くときは、まず両辺から数を移項することを考えてみましょう。"
+    },
+    {
+        "role": "reasoning",
+        "content": "この方程式を解くために以下のステップで考えます：\\n1. 3を両辺から引く\\n2. 両辺を2で割る"
+    },
+    {
+        "role": "assistant",
+        "content": "x = 2 が方程式の解です。"
+    }
+]
+# チャットテンプレートを使用してメッセージを整形
+prompt = tokenizer.apply_chat_template(messages, tokenize=False)
+print("\\nGenerated prompt:\\n", prompt)
+# トークン化と推論
+inputs = tokenizer(prompt, return_tensors="pt", max_length=2048, truncation=True)
+outputs = model.generate(**inputs, max_length=2048, temperature=0.7)
+response = tokenizer.decode(outputs[0])
+print("\\nModel response:\\n", response)
+```
+### Advanced Usage:
+# カスタムシステムメッセージを使用
+messages = [
+    {
+        "role": "system",
+        "content": "あなたは数学の専門家です。"
+    },
+    {
+        "role": "user",
+        "content": "二次方程式 x² - 4x + 4 = 0 を解いてください。"
+    }
+]
+# 生成プロンプトを追加せずにテンプレートを適用
+prompt = tokenizer.apply_chat_template(
+    messages,
+    tokenize=False,
+    add_generation_prompt=False
+)
+# 手動でヒントを追加
+prompt += "\\n<|HINT|>因数分解を使うと簡単に解けるかもしれません。</|HINT|>"
+# 手動で推論プロセスを追加
+prompt += "\\n<|REASONING|>1. この式は(x-2)²の形に似ています\\n2. 実際に展開すると同じ式になります</|REASONING|>"
+# アシスタントの応答用の���ロンプトを追加
+prompt += "\\n<|ASSISTANT|>"
+# 以降は通常通り処理
+inputs = tokenizer(prompt, return_tensors="pt", max_length=2048, truncation=True)
+```
+## Chat Template Specification
+モデルのチャットテンプレートは以下の要素を含みます：
+- 5つの異なるロール（system, user, hint, reasoning, assistant）
+- 各ロールに対応する特殊トークン
+- デフォルトのシステムメッセージ
+- 柔軟なテンプレート構造
+特徴：
+- メッセージの順序は保持されます
+- 各ロールは明確に区別されます
+- システムメッセージは任意です
+- ヒントと推論は必要に応じて追加できます
+## Additional Notes
+### トークナイザーの拡張について
+- 元のトークナイザーの全機能を保持
+- 新しい特殊トークンの追加による機能拡張
+- チャットテンプレートによる構造化された会話のサポート
+### 使用上の注意
+- 特殊トークンは必要な場合にのみ使用してください
+- 特殊トークのスキップをTrueにしないでください．
+- チャットテンプレートは柔軟に調整可能です
+- システムメッセージは対話の文脈に応じてカスタマイズできます