AIDX-ktds
/

ktdsbaseLM-v0.11-based-on-openchat3.5

Text Generation

text-generation-inference

Model card Files Files and versions Community

AIDXteam commited on Sep 27, 2024

Commit

150fb85

·

verified ·

1 Parent(s): c792e3f

Update README.md

Files changed (1) hide show

README.md +58 -0

README.md CHANGED Viewed

	@@ -58,3 +58,61 @@ pipeline_tag: text-generation
58	</code></pre>
59
60

 </code></pre>
+Here’s the English version of the provided text:
+---
+## ❶ Model Description
+**Model Name and Key Features**:
+KTDSbaseLM v0.11 is based on the OpenChat 3.5 model, fine-tuned using the SFT method on the Mistral 7B model.
+It is designed to understand Korean and various cultural contexts, utilizing data from 135 domains in Korean society.
+The model supports tasks such as text generation, conversation inference, document summarization,
+question answering, sentiment analysis, and other NLP tasks.
+Its applications span fields like law, finance, science, education, business, and cultural research.
+**Model Architecture**:
+KTDSBaseLM v0.11 is a high-performance language model with 7 billion parameters based on the Mistral 7B model.
+It uses OpenChat 3.5 as the foundation and is fine-tuned using SFT to excel in Korean language and culture.
+The streamlined Mistral 7B architecture ensures fast inference and memory efficiency,
+optimized for various NLP tasks like text generation, question answering, document summarization, and sentiment analysis.
+---
+## ❷ Training Data
+KTDSbaseLM v0.11 was trained on 3.6GB of data, comprising 2.33 million Q&A instances.
+This includes 1.33 million multiple-choice questions across 135 domains such as history,
+finance, law, tax, and science, trained with the Chain of Thought method. Additionally,
+1.3 million short-answer questions cover 100 domains including history, finance, and law.
+**Training Instruction Dataset Format**:
+`{"prompt": "prompt text", "completion": "ideal generated text"}`
+---
+## ❸ Use Cases
+KTDSbaseLM v0.11 can be used across multiple fields, such as:
+- **Education**: Answering questions and generating explanations for subjects like history, math, and science.
+- **Business**: Providing responses and summaries for legal, financial, and tax-related queries.
+- **Research and Culture**: Performing NLP tasks, sentiment analysis, document generation, and translation.
+- **Customer Service**: Generating conversations and personalized responses for users.
+This model is highly versatile in various NLP tasks.
+---
+## ❹ Limitations
+KTDSBaseLM v0.11 is specialized in Korean language and culture.
+However, it may lack accuracy in responding to topics outside its scope,
+such as international or specialized data.
+Additionally, it may have limited reasoning ability for complex logical problems and
+may produce biased responses if trained on biased data.
+---
+## ❺ Usage Instructions