kalisai
/

Nusantara-4b-Indo-Chat

@@ -23,33 +23,57 @@ language:
-## Model Details
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
-## Uses
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### Direct Use
@@ -182,9 +206,18 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 [More Information Needed]
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
 **BibTeX:**

 ### Model Description
+Nusantara is a series of Open Weight Language Model of Indonesia Language (Bahasa Indonesia). Nusantara is based from Qwen1.5 Language Model, finetuned by domain specific of datasets.
+As Chat-implemented language model, Nusantara is capable to do Question-Answering and respond to instructions given in Bahasa Indonesia.
+- **Developed by:** Kalis AI /
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
+- **Finetuned from model:** Qwen1.5-4B
+## Quickstart
+Here provides a code snippet with `apply_chat_template` to show you how to load the tokenizer and model and how to generate contents.
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+device = "cuda" # the device to load the model onto
+model = AutoModelForCausalLM.from_pretrained(
+ "Qwen/Qwen1.5-72B-Chat",
+ torch_dtype="auto",
+ device_map="auto"
+)
+tokenizer = AutoTokenizer.from_pretrained("kalisai/Nusantara-4B-Indo-Chat")
+prompt = "Berikan saya resep memasak nasi goreng yang lezat."
+messages = [
+ {"role": "system", "content": "Kamu adalah Nusantara, asisten AI yang pintar."},
+ {"role": "user", "content": prompt}
+]
+text = tokenizer.apply_chat_template(
+ messages,
+ tokenize=False,
+ add_generation_prompt=True
+)
+model_inputs = tokenizer([text], return_tensors="pt").to(device)
+generated_ids = model.generate(
+ model_inputs.input_ids,
+ max_new_tokens=512
+)
+generated_ids = [
+ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
+]
+response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
+```
 ### Direct Use
 [More Information Needed]
+## Citation
+If you use the Nusantara language model in your research or project, please cite it as:
+```
+@article{Nusantara,
+ title={Nusantara: A Series of Language Model in Bahasa Indonesia},
+ author={Zulfikar Aji Kusworo},
+ publisher={Hugging Face}
+ journal={Hugging Face Repository},
+ year={2024}
+}
+```
 **BibTeX:**