daytoy-models commited on
Commit
b9b8cac
·
1 Parent(s): 3fd64f2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +184 -14
README.md CHANGED
@@ -1,15 +1,185 @@
1
  ---
2
- license: ecl-2.0
3
- datasets:
4
- - lmsys/lmsys-chat-1m
5
- - fka/awesome-chatgpt-prompts
6
- language:
7
- - en
8
- metrics:
9
- - accuracy
10
- - bleurt
11
- library_name: fairseq
12
- pipeline_tag: tabular-classification
13
- tags:
14
- - code
15
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: llama2
3
+ base_model: codellama/CodeLlama-13b-Instruct-hf
4
+ model-index:
5
+ - name: NexusRaven-13B
6
+ results: []
7
+ ---
8
+ # NexusRaven-13B: Surpassing the state-of-the-art in open-source function calling LLMs.
9
+
10
+ <p align="center">
11
+ <a href="https://huggingface.co/Nexusflow" target="_blank">Nexusflow HF</a> - <a href="http://nexusflow.ai/blog" target="_blank">NexusRaven blog post</a> - <a href="https://huggingface.co/Nexusflow/NexusRaven-13B" target="_blank">NexusRaven-13B</a> - <a href="https://x.com/NexusflowX/status/1707470614012035561?s=20" target="_blank">NexusRaven-13B Twitter Thread</a> - <a href="https://github.com/nexusflowai/NexusRaven/" target="_blank">NexusRaven-13B Github</a> - <a href="https://huggingface.co/datasets/Nexusflow/NexusRaven_API_evaluation" target="_blank">NexusRaven API evaluation dataset</a>
12
+ </p>
13
+
14
+ <p align="center" width="100%">
15
+ <a><img src="NexusRaven.png" alt="NexusRaven" style="width: 40%; min-width: 300px; display: block; margin: auto;"></a>
16
+ </p>
17
+
18
+ Table of contents
19
+ - [NexusRaven-13B: Surpassing the state-of-the-art in open-source function calling LLMs.](#nexusraven-13b-surpassing-the-state-of-the-art-in-open-source-function-calling-llms)
20
+ - [Introducing NexusRaven-13B](#introducing-nexusraven-13b)
21
+ - [NexusRaven model usage](#nexusraven-model-usage)
22
+ - [Training procedure](#training-procedure)
23
+ - [Training hyperparameters](#training-hyperparameters)
24
+ - [Framework versions](#framework-versions)
25
+ - [Limitations](#limitations)
26
+ - [License](#license)
27
+ - [References](#references)
28
+ - [Citation](#citation)
29
+ - [Contact](#contact)
30
+
31
+
32
+ This model is a fine-tuned version of [codellama/CodeLlama-13b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-13b-Instruct-hf).
33
+
34
+ ## Introducing NexusRaven-13B
35
+ NexusRaven is an open-source and commercially viable function calling LLM that surpasses the state-of-the-art in function calling capabilities.
36
+
37
+ 📊 Performance Highlights: With our demonstration retrieval system, NexusRaven-13B achieves a 95% success rate in using cybersecurity tools such as CVE/CPE Search and VirusTotal, while prompting GPT-4 achieves 64%. It has significantly lower cost and faster inference speed compared to GPT-4.
38
+
39
+ 🔧 Generalization to the Unseen: NexusRaven-13B generalizes to tools never seen during model training, achieving a success rate comparable with GPT-3.5 in zero-shot setting, significantly outperforming all other open-source LLMs of similar sizes.
40
+
41
+ 🔥 Commercially Permissive: The training of NexusRaven-13B does not involve any data generated by proprietary LLMs such as GPT-4. You have full control of the model when deployed in commercial applications.
42
+
43
+ <p align="center" width="100%">
44
+ <a><img src="Single-Attempt_Function_Calling.png" alt="NexusRaven" style="width: 80%; min-width: 300px; display: block; margin: auto;"></a>
45
+ <a><img src="Zero-shot_Evaluation.png" alt="NexusRaven" style="width: 80%; min-width: 300px; display: block; margin: auto;"></a>
46
+ </p>
47
+
48
+
49
+ ## NexusRaven model usage
50
+ NexusRaven accepts a list of python functions. These python functions can do anything (including sending GET/POST requests to external APIs!). The two requirements include the python function signature and the appropriate docstring to generate the function call.
51
+
52
+ NexusRaven is highly compatible with langchain. See [langchain_example.py](https://huggingface.co/Nexusflow/NexusRaven-13B/blob/main/langchain_example.py). An example without langchain can be found in [non_langchain_example.py](https://huggingface.co/Nexusflow/NexusRaven-13B/blob/main/non_langchain_example.py).
53
+
54
+ Please note that the model will reflect on the answer sometimes, so we highly recommend stopping the model generation at a stopping criteria of `["\nReflection:"]`, to avoid spending unnecessary tokens during inference, but the reflection might help in some rare cases. This is reflected in our langchain example.
55
+
56
+ More information about how to prompt the model can be found in [prompting_readme.md](prompting_readme.md).
57
+
58
+ The "Initial Answer" can be executed to run the function.
59
+
60
+ ### Quickstart
61
+ You can run the model on a GPU using the following code.
62
+ ```python
63
+ # Please `pip install transformers accelerate`
64
+ from transformers import pipeline
65
+
66
+
67
+ pipeline = pipeline(
68
+ "text-generation",
69
+ model="Nexusflow/NexusRaven-13B",
70
+ torch_dtype="auto",
71
+ device_map="auto",
72
+ )
73
+
74
+ prompt_template = """
75
+ <human>:
76
+ OPTION:
77
+ <func_start>def hello_world(n : int)<func_end>
78
+ <docstring_start>
79
+ \"\"\"
80
+ Prints hello world to the user.
81
+
82
+ Args:
83
+ n (int) : Number of times to print hello world.
84
+ \"\"\"
85
+ <docstring_end>
86
+
87
+ OPTION:
88
+ <func_start>def hello_universe(n : int)<func_end>
89
+ <docstring_start>
90
+ \"\"\"
91
+ Prints hello universe to the user.
92
+
93
+ Args:
94
+ n (int) : Number of times to print hello universe.
95
+ \"\"\"
96
+ <docstring_end>
97
+
98
+ User Query: Question: {question}
99
+
100
+ Please pick a function from the above options that best answers the user query and fill in the appropriate arguments.<human_end>
101
+ """
102
+ prompt = prompt_template.format(question="Please print hello world 10 times.")
103
+
104
+ result = pipeline(prompt, max_new_tokens=100, return_full_text=False, do_sample=False)[0]["generated_text"]
105
+
106
+ # Get the "Initial Call" only
107
+ start_str = "Initial Answer: "
108
+ end_str = "\nReflection: "
109
+ start_idx = result.find(start_str) + len(start_str)
110
+ end_idx = result.find(end_str)
111
+ function_call = result[start_idx: end_idx]
112
+
113
+ print (f"Generated Call: {function_call}")
114
+ ```
115
+ This will output:
116
+ ```text
117
+ Generated Call: hello_world(10)
118
+ ```
119
+ Which can be executed.
120
+
121
+
122
+ ## Training procedure
123
+
124
+ ### Training hyperparameters
125
+
126
+ The following hyperparameters were used during training:
127
+ - learning_rate: 3e-05
128
+ - train_batch_size: 1
129
+ - eval_batch_size: 1
130
+ - seed: 42
131
+ - distributed_type: multi-GPU
132
+ - num_devices: 8
133
+ - gradient_accumulation_steps: 16
134
+ - total_train_batch_size: 128
135
+ - total_eval_batch_size: 8
136
+ - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
137
+ - lr_scheduler_type: constant
138
+ - num_epochs: 2.0
139
+
140
+
141
+ ### Framework versions
142
+
143
+ - Transformers 4.33.2
144
+ - Pytorch 2.0.1+cu118
145
+ - Datasets 2.14.5
146
+ - Tokenizers 0.13.3
147
+
148
+
149
+ # Limitations
150
+ 1. We highly recommend using a stop criteria of `["\nReflection:"]`. The model was trained to first generate an answer and then reflect on its answer to either improve the answer or keep the answer the same. However, this "chain of thought" is often not helpful, and the final answer is seldom better than the initial call. Therefore, we strongly recommend using the Initial Call as the main call to execute.
151
+ 2. The model works best when it is connected with a retriever when there are a multitude of functions, as a large number of functions will saturate the context window of this model.
152
+ 3. The model can be prone to generate incorrect calls. Please ensure proper guardrails to capture errant behavior is in place.
153
+
154
+
155
+ ## License
156
+ This model was trained on commercially viable data and is licensed under the [Llama 2 community license](https://huggingface.co/codellama/CodeLlama-13b-hf/blob/main/LICENSE) following the original [CodeLlama-13b-hf](https://huggingface.co/codellama/CodeLlama-13b-hf/) model.
157
+
158
+
159
+ ## References
160
+ We thank the CodeLlama team for their amazing models!
161
+
162
+ ```
163
+ @misc{rozière2023code,
164
+ title={Code Llama: Open Foundation Models for Code},
165
+ author={Baptiste Rozière and Jonas Gehring and Fabian Gloeckle and Sten Sootla and Itai Gat and Xiaoqing Ellen Tan and Yossi Adi and Jingyu Liu and Tal Remez and Jérémy Rapin and Artyom Kozhevnikov and Ivan Evtimov and Joanna Bitton and Manish Bhatt and Cristian Canton Ferrer and Aaron Grattafiori and Wenhan Xiong and Alexandre Défossez and Jade Copet and Faisal Azhar and Hugo Touvron and Louis Martin and Nicolas Usunier and Thomas Scialom and Gabriel Synnaeve},
166
+ year={2023},
167
+ eprint={2308.12950},
168
+ archivePrefix={arXiv},
169
+ primaryClass={cs.CL}
170
+ }
171
+ ```
172
+
173
+
174
+ ## Citation
175
+ ```
176
+ @misc{nexusraven,
177
+ title={NexusRaven: Surpassing the state-of-the-art in open-source function calling LLMs},
178
+ author={Nexusflow.ai team},
179
+ year={2023},
180
+ url={http://nexusflow.ai/blog}
181
+ }
182
+ ```
183
+
184
+ ## Contact
185
+ Please reach out to [email protected] for any questions!