Follow the links below to get started using hbx/Mistral-Interact with libraries, inference providers, notebooks, and local apps.
- Libraries
- Transformers
How to use hbx/Mistral-Interact with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="hbx/Mistral-Interact")
```

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("hbx/Mistral-Interact")
model = AutoModelForCausalLM.from_pretrained("hbx/Mistral-Interact")
```

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use hbx/Mistral-Interact with vLLM:
Install from pip and serve model
```shell
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "hbx/Mistral-Interact"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "hbx/Mistral-Interact",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```

Use Docker
```shell
docker model run hf.co/hbx/Mistral-Interact
```
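Once the vLLM server is running, it can also be called from Python instead of curl. A minimal sketch using only the standard library, mirroring the fields of the curl request above (the base URL and sampling parameters are the example defaults, not requirements):

```python
import json
import urllib.request


def build_payload(prompt, model="hbx/Mistral-Interact", max_tokens=512, temperature=0.5):
    # Mirror the JSON body from the curl example above
    return {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def complete(prompt, base_url="http://localhost:8000"):
    # POST to the OpenAI-compatible /v1/completions endpoint
    req = urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["text"]


if __name__ == "__main__":
    print(complete("Once upon a time,"))
```

The same client works against the SGLang server below by changing `base_url` to port 30000, since both expose the OpenAI-compatible completions API.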
- SGLang
How to use hbx/Mistral-Interact with SGLang:
Install from pip and serve model
```shell
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "hbx/Mistral-Interact" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "hbx/Mistral-Interact",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```

Use Docker images
```shell
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "hbx/Mistral-Interact" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "hbx/Mistral-Interact",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```

- Docker Model Runner
How to use hbx/Mistral-Interact with Docker Model Runner:
```shell
docker model run hf.co/hbx/Mistral-Interact
Model Card for Mistral-Interact
- Base IN3: https://huggingface.co/datasets/hbx/IN3
- IN3-interaction: https://huggingface.co/datasets/hbx/IN3-interaction
- Paper: https://arxiv.org/abs/2402.09205
- Model: https://huggingface.co/hbx/Mistral-Interact
- Repo: https://github.com/HBX-hbx/Mistral-Interact
Using the constructed interaction data, we adapt Mistral-7B into Mistral-Interact, a powerful and robust variant of Mistral that can judge the vagueness of a user instruction, actively query for missing details with suggestions, and explicitly summarize the detailed, clarified user intention. It has the following features:
- Better understanding of user judgments: Among all the open-source models, Mistral-Interact is the best at predicting task vagueness and missing details that users regard as necessary.
- Comprehensive summarization of user intentions: Mistral-Interact is effective in making an explicit and comprehensive summary based on detailed user intentions.
- Enhanced model-user interaction experience: Mistral-Interact inquires about missing details in vague tasks more reasonably and in a friendlier manner than other open-source models, promoting a clearer understanding of the user's implicit intentions.
- Comparable performance with closed-source GPT-4: We show that smaller-scale expert models can approach or even exceed general-purpose large-scale models across various aspects, including vagueness judgment, comprehensiveness of summaries, and friendliness of interaction.
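The interaction pattern described above, judging vagueness, querying for missing details, then summarizing, can be sketched as a simple loop. This is an illustrative sketch only: the `"Summary:"` prefix and plain-text transcript are assumptions for demonstration, not the actual prompt format, which is defined in the Mistral-Interact repo. `generate` is any text-generation callable, such as the Transformers pipeline shown earlier, and `ask_user` collects the user's answer to each clarifying question:

```python
def interact(task, generate, ask_user, max_turns=5):
    # Clarify a vague task by querying the user until the model
    # emits an explicit intention summary (illustrative convention:
    # final answers are assumed to start with "Summary:").
    transcript = f"Task: {task}"
    for _ in range(max_turns):
        reply = generate(transcript)
        if reply.startswith("Summary:"):
            return reply
        # Otherwise treat the reply as a clarifying question for the user
        answer = ask_user(reply)
        transcript += f"\n{reply}\n{answer}"
    # Fall back to forcing a summary after max_turns rounds
    return generate(transcript + "\nPlease summarize the user intention.")


# Example with a stubbed model and user:
replies = iter([
    "Which cuisine do you prefer?",
    "Summary: user wants Italian dinner recipes.",
])
result = interact(
    "Find me some recipes.",
    generate=lambda _: next(replies),
    ask_user=lambda question: "Italian, please.",
)
print(result)  # Summary: user wants Italian dinner recipes.
```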
We use the ModelCenter framework to conduct full-parameter fine-tuning of Mistral-7B-v0.1 on the Intention-in-Interaction (IN3) dataset, using two 80GB A800 GPUs. For full details and usage of this model, please see our paper and repo.
Citation
Feel free to cite our paper if you find it useful.
```bibtex
@article{cheng2024tell,
  title={Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents},
  author={Cheng Qian and Bingxiang He and Zhong Zhuang and Jia Deng and Yujia Qin and Xin Cong and Zhong Zhang and Jie Zhou and Yankai Lin and Zhiyuan Liu and Maosong Sun},
  journal={arXiv preprint arXiv:2402.09205},
  year={2024}
}
```