6cf commited on
Commit
aad629a
Β·
verified Β·
1 Parent(s): f80ef4b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +53 -7
README.md CHANGED
@@ -21,25 +21,71 @@ tags:
21
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6205fefd3f1dc8a642d70b10/JEZgA_xV6oF8AIsya9dop.jpeg)
22
 
23
 
24
- # Model Card for Model ID
25
 
26
- Comming Soon
 
27
 
 
 
 
 
 
28
 
29
- <!-- Provide a quick summary of what the model is/does. -->
 
 
 
 
30
 
 
31
 
32
- ## Model Details
 
33
 
 
34
 
35
- ## Training Details
 
 
 
 
 
36
 
37
- ### Training Data
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
38
 
39
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
40
 
41
 
42
- # πŸ€–πŸ’‘ LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
 
 
43
 
44
 
45
  ### Dataset
 
21
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6205fefd3f1dc8a642d70b10/JEZgA_xV6oF8AIsya9dop.jpeg)
22
 
23
 
24
+ # IdeaWhiz Model Card 🧠
25
 
26
+ ## Model Summary πŸ”¬
27
+ IdeaWhiz is a fine-tuned version of QwQ-32B-Preview, specifically optimized for scientific creativity and step-by-step reasoning. The model leverages the LiveIdeaBench dataset to enhance its capabilities in generating novel scientific ideas and hypotheses.
28
 
29
+ ## Key Features 🌟
30
+ - Base Model: QwQ-32B-Preview πŸ”‹
31
+ - Training Dataset: LiveIdeaBench πŸ“Š
32
+ - Main Focus: Scientific creativity and idea generation πŸ’‘
33
+ - Reasoning Style: o1-style step-by-step reasoning ⚑
34
 
35
+ ## Intended Use 🎯
36
+ - Scientific hypothesis generation πŸ§ͺ
37
+ - Creative problem-solving in research πŸ”
38
+ - Step-by-step scientific reasoning πŸ“
39
+ - Research direction brainstorming 🌱
40
 
41
+ ## Quickstart πŸš€
42
 
43
+ ```python
44
+ from transformers import AutoModelForCausalLM, AutoTokenizer
45
 
46
+ model_name = "6cf/QwQ-32B-Preview-IdeaWhiz-v1"
47
 
48
+ model = AutoModelForCausalLM.from_pretrained(
49
+ model_name,
50
+ torch_dtype="auto",
51
+ device_map="auto"
52
+ )
53
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
54
 
55
+
56
+
57
+ prompt = """I'll be submitting your next responses to a "Good Scientific Idea" expert review panel. If they consider your idea to be a good one, you'll receive a reward. Your assigned keyword is: "cancer". You may provide background information. The idea MUST be within 100 words (including background information). (Note: good scientific ideas should be novel, verifiable, practically valuable, and able to advance the field.). NOTE: You MUST give your answer after **Final Idea:**
58
+ """
59
+
60
+
61
+ messages = [
62
+ {"role": "system", "content": "You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step."},
63
+ {"role": "user", "content": prompt}
64
+ ]
65
+ text = tokenizer.apply_chat_template(
66
+ messages,
67
+ tokenize=False,
68
+ add_generation_prompt=True
69
+ )
70
+ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
71
+
72
+ generated_ids = model.generate(
73
+ **model_inputs,
74
+ max_new_tokens=4096
75
+ )
76
+ generated_ids = [
77
+ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
78
+ ]
79
+
80
+ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
81
+ ```
82
 
83
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
84
 
85
 
86
+ # Traning Dataset
87
+
88
+ ## πŸ€–πŸ’‘ LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
89
 
90
 
91
  ### Dataset