File size: 1,819 Bytes
22e8b31
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
"""Guide content for the Quizbowl platform."""

GUIDE_MARKDOWN = """
# 🎯 Quizbowl Bot Guide

## Quick Start
1. Choose between Tossup or Bonus mode
2. Design your pipeline
3. Test on example questions
4. Submit for evaluation

## Competition Rules

### 🧠 Tossup Questions
- **Format**: Individual questions with progressive difficulty
- **Scoring**:
  - Correct early buzz: +10 points
  - Incorrect early buzz: -5 points
  - Correct after full read: +10 points
- **Required Outputs**:
  - `answer`: Your predicted answer
  - `confidence`: Score between 0-1
  - Buzz threshold: When to attempt answering

### 🎁 Bonus Questions
- **Format**: Three-part questions (10 points each)
- **Scoring**: +10 points per correct part (max 30)
- **Required Outputs**:
  - `answer`: Your predicted answer
  - `confidence`: Score between 0-1
  - `explanation`: Brief justification for human collaboration

## Building Your First Pipeline

### 1. Simple Pipeline (Recommended for First Submission)
- Single model step
- Configure:
  - Model selection
  - Temperature (0.0-1.0)
  - System prompt
  - Required outputs

### 2. Testing Your Pipeline
1. Select an example question
2. For Tossup:
   - Set buzz threshold (0.5-1.0)
   - Enable early stopping
3. Run and check:
   - Answer accuracy
   - Confidence scores
   - Performance metrics

### 3. Evaluation
- Test on multiple questions
- Monitor:
  - Accuracy
  - Confidence patterns
  - Response times

### 4. Submission
1. Log in
2. Name your model
3. Add description
4. Submit for evaluation

## Tips for Success
- Start with simple pipeline
- Test thoroughly before submission
- Use appropriate temperature (0.3-0.7 recommended)
- Monitor confidence scores
- Check example submissions

## Need Help?
- Review example submissions
- Check documentation
- Contact support
"""