aweber committed on
Commit
192e8d7
·
verified ·
1 Parent(s): de3b7e9

Update README.md

Files changed (1)
  1. README.md +11 -23
README.md CHANGED
@@ -8,45 +8,39 @@ pinned: false
 ---
 
 
-# Random Baseline Model for Climate Disinformation Classification
+# Random Forest Model for Climate Disinformation Classification
 
 ## Model Description
 
-This is a random baseline model for the Frugal AI Challenge 2024, specifically for the text classification task of identifying climate disinformation. The model serves as a performance floor, randomly assigning labels to text inputs without any learning.
+This is a random forest model for the Frugal AI Challenge 2024, specifically for the audio classification task of identifying illegal deforestation.
 
 ### Intended Use
 
-- **Primary intended uses**: Baseline comparison for climate disinformation classification models
+- **Primary intended uses**: Illegal deforestation classification model
 - **Primary intended users**: Researchers and developers participating in the Frugal AI Challenge
 - **Out-of-scope use cases**: Not intended for production use or real-world classification tasks
 
 ## Training Data
 
-The model uses the QuotaClimat/frugalaichallenge-text-train dataset:
-- Size: ~6000 examples
+The model uses the rfcx/frugalai dataset:
+- Size: ~50000 examples
 - Split: 80% train, 20% test
-- 8 categories of climate disinformation claims
+- 2 categories of audio category
 
 ### Labels
-0. No relevant claim detected
-1. Global warming is not happening
-2. Not caused by humans
-3. Not bad or beneficial
-4. Solutions harmful/unnecessary
-5. Science is unreliable
-6. Proponents are biased
-7. Fossil fuels are needed
+0. chainsaw (positively identifying a chainsaw)
+1. environment (not containing a chainsaw).
 
 ## Performance
 
 ### Metrics
-- **Accuracy**: ~12.5% (random chance with 8 classes)
+- **Accuracy**: ~89.3%
 - **Environmental Impact**:
 - Emissions tracked in gCO2eq
 - Energy consumption tracked in Wh
 
 ### Model Architecture
-The model implements a random choice between the 8 possible labels, serving as the simplest possible baseline.
+The model implements a random forest model used on pre-processed data. The pre-processing consists in a resampling, a Fourier decomposition and a standard scaler.
 
 ## Environmental Impact
 
@@ -57,15 +51,9 @@ Environmental impact is tracked using CodeCarbon, measuring:
 This tracking helps establish a baseline for the environmental impact of model deployment and inference.
 
 ## Limitations
-- Makes completely random predictions
-- No learning or pattern recognition
-- No consideration of input text
-- Serves only as a baseline reference
-- Not suitable for any real-world applications
+- Takes some time to do the pre-processing.
 
 ## Ethical Considerations
 
-- Dataset contains sensitive topics related to climate disinformation
-- Model makes random predictions and should not be used for actual classification
 - Environmental impact is tracked to promote awareness of AI's carbon footprint
 ```
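
The updated Model Architecture section describes resampling, a Fourier decomposition, and a standard scaler feeding a random forest. The commit does not include the code, so the following is only a minimal sketch of one way those steps might fit together, not the author's implementation. It assumes `scipy.signal.resample` for the resampling step and real-FFT magnitudes for the Fourier step. The names `extract_features`, `make_synthetic_clips`, and `TARGET_LEN` are hypothetical, and the clips are synthetic toy signals rather than the rfcx/frugalai audio.

```python
import numpy as np
from scipy.signal import resample
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

TARGET_LEN = 1024  # hypothetical fixed clip length after resampling


def extract_features(clips, target_len=TARGET_LEN):
    """Resample each clip to a fixed length and return its FFT magnitudes."""
    features = []
    for clip in clips:
        fixed = resample(np.asarray(clip, dtype=float), target_len)
        features.append(np.abs(np.fft.rfft(fixed)))
    return np.vstack(features)


def make_synthetic_clips(n_clips, rng):
    """Toy stand-in for real audio (an assumption, not the challenge data).

    Label 0 ("chainsaw") is loud noise plus a tone; label 1 ("environment")
    is quiet noise only.
    """
    clips, labels = [], []
    for i in range(n_clips):
        label = i % 2
        length = int(rng.integers(1500, 2500))  # clips of varying length
        t = np.arange(length) / length
        if label == 0:
            signal = rng.normal(scale=2.0, size=length)
            signal += 2.0 * np.sin(2 * np.pi * 50 * t)
        else:
            signal = rng.normal(scale=0.5, size=length)
        clips.append(signal)
        labels.append(label)
    return clips, np.array(labels)


rng = np.random.default_rng(0)
clips, y = make_synthetic_clips(200, rng)
X = extract_features(clips)

# 80% train, 20% test, matching the split stated in the README.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)

# Standard scaling followed by a random forest classifier.
model = make_pipeline(
    StandardScaler(),
    RandomForestClassifier(n_estimators=50, random_state=0),
)
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)
print(f"Test accuracy on synthetic clips: {accuracy:.3f}")
```

Putting the scaler inside the pipeline means it is fitted on the training split only, which avoids leaking test-set statistics into the model. The accuracy printed here reflects the toy data and says nothing about the ~89.3% reported in the README.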