Ayushs799 commited on
Commit
1f2dcf6
1 Parent(s): 735454b

Upload Introduction.md

Browse files
Files changed (1) hide show
  1. Introduction.md +25 -5
Introduction.md CHANGED
@@ -9,15 +9,37 @@ Conformal prediction is a technique for quantifying such uncertainties for AI sy
9
 
10
  ---
11
 
12
- #### Theory
13
 
14
  ### 1. Prediction Regions
15
 
16
- Prediction regions represent intervals that contain the true value of the prediction with a certain confidence level. In regression, this is often expressed as a prediction interval. Let's denote the prediction region as \[$a$, $b$\], where $a$ and $b$ are the lower and upper bounds, respectively. The confidence level is denoted by $\alpha$. In classification, the prediction region is a set of classes that's above a certain threshold. The threshold is calculated by $\alpha$.
 
 
 
 
 
 
 
 
 
 
 
 
17
 
18
  ### 2. Validity
19
 
20
- A conformal predictor is considered valid if the true value falls within the predicted region with the specified confidence level over repeated experiments. Mathematically, for a given prediction $\hat{y}$ and a true outcome $y$, the validity condition is given by $P(y \in [a, b]) \geq 1 - \alpha$. In classification, the validity condition is given by $P(y \in \hat{C}) \geq 1 - \alpha$, where $\hat{C}$ is the predicted set of classes.
 
 
 
 
 
 
 
 
 
 
21
 
22
  ### 3. Inductive Conformal Prediction
23
 
@@ -36,5 +58,3 @@ Inductive Conformal Prediction is characterized by its adaptability to the data
36
  - validity score $s_i$ by counting the number of times the true value $y_i$ falls within the prediction region $R_i$ over repeated experiments.
37
  - p-value $p_i$ by dividing the validity score $s_i$ by the number of repeated experiments.
38
  - prediction region $R_i$ using the top $k$ errors and the p-value $p_i$.
39
-
40
-
 
9
 
10
  ---
11
 
12
+ # Theory
13
 
14
  ### 1. Prediction Regions
15
 
16
+ Prediction regions in conformal prediction are intervals that provide a range of possible values for the prediction. For a regression task, this is often referred to as a prediction interval. Let's denote the prediction region as $[a, b]$, where $a$ and $b$ represent the lower and upper bounds, respectively. The confidence level is denoted by $\alpha$. The prediction region is constructed in such a way that it contains the true value with a probability of at least $(1 - \alpha)$.
17
+
18
+ Mathematically, for a prediction $\hat{y}$, the prediction region is defined as:
19
+
20
+ $$ P(a \leq y \leq b) \geq 1 - \alpha $$
21
+
22
+ This ensures that the true value $y$ falls within the predicted interval with a confidence level of at least $(1 - \alpha)$.
23
+
24
+ For a classification task, the prediction region is a set of classes that's above a certain threshold. The threshold is calculated by $\alpha$. Mathematically, for a prediction $\hat{C}$, the prediction region is defined as:
25
+
26
+ $$ P(y \in \hat{C}) \geq 1 - \alpha $$
27
+
28
+ This ensures that the true value $y$ falls within the predicted set of classes with a confidence level of at least $(1 - \alpha)$.
29
 
30
  ### 2. Validity
31
 
32
+ The validity of a conformal predictor is a crucial aspect. It ensures that, over repeated experiments, the true value falls within the predicted region with the specified confidence level. Mathematically, for a given prediction $\hat{y}$ and a true outcome $y$, the validity condition is expressed as:
33
+
34
+ $$ P(y \in [a, b]) \geq 1 - \alpha $$
35
+
36
+ This means that the probability of the true value $y$ lying within the predicted interval $[a, b]$ is greater than or equal to $(1 - \alpha)$.
37
+
38
+ For a classification task, the validity condition is expressed as:
39
+
40
+ $$ P(y \in \hat{C}) \geq 1 - \alpha $$
41
+
42
+ This means that the probability of the true value $y$ lying within the predicted set of classes $\hat{C}$ is greater than or equal to $(1 - \alpha)$.
43
 
44
  ### 3. Inductive Conformal Prediction
45
 
 
58
  - validity score $s_i$ by counting the number of times the true value $y_i$ falls within the prediction region $R_i$ over repeated experiments.
59
  - p-value $p_i$ by dividing the validity score $s_i$ by the number of repeated experiments.
60
  - prediction region $R_i$ using the top $k$ errors and the p-value $p_i$.