samagra14wefi
/

PreferED

Text Classification

Model card Files Files and versions Community

samagra14wefi commited on Oct 3, 2023

Commit

0f685c5

•

1 Parent(s): 9696018

Replace asserts with print

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -55,13 +55,13 @@ context_string = '''India won the world cup in 1983 and 2011. Australia won the
  West Indies have won the world cup twice. Sri Lanka, Pakistan and England have won the world cup once.
  Evaluate if the facts below are consistent with the statement.'''
-response_string_one = '''India has won the world cup most number of times.'''
-response_string_two = '''Australia has won the world cup most number of times.'''
-score_one = calc_score(context_string, response_string_one)
-score_two = calc_score(context_string, response_string_two)
-assert score_two > score_one
 ```
 ### Evaluating Response relevance
@@ -75,7 +75,7 @@ response_bad = "We offer a variety of fresh produce including apples, oranges, a
 score_good = calc_score(inquiry, response_good)
 score_bad = calc_score(inquiry, response_bad)
-assert score_good > score_bad
 ```
@@ -89,7 +89,7 @@ response_offensive = "The politicians in Country X are all corrupt and stupid."
 score_clean = calc_score(context, response_clean)
 score_offensive = calc_score(context, response_offensive)
-assert score_clean > score_offensive
 ```
 ### Comparing Different Language Models
@@ -102,7 +102,7 @@ response_bert = "Photosynthesis is a method that converts carbon dioxide into or
 score_gpt3 = calc_score(context, response_gpt3)
 score_bert = calc_score(context, response_bert)
-assert score_gpt3 > score_bert
 ```
 ## Finetuning on your production data

  West Indies have won the world cup twice. Sri Lanka, Pakistan and England have won the world cup once.
  Evaluate if the facts below are consistent with the statement.'''
+response_string_wrong = '''India has won the world cup most number of times.'''
+response_string_correct = '''Australia has won the world cup most number of times.'''
+score_wrong = calc_score(context_string, response_string_one)
+score_correct = calc_score(context_string, response_string_two)
+print(score_wrong > score_correct)
 ```
 ### Evaluating Response relevance
 score_good = calc_score(inquiry, response_good)
 score_bad = calc_score(inquiry, response_bad)
+print(score_good > score_bad)
 ```
 score_clean = calc_score(context, response_clean)
 score_offensive = calc_score(context, response_offensive)
+print(score_clean > score_offensive)
 ```
 ### Comparing Different Language Models
 score_gpt3 = calc_score(context, response_gpt3)
 score_bert = calc_score(context, response_bert)
+print(score_gpt3 > score_bert)
 ```
 ## Finetuning on your production data